Internationalized Domain Names (IDNs).
IDNs use characters drawn from a large repertoire (Unicode), but IDNA
allows the non-ASCII characters to be represented using only the ASCII
characters already allowed in so-called host names today (letter-digit-
hypen, "/[A-Z0-9-]/i").
tools may provide very nice user interfaces, but tend to produce
horrible XML. If you have to read these ugly products, you are in for
pain. The purpose of this module (and the script "xmlrewrite" which is
part of this distribution) is to be able to rewrite XML messages and
Schema's into something maintainable.
The main difference between this module and other beautifiers is that
the clean-up is based on schema rules. For instance, it is permitted
to remove blanks around and inside integers, but not in strings.
Beautifiers which do not look into the schema have only limited
possibilities for cleanup, or may accidentally change the message
content.
WWW: http://search.cpan.org/dist/XML-Rewrite/
PR: ports/152658
Submitted by: Patrick Powell <papowell@astart.com>
editing a file in-place. Inspired by variations of Perl's -i option,
this module is intended for somewhat more structured and reusable
editing than command line Perl typically allows.
File::Inplace endeavors to guarantee file integrity; that is, either
all of the changes made will be saved to the file, or none will.
It also offers functionality such as backup creation, automatic
field splitting per-line, automatic chomping/unchomping, and aborting
edits partially through without affecting the original file.
WWW: http://search.cpan.org/dist/File-Inplace/
- Add textproc/uim-kde4, KDE4 panel applet of uim input method. It is a
slave port of textproc/uim.
- Remove japanese/uim-canna and japanese/uim-prime. Now the canna and prime
modules are implemented as scheme scripts and they can be installed from
textproc/uim.
- Enable helper tools installation of textproc/uim-qt4.
- Enable sj3 support of textproc/uim.
full-featured text search engine written in Java.
CLucene is faster than lucene as it is written in C++.
WWW: http://sourceforge.net/projects/clucene/
PR: ports/151951
Submitted by: Tom Judge <tom at tomjudge.com>
to their Unicode characters. A function for decoding is provided.
The mapping can be generated from a DTD file with entity definitions.
WWW: http://search.cpan.org/dist/XML-Entities/
CSS::Simple is an interface through which to read/write/manipulate CSS
files while respecting the cascade order.
WWW: http://search.cpan.org/dist/CSS-Simple
PR: ports/151798
Submitted by: Vivek Khera <vivek@khera.org>
Approved by: pgollucci (mentor, implicit)
RT is a simple and human-readable table format.
RTtool is a converter from RT into various formats.
RT can be incorporated into RD.
At this time, RTtool can convert RT into HTML and plain text.
To convert into plain text, you need w3m.
WWW: http://www.rubyist.net/~rubikitch/computer/rttool/index.en.html
It inherits Text::CSV and is aware of input/output encodings.
WWW: http://search.cpan.org/dist/Text-CSV-Encoded/
PR: ports/150710
Submitted by: Sebastien Santoro <dereckson@gmail.com>
in ePub format.
* Free and open source software under GPLv3
* Multi-platform: runs on Windows, FreeBSD, Linux and Mac
* Full Unicode support: everything you see in Sigil is in UTF-16
* Full EPUB spec support
* WYSIWYG editing
* Multiple Views: Book View, Code View and Split View
* Metadata editor with full support for all possible metadata entries
(more than 200) with full descriptions for each
* Table Of Contents editor
* Multi-level TOC support
* Book View fully supports the display of any XHTML document possible
under the OPS spec
* SVG support
* Basic XPGT support
* Advanced automatic conversion of all imported documents to Unicode
* Currently imports TXT, HTML and EPUB files; more will be added with time
* Embedded HTML Tidy; all imported documents are thoroughly cleaned;
changing views cleans the document so no matter how much you screw
up your code, it will fix it (usually)
* An actually usable user interface
WWW: http://code.google.com/p/sigil/
PR: ports/150348
Submitted by: Jonathan Chen <jonc@chen.org.nz>
all macro processors. It is still maintained and ported to a large number
of systems.
WWW: http://www.ml1.org.uk
PR: ports/150234
Submitted by: Bob Eager <rde at ml1.org.uk>
journal, making a presentation, annotating a document - including pdf - or
collaborating using a stylus, mouse or keyboard. It is similar to Microsoft
Windows Journal and to the earlier Mimeo whiteboarding and Palm notepad
applications.
WWW: http://levine.sscnet.ucla.edu/general/software/tc1000/jarnal.htm
Protocol (XMPP) written in Erlang/OTP.
Main features:
- Based on Erlang message reception and pattern matching. The programming
style is close to Erlang approach and lead to very short pieces of code.
- Support for both formating of client and server packets.
- Based on Erlang atoms and binary to limit memory consumption. It can be
used to write a highly scalable XMPP proxy or XMPP server.
- SSL support.
- Several different XML parsers can be used (expat, libxml2).
WWW: https://support.process-one.net/doc/display/EXMPP/exmpp+home
PR: ports/147135
Submitted by: Maxim Ignatenko <gelraen.ua@gmail.com>
Technically, textproc/mini-xml was added to the tree first but it has been
outdated for a while now.
PR: ports/148775
Submitted by: Bapt <baptiste.daroussin@gmail.com>
Whatpm::HTML. Changes include:
* Provides an XML::LibXML-like DOM interface. If you usually use
XML::LibXML's DOM parser, this should be a drop-in solution for tag
soup HTML.
* Constructs an XML::LibXML::Document as the result of parsing.
* Via bundling and modifications, removed external dependencies
on non-CPAN packages.
WWW: http://search.cpan.org/dist/HTML-HTML5-Parser/
PR: ports/148308
Submitted by: Ju Pengfei <jupengfei@gmail.com>
Feature safe: yes
It's built on top of Nokogiri and libxml2, so it's fast and has a nice API.
Loofah excels at HTML sanitization (XSS prevention). It includes some nice HTML
sanitizers, which are based on HTML5lib's whitelist.
WWW: http://github.com/flavorjones/loofah
WWW: http://loofah.rubyforge.org/loofah/
PR: ports/147185
Submitted by: Eric Freeman <freebsdports at chillibear.com>
native formats and transforms them into a tree. Configuration changes are made
by manipulating this tree and saving it back into native config files.
WWW: http://augeas.net
PR: ports/146743
Submitted by: Russell Jackson <raj at csub.edu>
documents as trees of elements. This model may be familiar from many other
document systems, especially the HTML DOM. Pod::Elemental's document
object model is much less sophisticated than the HTML DOM, but still makes
a lot of document transformations easy.
WWW: http://search.cpan.org/dist/Pod-Elemental/
POD2::FR. These modules belong to the Italian and the French
translation projects of core Perl pods.
WWW: http://search.cpan.org/dist/POD2-Base/
PR: ports/146974
Submitted by: Ashish SHUKLA <wahjava@gmail.com>
in a manner similar to the way the PPI package parses Perl.
WWW: http://search.cpan.org/dist/PPIx-Regexp
PR: ports/146973
Submitted by: Ashish SHUKLA <wahjava@gmail.com>
introduces a more general approach for processing XML with Haskell. The
Haskell XML Toolbox uses a generic data model for representing XML
documents, including the DTD subset and the document subset, in Haskell.
It contains a validating XML parser, a HTML parser, namespace support,
an XPath expression evaluator, an XSLT library, a RelaxNG schema
validator and funtions for serialization and deserialization of user
defined data. The library make extensive use of the arrow approach for
processing XML.
WWW: http://www.fh-wedel.de/~si/HXmlToolbox/index.html
formats using a common MODS-format XML intermediate. For example, one
can convert RIS-format files to Bibtex by doing two transformations:
RIS->MODS->Bibtex. By using a common intermediate for N formats, only 2N
programs are required and not N^2-N. These programs operate on the
command line and are styled after standard UNIX-like filters.
WWW: http://www.scripps.edu/~cdputnam/software/bibutils/