Libxslt is a C library for GNOME which allows developers to work with XSLT. It is based on libxml for XML parsing, tree manipulation, and XPath support. Also included is 'xsltproc', a command line XSLT processor. The library is written in plain C, making as few assumptions as possible, and sticking closely to ANSI C/POSIX for easy embedding. It should work on Linux, Unix, and Windows. Though not designed primarily with performances in mind, libxslt seems to be a relatively fast processor. It also include full support for the EXSLT set of extension functions as well as some common extensions present in other XSLT engines.
The Functional XML Parsing Framework is a package of low-to-high-level lexing and parsing procedures that can be combined to yield a SAX, DOM, validating parsers, or a parser intended for a particular document type. The procedures in the package can be used separately to tokenize or parse various pieces of XML documents. The package supports XML namespaces, character, internal, and external parsed entities, xml:space, attribute value normalization, processing instructions and CDATA sections. It is intended to be a framework, a set of "Lego blocks" you can use to build a parser that follows DOM, SAX, or another discipline, and performs validation to any degree. As an example of such parser construction, the package includes a semi-validating SXML parser. It converts XML to SXML, an instance of XML Infoset as S-expressions, an abstract syntax tree of an XML document. SXML can be queried (in a XPath style), transformed, and evaluated. The framework parses XML in a pure functional style, as folding over a text XML document considered a spread-out tree. The input port is treated as a linear, read-once parameter. The framework's code does not use assignments at all.
rtf2latex2e is an utility to convert RTF files from popular word processors to the LaTeX2e format for those who value quality typesetting. rtf2latex2e uses the RTF reader package by Paul DuBois to parse the RTF code and generates an appropriate LaTeX file. The main features of rtf2latex2e are: detects text style: bold, italic, color, big, small,... reads embedded figures: PICT, WMF, PNG, JPEG,... reads tables: simple to semi- complex equations: reads out Equation Editor equations (courtesy Steve Swanson, http://www.mackichan.com) symbols: converts most greek and math symbols reads footnotes (not in tables yet), and converts hypertext links using the hyperref package.
Linguaphile is a simple command line language translator. It is open source, platform independent, and programmed in Perl. Linguaphile currently supports the following languages: Afrikaans, Alawa, Albanian, Arrernte, Basque, Belarusian, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, German, Greek, Hawaiian, Hungarian, Icelandic, Indonesian, Interlingua, Irish, Italian, Kala Lagaw Ya, Korean, Kriol, Latvian, Lithuanian, Malay, Maltese, Maori, Norwegian, Pitjantjatjara, Polish, Portuguese, Romanian, Russian, Samoan, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Thai, Tok Pisin, Turkish, Ukrainian, Warlpiri, and Welsh. The Spanish to English translation is the most useful at this stage.
Xerces2 is the next generation Apache Xerces-J XML parser. This version of Xerces-J defines the Xerces Native Interface (XNI), and provides a complete, standards compliant reference implementation using XNI. The parser was completely re-designed and re-implemented to be simpler and more modular. To help contribute to this project, visit the Apache Xerces Project Webpage (http://xerces.apache.org) and join the firstname.lastname@example.org mailing list.
Vilistextum is a small and fast HTML to text converter. It is quite fault-tolerant and deals well with badly-formed or otherwise quirky HTML. It has full support for different character sets (e.g. Unicode). It is able to optimize for ebook reading, collapse multiple blank lines, and create footnotes out of links. A GUI frontend using kaptain is included.