Release Notes: This release includes important bugfixes and various enhancements.
Release Notes: This version includes important bugfixes and various enhancements including HTML5 support.
Release Notes: Important bugfixes and a new stream-based parsing option allowing memory efficient processing of large files.
Release Notes: This version is a major new release that requires the Java 5 runtime or later. It introduces major API changes such as generics and enums, as well as some new features.
Release Notes: This version includes important bugfixes and the following enhancements. Non-server tags are no longer recognized inside server tags. Microsoft downlevel-revealed conditional comments are recognized. All unnecessary white space may be removed from a source document. Various other enhancements were made to existing features.
Release Notes: This version includes important bugfixes and introduces the following minor enhancements: elements inside SCRIPT elements are ignored. Encoding detection and analysis were improved. Parsing of attributes containing server tags was improved.
Release Notes: This version has been released under a dual licence system, allowing a choice between the Eclipse Public License (EPL) and the LGPL. It includes important bugfixes and introduces the following major features: simple rendering of HTML markup into text, integrated logging with various logging frameworks, and easier parsing of HTML tags containing server tags.
Release Notes: Version 2.3 includes important bug fixes as well as some minor improvements to source indenting functionality.
Release Notes: This release includes important bugfixes, and introduces the following major features: much more efficient parsing of the entire source document, the ability to indent the source document based on its element hierarchy, and the ability to navigate the document element hierarchy.
Release Notes: This version adds two main features. The first is the Segment.extractText() method, which extracts all of the text from a segment of the source document, removing all markup and collapsing whitespace. This is simple text extraction only and makes no attempt to render the markup. The second feature is the fact that new Source constructors automatically determine the encoding of an input stream. The new getEncoding() and getEncodingSpecificationInfo() methods allow the user to see which encoding was used and how it was determined.