Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognized or invalid HTML. It also provides high-level HTML form manipulation functions.
|Tags||Text Processing Markup HTML/XHTML Software Development Libraries Java Libraries Internet Web Dynamic Content|
|Operating Systems||OS Independent|
Release Notes: This release includes important bugfixes and various enhancements.
Release Notes: This version includes important bugfixes and various enhancements including HTML5 support.
Release Notes: Important bugfixes and a new stream-based parsing option allowing memory efficient processing of large files.
Release Notes: This version is a major new release that requires the Java 5 runtime or later. It introduces major API changes such as generics and enums, as well as some new features.
Release Notes: This version includes important bugfixes and the following enhancements. Non-server tags are no longer recognized inside server tags. Microsoft downlevel-revealed conditional comments are recognized. All unnecessary white space may be removed from a source document. Various other enhancements were made to existing features.