Projects / jPDFText

jPDFText

jPDFText is a Java library to extract text from PDF documents. PDF documents can be processed to extract the textual content for archiving, storage, searching, or indexing. jPDFText is built on top of Qoppa's proprietary PDF technology, so there is no need for any third party software or drivers. Main Features: loading PDF documents from files, network drives, URLs, or input streams; extracting text; and extracting words as a vector of Strings. It is written entirely in Java, which allows your application to remain platform independent. There is no need to install or configure additional drivers or software when deploying.

Tags
Licenses
Operating Systems
Implementation
Translations
Screenshot

Project Spotlight

OpenStack4j

A Fluent OpenStack client API for Java.

Screenshot

Project Spotlight

TurnKey TWiki Appliance

A TWiki appliance that is easy to use and lightweight.