jPDFViewer is a Java bean which embeds a PDF viewer in your Java applications and applets. It can read, display, and print PDF files, fill interactive PDF forms (acroforms, xfa forms), view all markup annotations, validate and display digital signatures, and perform text search, selection, and highlighting. It provides easy navigation with different views: thumbnails, bookmarks, annotations, etc. It has a customizable toolbar and user interface. It supports all image types, including JBIG2 and JPEG 2000, and all PDF font types (Types 0-3, OpenType, TrueType). It supports Acrobat PDF format 1.7, including layers, all PDF color spaces, including pattern and separation, and file attachments. There's no need to install or configure additional drivers or software when deploying.
jPDFText is a Java library to extract text from PDF documents. PDF documents can be processed to extract the textual content for archiving, storage, searching, or indexing. jPDFText is built on top of Qoppa's proprietary PDF technology, so there is no need for any third party software or drivers. Main Features: loading PDF documents from files, network drives, URLs, or input streams; extracting text; and extracting words as a vector of Strings. It is written entirely in Java, which allows your application to remain platform independent. There is no need to install or configure additional drivers or software when deploying.
DeltaQt is a cross-platform library of C++ classes and functions for parsing of DELTA (DEscription Language for TAxonomy) files, and is part of the Free DELTA initiative. Support for all major data-related DELTA directives is planned. Typesetting directives are beyond the scope of the project, and support for them is not planned. Support for RTF formatting within comments, notes, descriptions, etc. is planned. All data defined by supported DELTA directives (including comments and annotations) will be parsed into memory. Qt (core) is used extensively.
jWeb1T is an Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It is based on a binary search algorithm that finds the n-grams and returns their frequency counts in logarithmic time. As the corpus is stored in many files, a simple index is used to retrieve the files containing the n-grams.
WiredTiger is an extensible platform for data management. Its storage engine is optimized for high-throughput, big data applications. It can be configured for write-optimized (row-store) or read-optimized (column-store) access, as well as a hybrid of both. It separates the on-disk and in-memory representations of data, leading to a simpler, more compact file format and a large block I/O tailored for modern storage systems.
LibU is a multiplatform C library that comes under a BSD-style license. It includes many interdependent modules for accomplishing several tasks: memory allocation, networking and URI parsing, string manipulation, debugging, and logging in a very compact way, plus many other miscellaneous tasks. It has a small footprint (about 70KB for the default configuration), it is modular, and it has a multiplatform nature, making it an ideal candidate for embedded systems. KLone is an example of such usage.