With MetaModel, you use a type-safe SQL-like API for querying any datastore. It is a data access framework providing a common interface for exploration and querying of different types of datastores. It isn't a data mapping framework. Instead, it emphasizes abstraction of metadata and the ability to add data sources at runtime, making MetaModel great for generic data processing applications, but less so for applications modeled around a particular domain.
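The idea of one query interface over several kinds of datastores, with sources registered at runtime and columns discovered from metadata, can be sketched in a few lines. This is only an illustrative sketch in Python and does not use MetaModel's actual API: the names DataContext, CsvContext, MemoryContext, and select are hypothetical, as is the sample data.

```python
import csv
import io
from typing import Dict, Iterable, List, Protocol


class DataContext(Protocol):
    """Hypothetical common interface (an assumption, not MetaModel's API)."""

    def columns(self, table: str) -> List[str]: ...

    def rows(self, table: str) -> Iterable[Dict[str, str]]: ...


class CsvContext:
    """Exposes one CSV text as a single named table."""

    def __init__(self, table: str, text: str) -> None:
        self._table = table
        self._text = text

    def _check(self, table: str) -> None:
        if table != self._table:
            raise KeyError(table)

    def columns(self, table: str) -> List[str]:
        self._check(table)
        # Metadata is read from the header row, not declared up front.
        return next(csv.reader(io.StringIO(self._text)))

    def rows(self, table: str) -> Iterable[Dict[str, str]]:
        self._check(table)
        return csv.DictReader(io.StringIO(self._text))


class MemoryContext:
    """Exposes in-memory lists of dicts as tables."""

    def __init__(self, tables: Dict[str, List[Dict[str, str]]]) -> None:
        self._tables = tables

    def columns(self, table: str) -> List[str]:
        rows = self._tables[table]
        return list(rows[0].keys()) if rows else []

    def rows(self, table: str) -> Iterable[Dict[str, str]]:
        return iter(self._tables[table])


def select(ctx: DataContext, table: str, column: str,
           where_column: str, equals: str) -> List[str]:
    """Run the same simple query against any context."""
    available = ctx.columns(table)
    for name in (column, where_column):
        if name not in available:
            raise KeyError(name)
    return [row[column] for row in ctx.rows(table)
            if row[where_column] == equals]


# Sources are added at runtime; the query code does not change.
sources: Dict[str, DataContext] = {}
sources["csv"] = CsvContext("people", "name,city\nAda,London\nLinus,Helsinki\n")
sources["mem"] = MemoryContext({"people": [{"name": "Grace", "city": "London"}]})

results = {name: select(ctx, "people", "name", "city", "London")
           for name, ctx in sources.items()}
```

The query function depends only on the shared interface and on column names discovered at runtime, which is why this style suits generic data processing better than domain-specific object mapping.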
Nutch is highly scalable Web search software built on top of Apache Hadoop and Lucene Java. Key features include a Web crawler, an indexer, crawl management tools, parsers for HTML, PDF, DOC, and several other document formats, and an extensible architecture that lets you plug in additional functionality such as document parsers, custom scoring algorithms, protocols, and more.
The Apache ODF Toolkit is a set of Java modules that allow programmatic creation, scanning, and manipulation of Open Document Format (ODF, ISO/IEC 26300) documents. Unlike other approaches that rely on runtime manipulation of heavy-weight editors via an automation interface, the ODF Toolkit is lightweight and ideal for server use.
Apache OpenMeetings is a multi-language customizable video-conferencing and collaboration system. It supports audio/video and allows you to see the desktop of any participant. It includes a whiteboard, the ability to import a variety of image formats, invitations, a moderation system, backup and language modules, private and public conference rooms, and the ability to record meetings. The project also provides integrations for Asterisk, Jira, Confluence, Moodle, Sakai, and SugarCRM, and connectors for LDAP and ADS.
Apache OpenNLP is a machine-learning-based toolkit for processing natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services.
Apache PhotArk is a photo gallery application including a content repository for the images, a display piece, an access control layer, and upload capabilities. The idea is to have a rigid design for the content repository with a very flexible display piece. The images in the content repository will be protected with granular access control.
The mission of the Apache Portable Runtime (APR) project is to create and maintain software libraries that provide a predictable and consistent interface to underlying platform-specific implementations. The primary goal is to provide an API to which software developers may code and be assured of predictable if not identical behaviour regardless of the platform on which their software is built, relieving them of the need to code special-case conditions to work around or take advantage of platform-specific deficiencies or features.