JWKTL (Java-based Wiktionary Library) is an application programming interface for the free multilingual online dictionary Wiktionary. Wiktionary is collaboratively constructed by volunteers and continually growing. JWKTL enables efficient and structured access to the information encoded in the English, German, and Russian Wiktionary language editions, including sense definitions, part of speech tags, etymology, example sentences, translations, semantic relations, and many other lexical information types.
DKPro Lab is a lightweight framework for parameter sweeping experiments. It allows the user to set up experiments consisting of multiple interdependent tasks in a declarative manner with minimal overhead. Parameters are injected into tasks using via annotated class fields. Data produced by a task for any particular parameter configuration is stored and re-used whenever possible to avoid needless recalculation of results. Reports can be attached to each task to post-process the experimental results and present them in a convenient manner (e.g., as tables or charts).
Pixy is a declarative graph query language that works on any Blueprints-compatible graph database. The Blueprints API provides a common interface to over a dozen graph databases, including Neo4J, Titan, Infinite Graph, OrientDB, DEX, and Bitsy. Pixy enables complex pattern matching and logic programming on graph databases by translating PROLOG-style rules and goals to Gremlin pipelines that represent graph traversal operations. Queries in Pixy can also be seemlessly integrated into Gremlin expressions. The combination of Pixy and Gremlin gives the programmer the full power of two graph querying paradigms, viz. graph pattern matching and graph traversal.
DKPro WSD provides UIMA components which encapsulate corpus readers, linguistic annotators, lexical semantic resources, WSD algorithms, and evaluation and reporting tools. You configure the components, or write new ones, and arrange them into a data processing pipeline. DKPro WSD is modular and flexible. Components which provide the same functionality can be freely swapped. You can easily run the same algorithm on different data sets, or test several different algorithms on the same data set.
Apache uimaFIT provides Java annotations for describing UIMA components which can be used to directly describe the UIMA components in Java code without the need for traditional UIMA XML descriptors. This greatly simplifies refactoring a component definition (e.g., changing a configuration parameter name). It also makes it easy to instantiate UIMA components without using XML descriptor files by providing convenient factory methods. It is ideal for testing UIMA components because the component can be easily instantiated and invoked without requiring a descriptor file to be created first.
Hatteras is a business events subscription engine which makes up one component of the Fogcutter Suite. It works with Quoddy to provide the ability for users to create subscriptions to business events on the organization's ESB infrastructure. It connects to Quoddy, downloads all defined subscriptions, then listens for matching messages. Messages which match a subscription are persisted to an XML database, and Hatteras then sends a notification to Quoddy which creates a subscription item record which can be rendered in the user's stream. Quoddy and Hatteras thereby provide seamless access to important business events, alongside other import pieces of content the user has selected.
ClodHopper is a Java library for high-performance clustering of numerical data. It contains clustering implementations such as K-Means, K-Means++, X-Means, G-Means, Fuzzy C-Means, Jarvis-Patrick, and various forms of hierarchical clustering. ClodHopper's clustering implementations take advantage of the host system's concurrent processing ability to speed clustering. The data structures are also very lean to conserve memory usage. ClodHopper is very extensible. If you are developing a new clustering algorithm, you may save yourself an enormous amount of work by extending a ClodHopper base class.