jWeb1T is a Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It uses binary search to find an n-gram and return its frequency count in logarithmic time. Because the corpus is split across many files, a simple index is used to locate the file that contains a given n-gram.
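The lookup described above can be illustrated with a minimal sketch. This is not jWeb1T's actual code or API: the class `NGramLookupSketch` and its methods are hypothetical names, each "file" is held in memory as a list of lines, and the sketch assumes every line has the form `n-gram<TAB>count` and that lines and files are sorted in the same order as Java's `String.compareTo`. The index maps the first n-gram of each file to that file, so a floor lookup picks the only file that could contain the query, and a binary search then finds the line.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class NGramLookupSketch {

    // Hypothetical index: first n-gram of each file -> that file's sorted lines.
    private final TreeMap<String, List<String>> index = new TreeMap<>();

    /** Registers one "file", given as sorted lines of the form "n-gram<TAB>count". */
    public void addFile(List<String> sortedLines) {
        if (sortedLines.isEmpty()) {
            return;
        }
        index.put(key(sortedLines.get(0)), sortedLines);
    }

    /** Returns the frequency count of the n-gram, or 0 if it is not present. */
    public long getFrequency(String ngram) {
        // Step 1: the file with the greatest first n-gram <= query is the only candidate.
        Map.Entry<String, List<String>> entry = index.floorEntry(ngram);
        if (entry == null) {
            return 0L;
        }
        List<String> lines = entry.getValue();

        // Step 2: binary search over the sorted lines of that file.
        int lo = 0;
        int hi = lines.size() - 1;
        while (lo <= hi) {
            int mid = (lo + hi) >>> 1;
            String line = lines.get(mid);
            int cmp = key(line).compareTo(ngram);
            if (cmp == 0) {
                // Assumes a well-formed line: the count follows the tab.
                return Long.parseLong(line.substring(line.indexOf('\t') + 1).trim());
            } else if (cmp < 0) {
                lo = mid + 1;
            } else {
                hi = mid - 1;
            }
        }
        return 0L;
    }

    // Extracts the n-gram part of a line (everything before the tab).
    private static String key(String line) {
        int tab = line.indexOf('\t');
        return tab < 0 ? line : line.substring(0, tab);
    }
}
```

A real implementation would read from disk rather than from in-memory lists, for example by seeking within each sorted file, but the two-step structure (index lookup, then binary search) stays the same.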
|Tags||Java, Library, n-grams, NLP, Natural Language Processing|
|Operating Systems||Java Runtime Environment|
Release Notes: This release cleans up the code, fixes some bugs, and improves performance.