Mguesser is a tool to guess a text's character set and language. It is a standalone part of the mnoGoSearch engine. More than 100 various character set and language combinations are supported.
|Tags||Text Processing Filters General Linguistic|
Release Notes: The -d command line option was added to load language maps from a non-default directory. A colon-separated list of directories is also supported. The -t command line option was added to specify how many top n-grams to print into the output map. The default value is 200, which can be decreased for better performance or increased for better detection quality. About 30 new model maps were added.
Release Notes: The ability to create new language maps was added. Azerbaijani UTF-8 language maps were added. Other minor enhancements were made.
No changes have been submitted for this release.