Connexor Machinese analyzers process sequences of written words, identify and classify the various entities in them, and show how these relate to each other, marking the language with a simple and systematic notation. Currently, the Machinese product family includes: Machinese Phrase Tagger, a fast, light-weight morphosyntactic tagger; Machinese Syntax, a full-scale dependency parser; Machinese Semantics, a dependency parser with semantic analysis; and Machinese Metadata, an entity extractor.
|Tags||Text Processing Indexing Linguistic|
|Operating Systems||Windows SymbianOS POSIX Linux Solaris|
Release Notes: This version is a complete redesign of Connexor's versatile and powerful text analytics and metadata extraction solution. Machinese Metadata now provides intelligent entity recognition and classification, date and measurement normalization, the ability to use custom lexicons and ontologies, support for ten languages, and language identification.
Release Notes: This version of Machinese Phrase Tagger introduces an enhanced custom lexicon mechanism and provides various improvements in analysis.
Release Notes: This version offers language identification and noun phrase detection. A command line utility and TCP/IP socket server executable are now also available on the MS Windows platform. Program output formats have been revised: output format options include now XML output. Handling of XML input and imperfect text input has been enhanced. Sentence boundaries are now shown in the output. The language models have been updated, and speed and memory usage have been improved.
Release Notes: There are now two new supported languages: Danish and Norwegian. This version introduces a custom lexicon facility to allow users to add new vocabulary to the analyzer. The Unix/Linux versions now support output as prose instead of tags and allow new types of preformatting tokens.
Release Notes: Name identification has been further improved. General analysis quality has been improved. Danish and Norwegian have been added to the Machinese language palette. Memory consumption is now smaller. Input handling has been improved.