193 projects tagged "Linguistic"

Download Website Updated 25 Oct 2002 SILGraphite

Screenshot
Pop 40.25
Vit 65.07

SILGraphite (formerly OpenGraphite) is a project within SIL's Non-Roman Script Initiative and Language Software Development groups to provide extensible cross-platform rendering capabilities for complex non-Roman writing systems. It consists of a rule-based programming language, Graphite Description Language (GDL), that can be used to describe the behavior of a writing system, a compiler for that language, and a rendering engine that can serve as the backend of a text processing application. SILGraphite renders TrueType fonts that have been extended by means of compiling a GDL program. It is currently being integrated into Gecko/Mozilla through the SILA project, a GNU/Linux port is also underway, and there are plans for OpenOffice.org and Abiword integration.

Download Website Updated 03 Nov 2002 Marko

Screenshot
Pop 30.05
Vit 1.42

Marko is a simple toolset that allows you to create markov chain databases of a corpus (or two) of text and then allows you to compare unknown texts to these databases. For any two marko databases you can calculate the probability that the unknown body is related to one over the other. Possible applications include intelligent mail filtering, plagiarism detection, and historical research.

Download Website Updated 26 Mar 2006 dbacl

Screenshot
Pop 176.95
Vit 4.91

dbacl is a digramic Bayesian text classifier. Given some text, it calculates the posterior probabilities that the input resembles one of any number of previously learned document collections. It can be used to sort incoming email into arbitrary categories such as spam, work, and play, or simply to distinguish an English text from a French text. It fully supports international character sets, and uses sophisticated statistical models based on the Maximum Entropy Principle.

Download Website Updated 22 Jan 2004 Pythoñol

Screenshot
Pop 93.21
Vit 2.11

Pythoñol is an all-in-one program that helps English speakers learn Spanish. It features pronunciation, verb conjugation, a dictionary with over 70,000 words, a thesaurus, quizzes, full-text translation, idioms, a verb browser, and a large reference section.

Download No website Updated 28 Dec 2002 gendic

Screenshot
Pop 20.93
Vit 1.00

gendic generates an Arabic dictionary made of word roots from an input text file.

Download Website Updated 28 Feb 2012 Hspell

Screenshot
Pop 66.81
Vit 7.46

Hspell is a Hebrew linguistic project. It features a Hebrew spell-checker, and aims to use the databases and algorithms developed as a morphology engine (for example, for search engines), and in the future for advanced things like Hebrew speech synthesis.

No download Website Updated 24 May 2003 Biaroza

Screenshot
Pop 26.76
Vit 1.44

Biaroza is a multi-dictionary system for human languages which aims to set a standard on such type of software. It works internally (and externally if you want so) in UTF-8. The software itself supports querying by particles, customizable in/out filtering, and interface mode (for using with another software) among other features.

Download Website Updated 09 Feb 2003 DadaDodo

Screenshot
Pop 42.05
Vit 1.00

DadaDodo is a program that analyses texts for word probabilities, and then generates random cut-up sentences based on that. It is a travesty generator similar to Dissociated Press, but based on a Markov Chain of length 1.

Download Website Updated 29 Dec 2007 GNU Talk Filters

Screenshot
Pop 109.61
Vit 4.78

The GNU Talk Filters are filter programs that convert ordinary English text into text that mimics a stereotyped or otherwise humorous dialect. Some of these filters have been in the public domain for many years, but here they are provided as a single integrated package. The filters include austro, b1ff, brooklyn, chef, cockney, drawl, dubya, fudd, funetak, jethro, jive, kraut, pansy, pirate, postmodern, redneck, valspeak, and warez. This package provides the filters both as individual executables and collectively as a C library, so they can be easily embedded in other programs.

Download Website Updated 21 Feb 2003 mioReaderLite

Screenshot
Pop 9.54
Vit 1.00

mioReaderLite is a Japanese capable text reader with integrated dictionary for the Sharp Zaurus PDA. It's part of a more ambitious project, mioSuite. Due to the manual search capabilities, the program can also be used as a Japanese/English dictionary.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.