120 projects tagged "Indexing"

Download Website Updated 19 Aug 2003 ViIndex

Screenshot
Pop 15.56
Vit 1.00

ViIndex is an indexer program with a flexible and powerful indexer, a finder that allows you to use a combination of AND, OR, a regular expression, and the relative places of the keywords, and a powerful displayer with filter functionality.

Download Website Updated 19 Mar 2009 Multivalent PDF Tools

Screenshot
Pop 168.75
Vit 2.41

The Multivalent PDF Tools is a suite of tools for manipulating PDF documents. It includes tools for compressing, uncompressing (for hand editing), obtaining metadata, splitting and merging, encrypting and decrypting, validating, imposition (aka n-up), making page images, extracting text, and full-text indexing (with Lucene). The compress tool shrinks the PDF 1.5 Reference from 13.5MB to 8MB in PDF 1.5/Acrobat 6 format and down to 5.1MB in a new proposed "Compact" format.

Download Website Updated 18 Oct 2009 isbnsearch

Screenshot
Pop 115.24
Vit 2.98

isbnsearch provides a simple method for retrieving information about any book using only an ISBN or EAN barcode. It is intended to provide assistance for online libraries, user groups, or individual users, and is designed in such a way to provide a distributed ISBN database query system. Users can choose to view the summary information (author, title, publisher, date, edition, subject, ISBN) as HTML, XML, or a pre-formatted SQL statement.

Download Website Updated 25 Dec 2005 Estraier

Screenshot
Pop 125.57
Vit 6.68

Estraier is a full-text search system for personal use. Its principal purpose is to realize a full-text search system for a Web site. It functions similarly to Google, but for a personal Web site or sites in an intranet. It has fast searching, conspicuous results, relational document search, the ability to handle Japanese text, and support for handling a large number of documents. Installation is easy.

Download Website Updated 25 Jan 2010 DataparkSearch

Screenshot
Pop 128.86
Vit 9.65

DataparkSearch is a Web search engine tool. It features support for http, https, ftp, nntp, and news URLs, htdb virtual URL support for indexing SQL databases, text/html, text/xml, text/plain, audio/mpeg (MP3), and image/gif mime types built-in support, external parsers support for other document types, the ability to index multilangual sites using content negotiation, searching of all of the word forms using ispell affixes and dictionaries, stopwords and synonyms lists, boolean query language support, results sorting by relevancy, popularity rank, last modified time, and importance (a multiplication of the relevancy and popularity ranks), support for various character sets, and phrases segmenting for the Chinese, Japanese, Korean, and Thai languages. It has accent-insensitive search, mod_dpsearch for Apache, and support for internationalized domain names.

Download Website Updated 15 May 2004 StringSearch

Screenshot
Pop 64.82
Vit 1.75

The StringSearch library provides implementations of algorithms of the Boyer-Moore family and the Shift-Or (bit-parallel) family, for use in Java programs that need fast string searching algorithms.

Download Website Updated 14 Mar 2004 The Lucene Application Layer

Screenshot
Pop 26.61
Vit 1.00

LUALA is an acronym for LUcene Application LAyer. It is an intermediate level API for document indexing and searching. It uses the low-level API of Lucene.

Download Website Updated 15 Mar 2005 Ellogon

Screenshot
Pop 51.70
Vit 1.82

Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed in order to aid both researchers who are doing research in computational linguistics, as well as companies who produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (like creating and embedding lexicons), tools for creating annotated corpora, accessing databases, comparing annotated data, or transforming linguistic information into vectors for use with various machine learning algorithms.

Download No website Updated 28 Jan 2006 The Revisionist

Screenshot
Pop 46.45
Vit 1.56

The Revisionist is a tool for extracting and indexing hidden metadata (such as deleted or modified text) from large collections of MS Word files. It can operate whole Web sites or SMB or NFS directories. It is handy for pen-testing, or it can be used just to spot embarrassing secrets.

Download Website Updated 27 Apr 2005 POPsearch

Screenshot
Pop 72.11
Vit 4.93

POPsearch is a desktop search engine that is designed to help you easily find information on your computer. With features that other search engines don't have,it lets you index your entire collection of email messages and files. As information is indexed, it is immediately available for analysis from any Web browser. When POPsearch is configured correctly, you can also access your data remotely with RSS feeds, email feeds, or from any computer that has a Web browser.

Screenshot

Project Spotlight

Lilblue Linux

An XFCE4 desktop system built on uClibc.

Screenshot

Project Spotlight

Devel Live CD

A Live CD to compile programs.