kio-clucene is a KDE search ioslave that uses clucene, a C++ implementation of lucene, a well-known full-featured text search engine library. kio-clucene gives any KDE application friendly access to searching the content of files inside directories, archives, and many virtual file systems. Optical character recognition (OCR) is supported by using a program like gocr, allowing the text content of images to be indexed.
lamos is PHP-based software for bookmark management and exchange. Links/bookmarks for multiple users are stored in a database and can be retrieved and added via a Web frontend. lamos supports hierarchical categories for links and includes a simple search function. The idea is to allow synchronization of bookmarks between different browsers, systems, and places with a single service that can be accessed from everywhere.
lftpsearch is a set of Perl scripts that are searching for files and directories on FTP servers. There is the searcher, the indexer for getting the lists of all the files and directories at remote FTP servers, and the onliner for checking whether FTP servers are online. It supports "+" (or " ") and "*" in search queries, page splitting at search results, searching on online servers only, searching for files/directories only, searching for size limited files, caching, and Russian names. It also shows some statistics on servers (files/directories amount and total size). Initially, it was created to be used over LANs.
The Full-text Index Data structure library, libfid for short, is a portable software library for accessing indexed data through a simple C interface. It implements, among others, functions for reading indexed data from files, and for performing common operations such as fast string matching. Easy alphabet handling for mapping between printable and binary alphabets is integrated from the ground up. Currently, the enhanced suffix array is the only full-text index data structure supported. A very simplistic program for constructing enhanced suffix arrays is included.
locust is a full featured Internet search engine specifically designed to power vertical search, enterprise search, or a knowledge area search applications. It can index 2.5 million documents per 24 hours on a single Dell server. It consists of clean C++/STL code written from scratch.
lookuplet is a GNOME utility that allows you to easily enter search terms (it automatically grabs the X selection when invoked) and then press a key combination which will cause those terms to be substituted into a URL which is launched using your preferred Web browser, or to be substituted into a command line which is invoked. It is rather useful for things like dictionary lookups, Web searches, map lookups, etc. The idea is to shave seconds off of queries that you find yourself doing all of the time. Though it uses the GNOME libraries, it doesn't require that you be running the GNOME desktop environment.
minimoz consists of a tool to import a dmoz.org database RDF dump into a local database and a Web server for database access. Browsing as well as searching the database is possible. The Web page layout is template driven and quite customizable, including support for content negotiation.
mkESA is a program for constructing enhanced suffix arrays (ESAs) from biological sequence data. The program is based on an implementation of Manzini's lightweight Deep-Shallow algorithm, which can also utilize multiple CPUs/cores for extra performance. The generated output is compatible with the output of mkvtree from the Vmatch package.
mnoGoSearch is a full-featured Web search engine that you can use to build search engines over HTTP, HTTPS, FTP, and NTTP servers, local files, and database text fields. It has built-in support for XML, HTML, eml, and text files. External converters can be used to support other document types. An automatic language/charset guesser for more 70 language/charset combinations is included, along with cluster support, making it possible to distribute data over multiple computers and index huge collections with up to several hundred million documents.