fliXXery is a simple and intuitive movie and TV series database. It helps users organize their movies and list them by ID, title, year, length, genre, filetype, etc. It allows users to define their own genres, filetypes, and languages for their movies, so they don't have to bother with unsetting pre-defined values. It features different user rights levels. It is easy to install and setup, simple to backup and restore, and supports multiple languages.
focuseek searchbox is a family of easily installable full-text search engines that can spider Internet and intranet data sources (Web sites, newsgroups, FTP sites, and others) or index data you feed to it and make it available for searching. It supports a variety of input formats (among them HTML, PDF, Microsoft Word DOC, and RTF), and is easily scriptable via SOAP and extendable through plugins. It can scale to millions of documents and comes with a full-fledged GUI client, a built in Web search portal, and an RSS server.
gonzui is a source code search engine for accelerating open source software development. In the open source software development, programmers frequently refer to source code written by others. The project's goal is to help programmers develop programs effectively by creating a source code search engine that covers vast quantities of open source codes available on the Internet.
Grub-client is a distributed crawling client, used to create an infrastructure that provides URL update status information for Web pages on the Internet. Grub's distributed crawler network will enable Web sites, content providers, and individuals to notify others that changes have occurred in their content, all in real time. Clients are ranked by the numbers of URLs that are crawled, both on their own machines and other servers.
Harvest is a system to collect information and make it searchable using a Web interface. It can collect information using HTTP, FTP, NNTP, and local files. Supported formats include HTML, DVI, PS, fulltext, mail, man pages, news, troff, WordPerfect, C sources, and many more. Adding support for new formats is easy due to Harvest's modular design.
ht://Check is a link checker derived from ht://Dig. It can retrieve information through HTTP/1.1 and store it in a MySQL database so that after a "crawl", ht://Check can return broken links, anchors not found, content-types, and HTTP status codes summaries. ht://Check also performs accessibility checks in accordance with the principles of the University of Toronto's Open Accessibility Checks (OAC) project, allowing users to discover site-wide barriers like images without proper alternatives, missing titles, etc. A PHP interface lets the user query and view the results directly via the Web.