openDIAS (Document Imaging Archive System) provides document imaging with OCR. You can scan documents (with SANE) or import office documents, then assign them tags. It can store all your letters, bills, statements, etc. in a convenient, safe, and easily retrievable way.
|Tags||Office/Business Office Suites Capture Scanners Information Management Document Repositories|
|Operating Systems||Unix POSIX Linux|
|Translations||English German Dutch|
Release Notes: This release added auto tagging by detecting similar documents. Username authentication now uses a new session system. A Dutch translation was added and a number of German translations were fixed. Scanning color and performance were improved. The UI was improved UI with print now, date formats, file uploads, and error feedback. There were lots of internal refactors and improvements, new tests, and increased test performance.
Release Notes: This release implement issues #7 (document linkage), issue #12 (better testing), and issue #13 (device locking). PDF, ODF, and image imports correctly have a thumbnail and OCR performed. Works on 64-bit machines. Migrated to tesseract v3. I/O is all UTF-8. The front end will now time out rather than hang on an error. The document list is now auto-loaded rather than using a paged table. A localization framework has been put in place (with English and German languages added). Defunct speech functionality has been removed. Various bugfixes and cleanups.
Release Notes: This release introduces major new functionality and polishes the code and user interface. Overall, it is a solid increase in functionality and quality.
Release Notes: The software was totally rewritten from the bottom up. It is now a Web based client, interfacing into a server backend that controls the SANE devices and the database.
Release Notes: Threading was added to "slick up" the UI in places. More memory leaks were fixed. Things are handled when the loading image is not available. Compiler flags are used to set the data_dir. Cleanup was done. A "no OCR libs" error was fixed. Lots of memory leaks were fixed. Bind vars are now used for database updates and inserts. The "should we OCR" setting now defaults to on. Image processing was added to allow paged views of scanned images. An icon was added to the application. A build failure bug if tesseract is not installed was fixed.