openDIAS (Document Imaging Archive System) provides document imaging with OCR. You can scan documents (with SANE) or import office documents, then assign them tags. It can store all your letters, bills, statements, etc. in a convenient, safe, and easily retrievable way
主要特性:
Scan documents (SANE). Extract the text (OCR), and use for searching or export.
Import PDF, ODF and Image files, extract images and text from these as well.
Assign tags to docs, link docs to each other, zoom in, export and print docs.
Auto detect similar documents, the application can offer to ‘tag’ and ‘title’ new docs for you.
Application is accessible from any HTTP browser, and secured behind usernames and passwords.
Application is fully localisable (currently localised into English, German and Dutch).
Published API that is fully tested.