PaperLight by DEKALOGIC
PaperLight is a unconventional document management application (for image and PDF files), developed as an extension of Windows Operating System. PaperLight innovates all aspects of classic approach in document management software to offer you power with unbelievable simplicity.
PaperLight can also be used for easy adding of metadata to photos, art files and so on. Metadata are indexed allowing instant retrieval of such image files by using keywords search.
- User Interface
- Metadata Viewer
- Floating Viewer
- OCR approach
- Metadata approach
- File transfers
User Interface: our new approach
Working with document files either scanned paper or photos/artwork means in most cases also working with other applications.Therefore, our interfaces were designed to facilitate simultaneous work with any other installed application. PaperLight means document management organically embedded into your Windows.
PaperLight provides all needed commands at just one click away from you. New options are unfolding gradually, in a logical and natural way, without crowding your work-space. And you can view, pause/resume or cancel any ongoing PaperLight process at any time. Nothing is irreversible or beyond your control.
Each interface element is optimised for its purpose. For example, search means a box not a window, viewer is focused on image not on space-consuming toolbars, the Control Center needs just one click on tray icon to be displayed, search results are zoomable thumbs not lists of filenames. And so on. We've even made 2 different viewers.
We've made sure that whatever you might need from PaperLight - it's always in the right place at the right moment. Get as much power as you need. And if you don't need it - you don't even see it.
Metadata Viewer: our new approach
Our philosophy is that text and image are like "yin" and "yang": they are complementary opposites forming together a greater whole. Metadata Viewer is a fully-featured image and PDF files viewer dedicated to offer you access to the image-text continuum. Via an innovative interface.
You would be surprised how much metainformation image files can contain. Image editing applications like Photoshop or hardware such as photo cameras, scanners, etc save a great deal of information which is vital for professionals in respective domains. Thanks to Phil Harvey's EXIFtool, if a file contain any metadata at all, no matter how it was generated or under which standard it was saved, you will see it. Not some of them. All of them.
Add your own metadata either by usual text editing techniques or by capturing text from any of your other installed applications. You can capture text currently under mouse cursor or when making a text selection in any external application. Or you can select a running application and everything you type exclusively in its active window will be added as metadata to currently displayed image in Metadata Viewer.
Text from OCR
PaperLight saves OCR information (recognised text and character position info) in the file itself. So you will never have to OCR same file again. You can search text and see found words highlighted on the image. You can manually correct OCR recognition mistakes and still keep the highlight feature working. Not only you but anyone you share the file with. Even someone having free version of PaperLight.
Floating Viewer: our new approach
Floating Viewer was designed to allow handling image and PDF files the same way you handle paper. Magic paper. It shows you nothing but the image. Image is entirely zoomable or zoomable within a defined frame. Access its options using keyboard shortcuts, right-click menu or floating toolbar. It is fully-featured : zoom, rotate, jump to page, find text and highlight found words, full screen, etc.
You can write in any application while reading from Viewer, it stays on top if you want, where you want and at the size you want. You can open multiple instances of same multipage PDF or TIFF file to read/compare different pages of same document. Find text inside document and get found words highlighted, either if it is a text-based PDF or a file OCRed with PaperLight.
Due to its unique way to display, Floating Viewer does not open files at their default sizes because, unlike usual viewers, too small or too big default sizes would be bothering to handle. So when opening the image or PDF it calculates optimum opening size. Then it's totally up to you.
Each file is opened in a separate instance of the viewer. So when you keep opening new files, they get at first automatically stacked allowing you can handle with ease. And since they all look like paper, the currently active file shows you for a short while the filename and its current page.
OCR: our new approach
OCR text, OCR language and character position info are saved in the image file itself. This simple and robust approach means you will never have to OCR same file again, unless it is your express will. It also means you (and anyone else you shared the file with) will be able to search text inside that document and view found words highlighted on the image. And even manually correct OCR mistakes while still keeping highlight feature available.
PaperLight analyses each file and automatically determines whether it requires OCR or not. Text-based PDF files get text extracted on-the-fly by direct parsing, photos are identified if specific original metadata exists and previously OCRed files are identified due to saved text. A double-windowed thumb viewer shows analysis results, allowing you to manually change OCR status using drag-and-drop.
You can set OCR language for all files or on a per-file basis and that's it! Then mind your other business, as PaperLight sends files to OCR engine, each one being OCRed in the language you've set and when finished, the resulted text info is saved inside the image file and the index gets automatically updated. PaperLight notifies you about everything.
Tesseract and ABBYY FineReader
Default OCR engine of PaperLight is royalty-free Tesseract installed locally on your computer. But you might also be interested to test our ABBYY FineReader OCR engine integration : we provide an online OCR demo limited to 10 pages.
Metadata approach: a new concept
Metadata Viewer allows you to associate any text with the opened image or PDF file and save this info inside the file itself with extreme simplicity. With PaperLight you can edit, copy/paste, capture metadata but you can also chose to simply remove the PaperLight metadata structure from file or save it as XML before removing. PaperLight indexes only its specific metadata structure and skips all other in order to prevent overwhelming search results.
Simpler is better
Saving all text (including OCR text info) inside the image/PDF file itself means that PaperLight don't need to store image and text in different places, under different file formats and don't need the complex code required to maintain correlation between them. It also means that processed files (metadata and OCR) become shareable because wherever the image goes, the text goes with it.
Cheaper and safer
But there are more advantages: once OCRed a file stays OCRed. Or : there is no difference between storing/writting to DVD an archive of processed documents (metadata and OCR) and storing a simple collection of unprocessed image files. With PaperLight it's simply the same.
Towards cross platform
PDF and most important image formats are cross-platform. PaperLight saves metadata and OCRed text in an open (not proprietary) standard format. And you can use the totally portable EXIFtool by Phil Harvey to access them. Not exactly same way as with PaperLight, but stay tuned : our to-do list include PaperLight versions for other platforms!
Indexing: our new approach
PaperLight indexing is done on per-folder basis, by analysing all document files (images & PDFs) inside the folder. PaperLight indexes metadata added through it as well as full text (if apppliable) via an automated process described below.
Any folder containing image and PDF files can be indexed (full-text and metadata). After text-extraction and indexing are completed, PaperLight adds a subfolder called "PaperLight Data" in the indexed folder.
Automatic analysis of files
In order to optimize indexing workflow, files are inspected to determine whether they need OCR or not.Already OCRed files and photos are not sent to OCR. Text based PDFs are also excluded from OCR as the text is being extracted by direct parsing. User can assing OCR language separately for each file requiring OCR.
Real-time index update
Folders that have been indexed with PaperLight are being monitored so when new files are added or relocated or deleted, the index is automatically updated in real time.
Searching: our new approach
PaperLight search-related interfaces are reduced to the minimum. Searching is done through custom selectable indexes (per-folder based). Documents are displayed in PaperLight Floating Viewer and keywords are highlighted on the displayed image.
PaperLight uses a (heavily modified) C++ version of Lucene, thus allowing users to perform complex query statements such as wildcard searches, proximity searches, fuzzy searches, range searches, etc.
Per-folder indexing allows per-index searching meaning you are able to select in which folders the search should be executed. Search results are displayed in a single but multi tabbed window (one tab for each searched index that returns at least one result)
File transfers: our new approach
With PaperLight, file transfer is done via a hybrid approach which combines advantages from instant messaging and email approaches.
Instant messaging - like
PaperLight allows users to create one (or multiple) user account(s) with customizable profile and have a list of contacts, same as in Instant Messaging apps. You can click on a Contact from your list then drag and drop document files (image and PDF files only) in the window. Files will be shown as thumbs for convenience. Clicking the SEND button will upload files to DEKALOGIC server. And here the similiarities with IM approach stops.
But from here, the similarities with email approach starts : your Contact don't need to be online in PaperLight simultaneously with you. He will receive notification about files sent to him next time he runs PaperLight. He will be able to chose whether to download them or reject them. Full history of file transfers with per-file status is available. Files being succesfully downloaded are deleted from server. Files that were not downloaded in 30 days from sending are automatically deleted from server.
PDF virtual printer
As PaperLight deals with image and PDF files only, we provided a PDF virtual printer allowing you to create PDFs out of other file formats.
This conversion makes other file types available for PaperLight processing.
PaperLight allows you to capture any text on your screen being under your mouse cursor, no matter the application on screen to which the text belongs. With a simple press of a button, the text is automatically added as metadata to the currently opened document in PaperLight.
This is useful when you want to add data existing in any external app as metadata to a certain document opened in PaperLight.
Suppose you are an accountant viewing new documents with PaperLight and needing to enter data from the scanned document (image on screen) to your accounting-specialised application.
You can select the app to be monitored by PaperLight so each time you enter data from your keyboard the same data will simultaneously added as metadata to the displayed image as well.
Text selection capture
PaperLight allows you to add text selections from any external application as metadata to currently opened document in PaperLight, sparing you to select then copy then paste then save. Just select and save.
All commands are available from Windows Explorer context menus. PaperLight displays only commands that makes sense and only when needed. No esoteric wizards, no overwhelming lists of options, everything is simple and intuitive.
Full automation with full control
File analysis, text extraction by direct parsing, OCR, indexing, index updates, everything is fully automated. And still you keep full control : nothing is done without your OK. Also, you can view all ongoing processes via the Control Center and pause/resume or cancel any of them.
Scanned paper documents
Create document archives on a per-folder basis (image and PDF files). Add metadata and/or send to OCR. Files not requiring OCR are automatically separated. OCR only once: resulting text is saved in the file itself. Select appropriate OCR language on a per-file basis.
Photos and artwork files
Manage your photos/artwork collections via metadata. Easily add/modify your own (indexable) metadata to the file. And also view all existing metadata, no matter who and how added it (photo camera, graphics editing software, etc.)
PaperLight provides secure files-transfer services between its users (image and PDF files only). Also, besides locally installed Tesseract (the default OCR engine of PaperLight) you can test our ABBYY FineReader OCR engine integration online (demo is limited to 10 pages).
Evaluation license provides fully featured product with limited online service access. After evaluation expires, PaperLight turns into free edition* . Payed licenses can be configured to match your exact needs. No more, no less.
- Multithreaded application
- Full Unicode support
- Convert-to-PDF (PDF virtual printer)
- Capture any text currently under mouse and add as metadata
- Capture keyboard for selected application and add as metadata
- Capture text selections from any external app and add as metadata
- Files are handled as zoomable thumbs
- Extracts text by direct parsing from text-based PDF files
- Find and highlight feature for textPDF and OCRed files
- Automatic OCR status diagnosis
- Set OCR language per-file
- Index on per-folder basis
- Search on per-index selection
- Supports boolean, wildcard, proximity, range, fuzzy, etc. searches
- Highlight found keywords in Viewer
- and much more
PaperLight runs on all Windows platforms, XP or later (32 -bit or 64-bit).
DEKALOGIC is a small and compact team working hard for just one goal : to surpass itself.
We are doing what we like until we get to like what we did.
Then we realize it's not good enough, so...it's a never-ending story.
DEKALOGIC is a French-Romanian Company founded in 2009.