1
I need to organize a large number of scanned receipts. The receipts are scanned in PDF, and I need is software that will intelligently look at a document and parse out date, location / vendor, total amount paid, etc. The key is that these fields need to extracted from each document for tabulation; it isn't enough to just have them OCRed for search purposes.
I think I am essentially looking for similar functionality as is in the NeatReceipts scanners, but without the need to do the scanning. Are there any 3rd party tools for doing this sort of specialized parsing or OCR?
Care to explain what it does and/or how this solves the problem? Just posting links as an answer is frowned upon, especially when its used to endorse your own product – Ivo Flipse – 2011-12-06T10:47:18.733
Sorry for not being specific enough. I beleive this software is just what the author is looking for, as it's made for specialized OCR of single-type documents: forms, invoices, recepiets etc. – Nikolay – 2011-12-07T07:27:28.480
And here's the copy-paste description from the link i provided:
ABBYY FlexiCapture 10 is intelligent, accurate and scalable document capture and data extraction software. It provides a single entry point to automatically transform the stream of different forms and documents of any structure and complexity to usable and accessible data ready to be exported into your business applications and databases. – Nikolay – 2011-12-07T07:28:27.430
See, this makes your post much much better and even worth an upvote :-) – Ivo Flipse – 2011-12-07T08:26:49.687
1Actually I'm not sure this really does the trick: as far as I can tell it ABBYY FlexiCapture doesn't actually parse receipts on its own. It is more of a configurable platform for enterprise-level document management (including parsing). I'm looking for something consumer / end-user focused. Thanks for the pointer tho! – Ramon – 2012-04-17T18:41:33.973