asked on

Extract data from Scanned Images

We receive hundreds of pdf scanned images in different formats; I know it might be generic and there might be different solutions, as its small firm, what are the options available, I aware there are tools like amazon textract, but they are very expensive. We are ready to put effort if we have to build something but don't know how at the moment.

Seeking help/ guidance from experts the best way to categorize and extract information from the files

David Johnson, CD

there are a lot of pdf OCR programs out there
Adobe Acrobat, ABBYY Finereader https://pdf.abbyy.com/ to name a few

Kimputer

The best was always Nuance (now Kofax):

https://www.kofax.com/Products/paperport/professional

Quickly scan, quickly index, and you (and other on the network) can search all the scanned document whenever you want)

Nirvana

ASKER

Thank you, are there are any open source or can be build

Kimputer

Yes it can. It will always lack support. Any technical issues, and you're on your own. Installing and configuring takes a lot of time, you need to read a LOT of documents. You will have to dive REALLY deep into how all the components work and how they interact with each other. Even top IT experts will need a lot of time.
Sometimes, it's not as streamlined and intuitive as the "professional" products as recommended above.

Example: https://www.openkm.com/en/comparison-of-versions.html
https://www.krystaldms.in/comparison.php
(Obviously, only the Community Edition is free)

https://www.papermerge.com/pricing

This question needs an answer!

Become an EE member today

7 DAY FREE TRIAL

Members can start a 7-Day Free trial then enjoy unlimited access to the platform.

View membership options

Learn why we charge membership fees

We get it - no one likes a content blocker. Take one extra minute and find out why we block content.