Link to home
Start Free TrialLog in
Avatar of Nirvana
NirvanaFlag for India

asked on

Extract data from Scanned Images

We receive hundreds of pdf scanned images in different formats; I know it might be generic and there might be different solutions, as its small firm, what are the options available, I aware there are tools like amazon textract, but they are very expensive. We are ready to put effort if we have to build something but don't know how at the moment.

Seeking help/ guidance from experts the best way to categorize and extract information from the files
Avatar of David Johnson, CD
David Johnson, CD
Flag of Canada image

there are a lot of pdf OCR programs out there
Adobe Acrobat, ABBYY Finereader https://pdf.abbyy.com/ to name a few
Avatar of Kimputer
Kimputer

The best was always Nuance (now Kofax):

https://www.kofax.com/Products/paperport/professional

Quickly scan, quickly index, and you (and other on the network) can search all the scanned document whenever you want)
Avatar of Nirvana

ASKER

Thank you, are there are any open source or can be build 
Yes it can. It will always lack support. Any technical issues, and you're on your own. Installing and configuring takes a lot of time, you need to read a LOT of documents. You will have to dive REALLY deep into how all the components work and how they interact with each other. Even top IT experts will need a lot of time.
Sometimes, it's not as streamlined and intuitive as the "professional" products as recommended above.

Example: https://www.openkm.com/en/comparison-of-versions.html
https://www.krystaldms.in/comparison.php
(Obviously, only the Community Edition is free)

https://www.papermerge.com/pricing


This question needs an answer!
Become an EE member today
7 DAY FREE TRIAL
Members can start a 7-Day Free trial then enjoy unlimited access to the platform.
View membership options
or
Learn why we charge membership fees
We get it - no one likes a content blocker. Take one extra minute and find out why we block content.