Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people, just like you, are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
Solved

Using OCR softwares to implement a paperless office

Posted on 2012-04-11
13
142 Views
Last Modified: 2016-09-22
Hi,

Suppose someone is shopping for office stationery and they take a photograph of the receipt which is then uploaded to a cloud (dropbox?, shoebox?) where the receipt is processed through an OCR software (omnipage pro, adobe acrobat x?) so that the content of the photographed or scanned receipt is machine readable and thus searchable, could you please advise on which softwares to use to implement this process and how to do it?

Thanks
0
Comment
Question by:PCknots
13 Comments
 
LVL 48

Assisted Solution

by:dbrunton
dbrunton earned 125 total points (awarded by participants)
ID: 37836693
Probably Paperport http://www.paperport.com/scan/

Note that you may not get 100% totally searchable documents.
0
 

Author Comment

by:PCknots
ID: 37852132
Is it able to make the images machine readable? What about photographs of receipts?
0
 
LVL 53

Assisted Solution

by:Joe Winograd, EE MVE
Joe Winograd, EE MVE earned 375 total points (awarded by participants)
ID: 37852305
The title of your question asks about implementing a paperless office, a very broad topic, but the body of it has a specific question about taking a photo of a receipt when shopping. These are very different questions. I have some thoughts on both, but before sending them along, please let me know which question you would like answered.

Btw, the answers to the questions in your last post are: (1) Yes, the two most recent versions of PaperPort (12 and 14 - there was no 13) have built-in OCR and can create a PDF Searchable Image file, which has both the image and a layer of (machine readable) text created by its OCR, which can be searched and copy/pasted. (2) Yes, it can handle photographs of receipts. It has a "Save As" function that can operate on a JPG (or almost any standard image format) and create a PDF Searchable Image file, as described in (1) above. You may learn more about PaperPort here:
http://nuance.com/for-individuals/by-product/paperport/index.htm

One other thing. Your question mentions Dropbox, ShoeBox, OmniPage Pro, and Adobe Acrobat X. Do you have these products? When you ask the question of how to implement the process, are you asking how to do it with these four products or are these just examples, and you're open to using other products? Regards, Joe
0
Portable, direct connect server access

The ATEN CV211 connects a laptop directly to any server allowing you instant access to perform data maintenance and local operations, for quick troubleshooting, updating, service and repair.

 

Author Comment

by:PCknots
ID: 37857523
Thanks Joe, The other softwares mentioned in my question means that I am open to using other softwares as well...What do you think would be the most efficient way to go about it?
0
 
LVL 53

Expert Comment

by:Joe Winograd, EE MVE
ID: 37860920
For implementing a paperless office or for taking a photo of a receipt when shopping and storing/processing it?
0
 

Author Comment

by:PCknots
ID: 37865871
The activity of taking a photo of a receipt when shopping and storing/processing it is being considered a part of paperless. My question here pertains to  taking a photo of a receipt when shopping and storing/processing it.
0
 
LVL 11

Expert Comment

by:sparab
ID: 38632701
I've requested that this question be deleted for the following reason:

The question has either no comments or not enough useful information to be called an "answer".
0
 
LVL 53

Expert Comment

by:Joe Winograd, EE MVE
ID: 38632703
I apologize for missing the author's last post...my bad. Please keep this open and I'll respond today. Thanks, Joe
0
 
LVL 53

Assisted Solution

by:Joe Winograd, EE MVE
Joe Winograd, EE MVE earned 375 total points (awarded by participants)
ID: 38634732
@PCknots, sorry I missed your last response when you posted it.

Since you've clarified that this particular question is just about taking a photo of a receipt when shopping and storing/processing it (not the much bigger question of implementing a paperless office), my advice is this:

(1) Use Dropbox to upload directly from the smartphone's camera to your Dropbox account in the cloud. I use an Android phone with the Dropbox app and all you have to do is turn on the Camera Upload feature and Dropbox will then automatically upload pictures taken. Alternatively, you can turn off the Camera Upload feature, take your photos first, and then upload them manually via the Dropbox app. I don't have an iPhone (or any iOS device), but I'm guessing that the Dropbox app works similarly on it.

(2) Set up Dropbox on your computer to sync with Dropbox in the cloud. When you get home from shopping, pictures of your receipts will have been synced to your computer.

(3) Use whatever software you'd like to OCR the pictures and create a searchable PDF file, which is a PDF file that has an image as well as a layer of text created by the OCR process. My personal favorite for doing this is Nuance's PaperPort:
http://nuance.com/for-individuals/by-product/paperport/index.htm

It's not free, but the street price of the standard edition (not the Professional edition) is very reasonable (around $20-30 these days at various online retailers, including Amazon, Buy.com, CircuitCity, and TigerDirect), and the standard edition is fine for your purposes. It also includes a Dropbox-like capability called PaperPort Anywhere. If you go with PaperPort, you may want to consider using PaperPort Anywhere instead of Dropbox, but either will do (PaperPort Anywhere has a very nice [Scan from camera] button).

If you're looking for FREE, there are numerous packages that can do the job. One is the excellent freeware package IrfanView:
http://www.irfanview.com/

Click the Download link on the left to download IrfanView and click the PlugIns link on the left to download the PlugIns, which are needed to give you PDF capability. Install IrfanView first, then install the PlugIns. For the OCR capability, there's a separate plug-in (also free):
http://irfanview.info/plugins/kadmos/setup_kadmos_irfanview_us.exe

(4) After creating a searchable PDF file from the picture of a receipt, store it wherever you want. I encourage you to create a file structure that makes sense to you, such as by year, store name, type of purchase, whatever. You should also give some thought to file-naming conventions. For example, I begin each file with YYYYMMDD (the receipt's date).

(5) Once you have searchable PDF files (i.e., PDF files with text in them from the OCR process), you can index them, and then search them quickly based on all content in the receipts. There are many search tools out there. My favorite is dtSearch, but at $199 is expensive:
http://www.dtsearch.com/

Another good search product, more reasonably priced, is X1:
http://www.x1.com/

And if you want FREE, Windows Search 4 (WS4) is built into W7 and is available as a free download for XP:
http://www.microsoft.com/en-us/download/details.aspx?id=23

I think that's it! Regards, Joe
0
 
LVL 53

Accepted Solution

by:
Joe Winograd, EE MVE earned 375 total points (awarded by participants)
ID: 41803062
There's more than enough information to confirm an answer. Since the asker clarified that this particular question is just about taking a photo of a receipt when shopping and storing/processing/OCRing it (not the much bigger question of implementing a paperless office), these answers are solutions:

#a37836693
#a37852305
#a38634732

Also, subsequent to this question of more than four years ago, I published the following articles and videos here at EE on the subject of OCRing and creating searchable files:

Convert Scanned Image-Only PDF Files to PDF Searchable Image Files via OCR with Power PDF Advanced
How to OCR pages in a PDF with free software
Batch Conversion of PDF, TIFF, and Other Image Formats via Command Line Interface to PDF, PDF Searchable, and TIFF
PaperPort - How To Create Searchable PDF Files

Regards, Joe
0
 
LVL 53

Expert Comment

by:Joe Winograd, EE MVE
ID: 41810307
The selected posts provide answers to the clarified question, i.e., handling receipts (not the much bigger question of a paperless office).
0

Featured Post

Three Reasons Why Backup is Strategic

Backup is strategic to your business because your data is strategic to your business. Without backup, your business will fail. This white paper explains why it is vital for you to design and immediately execute a backup strategy to protect 100 percent of your data.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
How to modify text in a png image 10 117
best free software for ripping cd's 11 172
Where is Outlook in Paperport 14? 8 81
Error to office online activation want to rol back 2 25
Workplace bullying has increased with the use of email and social media. Retain evidence of this with email archiving to protect your employees.
Companies keep a much closer eye on costs today, so changing to new Technology – Microsoft Office 365 is the smartest move to take.
This is used to tweak the memory usage for your computer, it is used for servers more so than workstations but just be careful editing registry settings as it may cause irreversible results. I hold no responsibility for anything you do to the regist…
In this seventh video of the Xpdf series, we discuss and demonstrate the PDFfonts utility, which lists all the fonts used in a PDF file. It does this via a command line interface, making it suitable for use in programs, scripts, batch files — any pl…

766 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question