Solved

Optical Matching/ OCR

Posted on 2011-02-14
11
1,096 Views
Last Modified: 2012-05-11
I am trying to find a API that would match two images I have and give a % similarity and also even match text on the two images being compared.
0
Comment
Question by:surajguptha
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 6
  • 5
11 Comments
 
LVL 37

Accepted Solution

by:
TommySzalapski earned 500 total points
ID: 34891101
OpenCV is a powerful image processing library. I would recommend it for most of those types of applications. It's in C/C++. (It has a C++ wrapper, but all the real code runs in C so it's very fast).
http://opencv.willowgarage.com/wiki/

Here's an intro to doing OCR in OpenCV.
http://blog.damiles.com/?p=93

You would need to decide what makes images similar but code exists for many different options. I won't bother to post any since there are so many.
0
 
LVL 21

Author Comment

by:surajguptha
ID: 34900138
Thanks, I would like to use this in my .Net application. Is there any image processing library that is more suitable to use with .net and perhaps even written in .Net?
0
 
LVL 37

Expert Comment

by:TommySzalapski
ID: 34900244
You don't want image processing written in .NET. It's too slow. What you really want is image processing code written in C with .NET wrappers around it so you can call the routines from .NET.

Fortunately for you, you are not the only one who wanted this. Emgu CV is exactly what you are looking for I think. You can write all your code in .NET, but the behind-the-scenes code for the processing will run in efficient and fast C code.
http://www.emgu.com/wiki/index.php/Main_Page
0
How Do You Stack Up Against Your Peers?

With today’s modern enterprise so dependent on digital infrastructures, the impact of major incidents has increased dramatically. Grab the report now to gain insight into how your organization ranks against your peers and learn best-in-class strategies to resolve incidents.

 
LVL 21

Author Comment

by:surajguptha
ID: 34900642
It is indeed an awesome library for image processing but I will have to develop a lot of intelligence teaching this system about a ton of languages I want my software to support. Is there perhaps some other software for just OCR that is aware of a group of languages?

Thanks
0
 
LVL 37

Assisted Solution

by:TommySzalapski
TommySzalapski earned 500 total points
ID: 34900858
Tesseract-OCR is Google's open source OCR solution.
http://code.google.com/p/tesseract-ocr/
There are a lot of people working on it for many different languages and scripts.
Agian, there already exists a .NET wrapper for it called tessnet2
http://www.pixel-technology.com/freeware/tessnet2/
0
 
LVL 21

Author Comment

by:surajguptha
ID: 34902232
Thanks Tommy! It looks very promising.
I tried downloading it and when I launched the Demo application, it crashes. It was heart breaking :P
0
 
LVL 37

Expert Comment

by:TommySzalapski
ID: 34902313
Hmm... Make sure all the dlls are in the right folders (Tessnet2.dll). Probably needs to be in the same folder as the .exe or a folder referenced in your %PATH% system variable.
I assume you have the needed runtimes already installed.
0
 
LVL 21

Author Comment

by:surajguptha
ID: 34902523
Yes, I tried every combination of folder structure and tried putting the files everywhere just in case I was doing wrong. The moment I click on "OCR", the button that is supposed to convert, it crashes! No events in event viewer, nothing. Just dies.
0
 
LVL 37

Expert Comment

by:TommySzalapski
ID: 34902701
I guess the next step would be to see if you can compile a quick test project. If that works, who needs the demo? Maybe they referenced some weird runtime functions or something that your computer doesn't have.
0
 
LVL 21

Author Comment

by:surajguptha
ID: 34902773
I actually did. I took the source of the demo application and compiled it and then used the newly generated exe. But did reuse the existing .net wrapper since I did not have VC++ to recompile it on my machine.
0
 
LVL 21

Author Comment

by:surajguptha
ID: 34903270
Works fine! Had to change some folder structures!! Thats all!
0

Featured Post

Free Tool: Subnet Calculator

The subnet calculator helps you design networks by taking an IP address and network mask and returning information such as network, broadcast address, and host range.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article discusses the PaperPort 14 Scanner Connection Tool, which Nuance provides at no charge in order to fix scanning problems in Windows 8. Furthermore, users of PaperPort 14 in Windows 7 and Windows 10 have reported that the tool works in t…
PaperPort is a popular document imaging/management product from Nuance Communications (http://www.nuance.com/). It is in widespread use by both individuals (http://www.nuance.com/for-individuals/by-product/paperport/index.htm) and businesses (http:/…
We often encounter PDF files that are pure images, that is, they do not have text characters, but instead contain only raster graphics. The most common causes of this are document scanning software and faxing software/services that create image-only…
This video Micro Tutorial is the first in a two-part series that shows how to create and use custom scanning profiles in Nuance's PaperPort 14.5 (http://www.experts-exchange.com/articles/17490/). But the ability to create custom scanning profiles al…

749 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question