• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 1132
  • Last Modified:

Optical Matching/ OCR

I am trying to find a API that would match two images I have and give a % similarity and also even match text on the two images being compared.
0
surajguptha
Asked:
surajguptha
  • 6
  • 5
2 Solutions
 
TommySzalapskiCommented:
OpenCV is a powerful image processing library. I would recommend it for most of those types of applications. It's in C/C++. (It has a C++ wrapper, but all the real code runs in C so it's very fast).
http://opencv.willowgarage.com/wiki/

Here's an intro to doing OCR in OpenCV.
http://blog.damiles.com/?p=93

You would need to decide what makes images similar but code exists for many different options. I won't bother to post any since there are so many.
0
 
surajgupthaAuthor Commented:
Thanks, I would like to use this in my .Net application. Is there any image processing library that is more suitable to use with .net and perhaps even written in .Net?
0
 
TommySzalapskiCommented:
You don't want image processing written in .NET. It's too slow. What you really want is image processing code written in C with .NET wrappers around it so you can call the routines from .NET.

Fortunately for you, you are not the only one who wanted this. Emgu CV is exactly what you are looking for I think. You can write all your code in .NET, but the behind-the-scenes code for the processing will run in efficient and fast C code.
http://www.emgu.com/wiki/index.php/Main_Page
0
Keep up with what's happening at Experts Exchange!

Sign up to receive Decoded, a new monthly digest with product updates, feature release info, continuing education opportunities, and more.

 
surajgupthaAuthor Commented:
It is indeed an awesome library for image processing but I will have to develop a lot of intelligence teaching this system about a ton of languages I want my software to support. Is there perhaps some other software for just OCR that is aware of a group of languages?

Thanks
0
 
TommySzalapskiCommented:
Tesseract-OCR is Google's open source OCR solution.
http://code.google.com/p/tesseract-ocr/
There are a lot of people working on it for many different languages and scripts.
Agian, there already exists a .NET wrapper for it called tessnet2
http://www.pixel-technology.com/freeware/tessnet2/
0
 
surajgupthaAuthor Commented:
Thanks Tommy! It looks very promising.
I tried downloading it and when I launched the Demo application, it crashes. It was heart breaking :P
0
 
TommySzalapskiCommented:
Hmm... Make sure all the dlls are in the right folders (Tessnet2.dll). Probably needs to be in the same folder as the .exe or a folder referenced in your %PATH% system variable.
I assume you have the needed runtimes already installed.
0
 
surajgupthaAuthor Commented:
Yes, I tried every combination of folder structure and tried putting the files everywhere just in case I was doing wrong. The moment I click on "OCR", the button that is supposed to convert, it crashes! No events in event viewer, nothing. Just dies.
0
 
TommySzalapskiCommented:
I guess the next step would be to see if you can compile a quick test project. If that works, who needs the demo? Maybe they referenced some weird runtime functions or something that your computer doesn't have.
0
 
surajgupthaAuthor Commented:
I actually did. I took the source of the demo application and compiled it and then used the newly generated exe. But did reuse the existing .net wrapper since I did not have VC++ to recompile it on my machine.
0
 
surajgupthaAuthor Commented:
Works fine! Had to change some folder structures!! Thats all!
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

Featured Post

Get your problem seen by more experts

Be seen. Boost your question’s priority for more expert views and faster solutions

  • 6
  • 5
Tackle projects and never again get stuck behind a technical roadblock.
Join Now