Link to home
Start Free TrialLog in
Avatar of jaipur07
jaipur07

asked on

How to compare 2 images for similarity?

Hi All,

I have 100000 tiff images in a folder. There are many duplicate images with different names. I want to compare images for duplication.

Could somebody help me in that?
I heard that there are utilities to compare in binary form...

ASKER CERTIFIED SOLUTION
Avatar of Richard Quadling
Richard Quadling
Flag of United Kingdom of Great Britain and Northern Ireland image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of jaipur07
jaipur07

ASKER

Thanks Richard

I would appriciate if you can point something in Java
Ah. Not my strong point at all. I've not done Java.

But getting this script running  on windows would take around 2 minutes.

Maybe creating a pointer question in the Java section to this one would be of use.

Watch out for anyone saying you have to compare every file with every other file.

You don't.

The md5 hash is good enough to determine similarity.

So you only need to pass through the files once.
SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Yes. I agree that an md5 would only provide a match where the BINARY is identical. It would NOT make any allowance for the content of the image.

SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
No objections.