I have a problem in my web crawler where I am trying to retrieve images from a particular website. Problem is that often I see images that are exactly same but different in
You can compare images using:
1) simple pixel by pixel comparison
It will not give very good results when there is some shift, rotation, illumination change, ...
2) Relatively simple but more advanced approach
http://www.lac.inpe.br/JIPCookbook/6050-howto-compareimages.jsp
3) More advanced algorithms
For example RadpiMiner and IMMI extension contains several image comparison algorithms, you can experiment with different approaches and select, which suits you best for your purpose...