Use wget to do a recursive download of the site (wget -r), which will dump all the files onto your hard drive, then write a second script to go through the downloaded files and analyze them.
Edit: curl might work instead of wget, but I'm not familiar with it and don't know whether it does recursive downloads the way wget does.
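
For the second step, here is a rough sketch of what the analysis script could look like, assuming the files have already been downloaded into a local directory. The function name summarize_downloads and the choice to count files and bytes per extension are just placeholders of mine; swap in whatever analysis you actually need.

```python
import os
from collections import Counter


def summarize_downloads(root):
    """Walk `root` and tally file counts and total bytes per extension.

    Hypothetical helper: the "analysis" here is only a per-extension
    summary, standing in for whatever real analysis is needed.
    """
    counts = Counter()
    sizes = Counter()
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Files without an extension are grouped under "(none)".
            ext = os.path.splitext(name)[1].lower() or "(none)"
            counts[ext] += 1
            sizes[ext] += os.path.getsize(path)
    return counts, sizes
```

By default wget -r should save everything under a directory named after the host, so you would call something like summarize_downloads("example.com") (a made-up host) and loop over counts.most_common() to print the results.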