There's a Windows app called CheckWeb. It's no longer developed, but it works well, and the source code is available (C++, I believe).
You just give it a URL, and it will crawl your site (and external links, if you choose), reporting any errors, image and page "weight", etc.
http://www.algonet.se/~hubbabub/how-to/checkweben.html
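
If you'd rather script something similar yourself, here is a rough sketch of the same idea in Python. To be clear, this is not CheckWeb's code, and I haven't looked at how CheckWeb does it internally. The names `LinkExtractor`, `fetch`, and `crawl`, the `check_external` flag, and the `max_pages` limit are all made up for illustration. It only follows `<a href>` and `<img src>`, and it reports the HTTP status and byte size ("weight") of each URL it visits.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.error import HTTPError, URLError
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkExtractor(HTMLParser):
    """Collects link targets from <a href> and <img src> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "img" and attrs.get("src"):
            self.links.append(attrs["src"])


def fetch(url):
    """Return (status, content_type, body); status is None if the connection failed."""
    try:
        with urlopen(url, timeout=10) as resp:
            return resp.status, resp.headers.get_content_type(), resp.read()
    except HTTPError as err:
        return err.code, "", b""
    except URLError:
        return None, "", b""


def crawl(start_url, fetcher=fetch, check_external=False, max_pages=200):
    """Breadth-first crawl. Returns {url: (status, size_in_bytes)}."""
    site = urlparse(start_url).netloc
    queue = deque([start_url])
    report = {}
    while queue and len(report) < max_pages:
        url = queue.popleft()
        if url in report:
            continue
        internal = urlparse(url).netloc == site
        if not internal and not check_external:
            continue
        status, ctype, body = fetcher(url)
        report[url] = (status, len(body))
        # External pages are checked (if requested) but never crawled further.
        if internal and status == 200 and ctype == "text/html":
            parser = LinkExtractor()
            parser.feed(body.decode("utf-8", errors="replace"))
            for link in parser.links:
                absolute = urldefrag(urljoin(url, link)).url
                if absolute.startswith(("http://", "https://")):
                    queue.append(absolute)
    return report
```

Call `crawl("http://www.example.com/")` and print every entry whose status isn't 200 to get a broken-link list. The `fetcher` argument is there so you can swap in a fake for testing without hitting the network.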