I'm trying to save the raw HTML of each crawled page to its own file, named after the page's URL. Is it possible to have Nutch save the raw HTML pages to separate files while skipping the indexing step?
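To make the "one file per page, named after the URL" part concrete, here is a minimal sketch in plain Java. It does not use any Nutch API: it assumes the raw HTML bytes and the page URL have already been obtained somehow, and the class and method names (`RawHtmlSaver`, `fileNameFor`, `save`) are hypothetical, chosen only for illustration.

```java
import java.io.IOException;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

public class RawHtmlSaver {

    // Hypothetical helper: percent-encode the URL so that characters such as
    // ':' and '/' cannot be mistaken for path separators in the file name.
    // Requires Java 10+ for the URLEncoder.encode(String, Charset) overload.
    static String fileNameFor(String url) {
        return URLEncoder.encode(url, StandardCharsets.UTF_8) + ".html";
    }

    // Hypothetical helper: write the raw bytes of one page to its own file
    // inside outputDir, creating the directory if it does not exist yet.
    static Path save(Path outputDir, String url, byte[] rawHtml) throws IOException {
        Files.createDirectories(outputDir);
        Path target = outputDir.resolve(fileNameFor(url));
        return Files.write(target, rawHtml);
    }

    public static void main(String[] args) throws IOException {
        // Illustrative input only; in practice the bytes would come from the crawl.
        byte[] html = "<html><body>hello</body></html>".getBytes(StandardCharsets.UTF_8);
        Path written = save(Path.of("raw-html"), "http://example.com/index.html", html);
        System.out.println("Wrote " + written);
    }
}
```

Percent-encoding keeps the mapping from URL to file name reversible, but very long URLs can exceed the file system's name length limit, and `URLEncoder` leaves `*` unencoded, which is not a legal file name character on Windows. Hashing the URL and keeping a separate URL-to-hash index would be one alternative.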
Tejas Patil
Source: https://stackoverflow.com/questions/10142592/nutch-raw-html-saving