问题
I want to write a Java code to search in a certain website "news paper website". Each result in the website appear in a specified URL. How can I start? Is there a good library to use? Could I benefit from your experience in this field?
回答1:
Search for "web crawler" and you'll find many examples (e.g. Crawler4J or Crawler), how to solve this.
Besides Java, you'll often stumple upon Python when it comes to grepping stuff from web pages - I'm not a Python guy, but it seems to fit for the task.
来源:https://stackoverflow.com/questions/6826858/a-java-program-to-search-in-a-certain-website