Java Web Crawler Libraries

前端 未结 12 1086
栀梦
栀梦 2020-12-13 04:58

I wanted to make a Java based web crawler for an experiment. I heard that making a Web Crawler in Java was the way to go if this is your first time. However, I have two impo

12条回答
  •  悲哀的现实
    2020-12-13 05:36

    Right now there is a inclusion of many java based HTML parser that support visiting and parsing the HTML pages.

    • Jsoup
    • Jaunt API
    • HtmlCleaner
    • JTidy
    • NekoHTML
    • TagSoup

    Here's the complete list of HTML parser with basic comparison.

提交回复
热议问题