What HTML parsing libraries do you recommend in Java [closed]

隐身守侯 提交于 2019-11-26 06:37:30

问题


I want to parse some HTML in order to find the values of some attributes/tags etc.

What HTML parsers do you recommend? Any pros and cons?


回答1:


NekoHTML, TagSoup, and JTidy will allow you to parse HTML and then process with XML tools, like XPath.




回答2:


I have tried HTML Parser which is dead simple.




回答3:


Do you need to do a full parse of the HTML? If you're just looking for specific values within the contents (a specific tag/param), then a simple regular expression might be enough, and could very well be faster.



来源:https://stackoverflow.com/questions/26638/what-html-parsing-libraries-do-you-recommend-in-java

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!