Is there a parser/library that can read an HTML document into a DOM tree using Java? I'd like to use the standard DOM/XPath API that Java provides.
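TagSoup can do what you want. It is a SAX-compliant parser for real-world (often malformed) HTML, so its output can be collected into a standard org.w3c.dom tree and queried with javax.xml.xpath.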
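Here is a rough sketch of the wiring, not a definitive implementation. It pipes any SAX `XMLReader` through the JDK's identity `Transformer` into a `DOMResult`, then runs an XPath expression over the resulting tree. The class name `HtmlToDom` and the helpers `toDom`, `xpath` and `jdkReader` are made up for illustration. So that the snippet runs without extra jars, `main` uses the JDK's own XML reader as a stand-in, which only copes with well-formed markup. For real HTML you would, assuming the TagSoup jar is on the classpath, pass `new org.ccil.cowan.tagsoup.Parser()` instead.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMResult;
import javax.xml.transform.sax.SAXSource;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;

public class HtmlToDom {

    // Feeds the reader's SAX events through an identity transform into a DOM tree.
    // Checked exceptions are wrapped to keep the sketch short.
    public static Node toDom(XMLReader reader, String markup) {
        try {
            DOMResult result = new DOMResult();
            InputSource input = new InputSource(new StringReader(markup));
            TransformerFactory.newInstance().newTransformer()
                    .transform(new SAXSource(reader, input), result);
            return result.getNode();
        } catch (Exception e) {
            throw new IllegalStateException("could not build DOM", e);
        }
    }

    // Evaluates an XPath expression and returns its string value.
    public static String xpath(Node dom, String expression) {
        try {
            return XPathFactory.newInstance().newXPath().evaluate(expression, dom);
        } catch (Exception e) {
            throw new IllegalStateException("could not evaluate XPath", e);
        }
    }

    // Stand-in reader from the JDK (well-formed input only).
    // Assumption: with TagSoup available, replace this with
    // new org.ccil.cowan.tagsoup.Parser(), which is also an XMLReader.
    public static XMLReader jdkReader() {
        try {
            SAXParserFactory factory = SAXParserFactory.newInstance();
            factory.setNamespaceAware(true);
            return factory.newSAXParser().getXMLReader();
        } catch (Exception e) {
            throw new IllegalStateException("could not create XMLReader", e);
        }
    }

    public static void main(String[] args) {
        Node dom = toDom(jdkReader(), "<html><body><p>Hello</p><p>World</p></body></html>");
        System.out.println(xpath(dom, "/html/body/p[1]"));
    }
}
```

One thing to watch for: as far as I remember, TagSoup reports elements in the XHTML namespace by default, so unprefixed expressions like `/html/body/p` may match nothing unless you either register a namespace context on the XPath object or switch off the reader's namespaces feature with `setFeature` before parsing.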