I need to parse HTML 4 in Java. Ideally I\'d like an implementation that is SAX compatible.
I\'m aware that there are numerous HTML parsers in for Java, however, the
You can find a collection of HTML parsers here HTML Parsers. I don't remeber exactly but I think TagSoup parses the file without applying corrections...