tags-夕颜的回答
tags
I want to extract texts from HTML page(s) which placed in p and li tags, so I can start to tokenize the page to construct inverted index(es) for ea
p
li
String testText1 = d.select("body").text(); System.out.println(testText);
or
String testText2 = d.select("body p").text(); System.out.println(testText);
You can use this for getting the text from tags.