Jsoup selector on RSS <link> tag returns empty string with .text() method

孤街浪徒 2020-12-21 06:20

I'm using Jsoup to parse an RSS feed in Java. I'm having trouble getting a result when I try to select the first <link> element in the document: calling .text() on it returns an empty string.

2 answers
  •  清酒与你
    2020-12-21 06:57

    Your RSS feed is XML, not HTML. For this to work, you must tell Jsoup to use its XML parser (Parser.xmlParser()) instead of the default HTML parser. This will work:

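    String rss = "<rss>"
      + "<channel>"
      +   "<title>The Blog Title</title>"
      +   "<link>http://www.the.blog/category</link>"
      + "</channel></rss>";
    
    // Third argument selects the XML parser; the default would be the HTML parser
    Document doc = Jsoup.parse(rss, "", Parser.xmlParser());
    
    Element link = doc.select("rss channel link").first();
    System.out.println(link.text()); // prints http://www.the.blog/category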
    

    Explanation:

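    The link tag in HTML follows a different format: it is a void element with no closing tag and no text content. Jsoup's default HTML parser tries to interpret the <link> of your RSS as such an HTML tag, so the URL ends up outside the element and .text() returns an empty string.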
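
To see the difference described in the explanation, here is a minimal sketch that parses the same feed with both parsers. It assumes Jsoup is on the classpath. The class name RssLinkDemo and its helper methods are made up for illustration, and the tag names in the sample feed are inferred from the selector rather than taken from a real feed.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.parser.Parser;

// Hypothetical demo class, not part of Jsoup.
final class RssLinkDemo {
    // Sample feed; the tag names are an assumption based on the selector used above.
    static final String RSS =
          "<rss><channel>"
        + "<title>The Blog Title</title>"
        + "<link>http://www.the.blog/category</link>"
        + "</channel></rss>";

    // Parses with Jsoup's default HTML parser, which treats <link> as an empty element.
    static String linkTextAsHtml(String rss) {
        Document doc = Jsoup.parse(rss);
        Element link = doc.select("rss channel link").first();
        return link == null ? null : link.text();
    }

    // Parses with the XML parser, which keeps the URL as the text of <link>.
    static String linkTextAsXml(String rss) {
        Document doc = Jsoup.parse(rss, "", Parser.xmlParser());
        Element link = doc.select("rss channel link").first();
        return link == null ? null : link.text();
    }

    public static void main(String[] args) {
        System.out.println("HTML parser: [" + linkTextAsHtml(RSS) + "]");
        System.out.println("XML parser:  [" + linkTextAsXml(RSS) + "]");
    }
}
```

With the HTML parser the brackets should come out empty, which matches the behavior in the question. With the XML parser they should contain the feed URL.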
