Can get text of a <link></link> node #51

freewind · 2010-11-22T04:36:24Z

    String html = "<link>http://www.google.com</link><link1>http://link1.com</link1>";
    Document doc = Jsoup.parse(html);
    String link = doc.select("link").first().text();
    System.out.println("Link: " + link);
    String link1 = doc.select("link1").first().text();
    System.out.println("Link1: " + link1);

The result is :

    Link: 
    Link1: http://link1.com

It seems the content of "" node is ignored

The text was updated successfully, but these errors were encountered:

jhy · 2010-11-22T05:10:04Z

That's right, link is defined as an empty tag in HTML. Jsoup parses link tags with text content the same was as the browsers do (specifically, the text appears after the node, and if the link was in the head, the text is moved to the body).

freewind · 2010-11-22T10:37:00Z

That's a pitty. I hope I can use JSoup to parse XML as well as HTML. The api is so easy to use.
Will JSoup have a version can parse and query XML files?

jhy · 2010-11-22T22:24:20Z

Yep I've been thinking about that, seems like it would be a good idea to implement.

freewind · 2010-11-23T05:03:47Z

Great, hope it soon.

hakos · 2011-01-11T05:09:48Z

Yes, it would be great with XML support.

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can get text of a <link></link> node #51

Can get text of a <link></link> node #51

freewind commented Nov 22, 2010

jhy commented Nov 22, 2010

freewind commented Nov 22, 2010

jhy commented Nov 22, 2010

freewind commented Nov 23, 2010

hakos commented Jan 11, 2011

Can get text of a <link></link> node #51

Can get text of a <link></link> node #51

Comments

freewind commented Nov 22, 2010

jhy commented Nov 22, 2010

freewind commented Nov 22, 2010

jhy commented Nov 22, 2010

freewind commented Nov 23, 2010

hakos commented Jan 11, 2011