java - Get data from table(html) except div tag by jsoup -
i have html code:
<table width="100%" cellpadding="5" cellspacing="2" class="zebra"> <tr> <td colspan="5"> <div class="paginator"> <a href="http://some_link">2</a> </div> </td> </tr> <tr> <td><a href="//i_need_only_this_link">some_value</a></td> </tr> <tr> <td><a href="//i_need_only_this_link1">some_value</a></td> </tr> <tr> <td colspan="2"> <div class="paginator"> <a href="http://some_link">2</a> </div> </td> </tr> </table>
i use jsoup. how can links except links in div tag? try this, doesn't work. element contains links.
org.jsoup.nodes.elements tabletext = doc.select("table.zebra").not("tr td div.paginator"); (org.jsoup.nodes.element td : tabletext.select("td a")) { system.out.println(td.attr("href")); // http://some_link .... }
you can use below code..
document html = jsoup.parse(htmlstr); (element e : html.getelementsbytag("a")) { if (!"div".equalsignorecase(e.parentnode().nodename())) { system.out.println(e.attr("href")); } }
here checking parent node of anchor element not div. if not div printing url.
Comments
Post a Comment