我正在阅读网页中的内容,然后使用Jsoup解析器帮助解析它,以仅获取正文部分中存在的超链接。我得到的输出:从Java中获取给定字符串的子串
<a href="/sports/sports.asp" style="TEXT-DECORATION: NONE"><font color="#0000FF">Sports</font></a>
<a href="/titanic/titanic.asp" style="TEXT-DECORATION: NONE"><font color="#0000FF">Titanic</font></a>
<a href="gastheft.asp" onmouseover="window.status='License Plate Theft';return true" onmouseout="window.status='';return true">license plates</a>
<a href="miracle.asp" onmouseover="window.status='Miracle Cars';return true" onmouseout="window.status='';return true">miracle cars</a>
<a href="/crime/warnings/clear.asp" onmouseover="window.status='Clear Loss';return true" onmouseout="window.status='';return true" target="clear">Clear</a>
and even more hyperlinks.
从所有的人,所有我感兴趣的是像
/sports/sports.asp
/titanic/titanic.asp
gastheft.asp
miracle.asp
/crime/warnings/clear.asp
我怎样才能做到这一点使用字符串或有任何其他方式或方法将数据使用Jsoup Parser本身提取这些信息?
http://jsoup.org/cookbook/extracting-data/attributes-text-html – helderdarocha