2015-10-12 255 views
2

我想获得“href”,但不能。如何从标签获得“href”

我有这样的代码:

<h3 class="list-item-hd"> 
    <a href="/music/news/dwight-yoakam-and-jack-black-to-produce-new-nashville-comedy-20151009">Dwight Yoakam and Jack Black to Produce New Nashville Comedy</a> 
</h3> 

我怎样才能得到?

/music/news/dwight-yoakam-and-jack-black-to-produce-new-nashville-comedy-20151009 

回答

1

该代码使用CSS选择器来获取所有锚元素和打印他们href属性:

require 'nokogiri' 

html = <<EOT 
<html> 
    <h3 class="list-item-hd"> 
     <a href="/music/news/dwight-yoakam-and-jack-black-to-produce-new-nashville-comedy-20151009">Dwight Yoakam and Jack Black to Produce New Nashville Comedy</a> 
    </h3> 
</html> 
EOT 

doc = Nokogiri::HTML(html) 
doc.css('a').map { |link| p link['href'] } 
# => "/music/news/dwight-yoakam-and-jack-black-to-produce-new-nashville-comedy-20151009" 

您可以使用CSS或XPath选择通过或者实现自己的目标。要了解有关CSS和XPath选择器的更多信息,我强烈建议您查看“Official Nokogiri Documentation for Searching a XML/HTML Document.