I am using the markup below as input to a DOMDocument and extracting data from it with XPath:
<li id="SalesRank">
<b>Amazon Best Sellers Rank:</b>
#20,267 Paid in Kindle Store (
<a href="http://www.amazon.com/gp/bestsellers/digital-text/ref=pd_dp_ts_kstore_1/190-9295683-0277616">See Top 100 Paid in Kindle Store</a>
)
<ul class="zg_hrsr">
<li class="zg_hrsr_item">
<span class="zg_hrsr_rank">#15</span>
<span class="zg_hrsr_ladder">
in
<a href="http://www.amazon.com/gp/bestsellers/digital-text/ref=pd_zg_hrsr_kstore_1_1">Kindle Store</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/154606011">Kindle eBooks</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/157325011">Nonfiction</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/292975011">Lifestyle & Home</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/156699011">Home & Garden</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/156828011">Gardening & Horticulture</a>
>
<b>
<a href="http://rads.stackoverflow.com/amzn/click/156847011">Greenhouses</a>
</b>
</span>
</li>
<li class="zg_hrsr_item">
<span class="zg_hrsr_rank">#26</span>
<span class="zg_hrsr_ladder">
in
<a href="http://www.amazon.com/gp/bestsellers/digital-text/ref=pd_zg_hrsr_kstore_2_1">Kindle Store</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/154606011">Kindle eBooks</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/157325011">Nonfiction</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/292975011">Lifestyle & Home</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/156699011">Home & Garden</a>
>
<a href="http://rads.stackoverflow.com/amzn/click/156828011">Gardening & Horticulture</a>
>
<b>
<a href="http://rads.stackoverflow.com/amzn/click/156849011">House Plants</a>
</b>
</span>
</li>
</ul></li>
I am using the following XPath query to extract the data via textContent:
$xpath_cat->query('//li[@id="SalesRank"]');
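If you check the output, it includes all the data contained in the li tag with id="SalesRank", whereas I only want to get "#20,267 Paid in Kindle Store".
So the desired output is
#20,267 Paid in Kindle Store
How do I modify my XPath to get the desired output?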
Update: I tried the solution provided below and used this XPath in my code:
$xpath_cat->query('//li[@id="SalesRank"]/text()');
But now the output is
( [0] => [1] => #20,267 Paid in Kindle Store ( [2] => )
How do I fix this?
Still failing... – 2012-02-12 21:04:12
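One possible way to get rid of the empty entries, offered as a sketch rather than a confirmed fix: `text()` returns every direct text node of the li, including whitespace-only ones and the lone ")" after the link. Adding a `[normalize-space()]` predicate keeps only text nodes with visible content, and `[1]` picks the first of those. The trimmed-down markup, the `$rank` variable, and the `rtrim` cleanup of the trailing " (" are assumptions made for illustration; `$xpath_cat` is the name used in the question.

```php
<?php
// Trimmed-down copy of the markup from the question (assumption: the real
// page has the same structure around the SalesRank item).
$html = <<<'HTML'
<li id="SalesRank">
<b>Amazon Best Sellers Rank:</b>
#20,267 Paid in Kindle Store (
<a href="http://www.amazon.com/gp/bestsellers/digital-text/">See Top 100 Paid in Kindle Store</a>
)
<ul class="zg_hrsr"><li class="zg_hrsr_item"><span class="zg_hrsr_rank">#15</span></li></ul></li>
HTML;

$doc = new DOMDocument();
libxml_use_internal_errors(true);   // silence warnings about the HTML fragment
$doc->loadHTML($html);
libxml_clear_errors();

$xpath_cat = new DOMXPath($doc);

// Keep only direct text nodes that contain non-whitespace, then take the first.
$nodes = $xpath_cat->query('//li[@id="SalesRank"]/text()[normalize-space()][1]');

$rank = '';
if ($nodes !== false && $nodes->length > 0) {
    // Strip surrounding whitespace, then the trailing " (" left over
    // from the "(See Top 100 ...)" link that follows the rank.
    $rank = rtrim(trim($nodes->item(0)->nodeValue), ' (');
}

echo $rank, PHP_EOL;
```

The same cleanup could also be done in PHP alone by looping over the original `text()` result and skipping entries whose trimmed value is empty; the predicate simply moves that filtering into the query.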