Filtering/excluding parts of an XPath extraction. This is the markup I'm working with:
<div class="Pictures zoom">
<a title="Productname 1" class="zoomThumbActive" rel="{gallery: 'gallery1', smallimage: '/images/2.24198/little_one.jpeg', largeimage: '/images/76.24561/big-one-picture.jpeg'}" href="javascript:void(0)" style="border-width:inherit;">
<img title="Productname 1" src="/images/24.245/mini-doge-picture.jpeg" alt="" /></a>
<a title="Productname 1" rel="{gallery: 'gallery1', smallimage: '/images/2.24203/small_one.jpeg', largeimage: '/images/9.5664/very-big-one-picture.jpeg'}" href="javascript:void(0)" style="border-width:inherit;">
<img title="Productname 1" src="/images/22.999/this-picture-is-very-small.jpeg" alt="" /></a>
</div>
Using the following XPath:
/html//div[@class='Pictures zoom']/a/@rel
the output is:
{gallery: 'gallery1', smallimage: '/images/2.24198/little_one.jpeg', largeimage: '/images/76.24561/big-one-picture.jpeg'}
{gallery: 'gallery1', smallimage: '/images/2.24203/small_one.jpeg', largeimage: '/images/9.5664/very-big-one-picture.jpeg'}
Is it possible to filter the extraction so that, instead of the above, I get only these:
/images/76.24561/big-one-picture.jpeg
/images/9.5664/very-big-one-picture.jpeg
I just want to cut away all the parts I don't want, using substring-before and substring-after, and keep only what lies between
largeimage: '
and
'}
Best regards,
Sadly, I can't use XPath 2.0, but this is what works best for me. Thanks!
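For readers following along, here is a minimal sketch of the substring-after / substring-before idea. In XPath 1.0 the nested call would look like substring-before(substring-after(@rel, "largeimage: '"), "'}"), evaluated once per a element, because XPath 1.0 string functions applied to a node-set of several nodes only use the first node. Since a bare XPath expression can't be run on its own, the sketch below emulates the two functions in Python using only the standard library. The names RelCollector, substring_after, substring_before and large_images are hypothetical helpers made up for illustration, and the div-depth tracking is only an approximation of the /a child step in the original XPath.

```python
from html.parser import HTMLParser

# Sample markup from the question (style attributes omitted for brevity).
HTML = """<div class="Pictures zoom">
<a title="Productname 1" class="zoomThumbActive" rel="{gallery: 'gallery1', smallimage: '/images/2.24198/little_one.jpeg', largeimage: '/images/76.24561/big-one-picture.jpeg'}" href="javascript:void(0)">
<img title="Productname 1" src="/images/24.245/mini-doge-picture.jpeg" alt="" /></a>
<a title="Productname 1" rel="{gallery: 'gallery1', smallimage: '/images/2.24203/small_one.jpeg', largeimage: '/images/9.5664/very-big-one-picture.jpeg'}" href="javascript:void(0)">
<img title="Productname 1" src="/images/22.999/this-picture-is-very-small.jpeg" alt="" /></a>
</div>"""


def substring_after(text, marker):
    """Emulate XPath 1.0 substring-after: '' when the marker is absent."""
    if marker == "":
        return text
    _, found, tail = text.partition(marker)
    return tail if found else ""


def substring_before(text, marker):
    """Emulate XPath 1.0 substring-before: '' when the marker is absent."""
    if marker == "":
        return ""
    head, found, _ = text.partition(marker)
    return head if found else ""


class RelCollector(HTMLParser):
    """Collect rel attributes of <a> tags inside div.'Pictures zoom'."""

    def __init__(self):
        super().__init__()
        self.div_depth = 0  # 0 means we are outside the target div
        self.rels = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "div":
            # Enter the target div, or track divs nested inside it.
            if self.div_depth or attrs.get("class") == "Pictures zoom":
                self.div_depth += 1
        elif tag == "a" and self.div_depth == 1 and attrs.get("rel"):
            self.rels.append(attrs["rel"])

    def handle_endtag(self, tag):
        if tag == "div" and self.div_depth:
            self.div_depth -= 1


def large_images(html):
    """Return the largeimage path from each collected rel attribute."""
    collector = RelCollector()
    collector.feed(html)
    collector.close()
    return [
        substring_before(substring_after(rel, "largeimage: '"), "'}")
        for rel in collector.rels
    ]


if __name__ == "__main__":
    for path in large_images(HTML):
        print(path)
```

Applying the two cuts per element rather than to the whole node-set is the important part: it is what lets both image paths come out, which a single XPath 1.0 string expression over all the a elements would not do.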