出HTML中提取的值我有这一块的HTML(代码不同):不能与LXML
<span class="ng-binding"> <b>Total:</b> 68.71€ (459 items) </span>
出于此,我想提取68.71€ (459 items)
我试着用这段代码到目前为止,只是将xpath复制到上面显示的span class中,直接从Chrome浏览器中取出:
import urllib.request
from lxml import html
import os
ids = ["ftpstorage1-730",
"ftpstorage2-730",
"ftpstorage3-730"]
for id in ids:
url = 'http://steam.tools/itemvalue/#/'+id
with urllib.request.urlopen(url) as response:
site = response.read()
tree = html.fromstring(site)
data = tree.xpath('//*[@id="container"]/div[5]/span[1]/text()')
print(data)
理论上这应该工作,但它不会,所有我得到的data
是:
[" {{(items | filter:dupesFilter | filter:typeFilter | filter:filterText | sumByKey:'price':'count':
e}}\n\t\t\t\t({{items | filter:dupesFilter | filter:typeFilter | filter:filterText | sumByKey:'count
[" {{(items | filter:dupesFilter | filter:typeFilter | filter:filterText | sumByKey:'price':'count':
e}}\n\t\t\t\t({{items | filter:dupesFilter | filter:typeFilter | filter:filterText | sumByKey:'count
[" {{(items | filter:dupesFilter | filter:typeFilter | filter:filterText | sumByKey:'price':'count':
e}}\n\t\t\t\t({{items | filter:dupesFilter | filter:typeFilter | filter:filterText | sumByKey:'count
任何知道我做错了吗?
它是否与正在生成的数字有关而不是静态的?
如果是这样,我该如何提取数字?
数据不在源代码中,如果您通过'requests'获取总数,则会动态生成 –
@PadraicCunningham奖励积分。 :) – alecxe
@alecxe,你打败我吧! –