找不到标签和正则表达式BeautifulSoup

给出的例子HTML字符串找不到标签和正则表达式BeautifulSoup

<table> 
<tr> 
<td class="td" height="25">Upstream Power</td> 
<td class="td">25.2 dBmV</td> 
<td class="td">49.2 dBmV</td> 
</tr> 
</table>

我可以用查找文本：

soup.find_all(text=re.compile("Power"))

但要找到整个标签没有找到任何东西。我错过了什么？

soup.find_all("td",text=re.compile("Power"))

来源

2012-03-22 CptanPanic

在BS3，该方法是findAll不find_all：

>>> markup = '''<table> 
... <tr> 
... <td class="td" height="25">Upstream Power</td> 
... <td class="td">25.2 dBmV</td> 
... <td class="td">49.2 dBmV</td> 
... </tr> 
... </table>''' 
>>> from BeautifulSoup import BeautifulSoup as bs 
>>> soup = bs(markup) 
>>> import re 
>>> soup.findAll(text=re.compile('Power')) 
... [u'Upstream Power']

编辑：我看到，该方法was renamed in BS4。这似乎为我工作确定：

>>> from bs4 import BeautifulSoup as bs 
>>> soup = bs(markup) 
>>> soup.find_all(text=re.compile('Power')) 
... [u'Upstream Power']

EDIT2：为了使导航解析树更容易，你可以use tag names：

>>> soup.td.find_all(text=re.compile('Power')) 
... [u'Upstream Power']

来源

2012-03-22 18:06:37 bernie

是soup.find_all（文= re.compile（”电源'））也适用于我，但为什么不soup.find_all（“td”，text = re.compile（“Power”））。我需要这样的东西，因为我想从这一点开始遍历树。 – CptanPanic 2012-03-22 19:24:10

@CptanPanic：啊，我错过了。我用更好的方式编辑答案，以获得你想要的元素。 – bernie 2012-03-22 19:38:37

找不到标签和正则表达式BeautifulSoup

回答

相关问题