如何解析文件？

以下代码解析字符串。如何解析文件？

import lxml.html 

doc = lxml.html.document_fromstring("""<html> 
<body> 
    <span class="simple_text">One</span> tehfghhxt</br> 
    <span class="cyrillic_text">Второй</span> cyrcyrcyr</br> 
</body> 
</html> 
""") 

txt1 = doc.xpath('/html/body/span[@class="simple_text"]/text()[1]') 
print(txt1)

我想知道如何解析包含相同字符串的文件。并在同一个目录中。请告诉我。

来源

2014-02-25 Sergey

从某种意义上说，问题很明显，从初学者可以预料什么。这个答案在http://lxml.de/parsing.html#parsing-html的'lxml'文档中明确显示。 'StringIO（）'是如何使字符串充当开放文件的方式。您可以将打开的文件对象传递给'parse（）'函数。它甚至可能会接受文件的名称，并会为您打开它。 – pepr

string = open('file_with_string.html', 'r').read() 
doc = lxml.html.document_fromstring(string)

来源

2014-02-25 17:54:19 chishaku

如何解析文件？

回答

相关问题