0
假设$html_dom
包含一个具有HTML实体的页面,如 
。在下面的输出中,我得到这样的输出 
。PHP的HTML DOM,XPATH - 奇怪的字符?
$html_dom = new DOMDocument();
@$html_dom->loadHTML($html_doc);
$xpath = new DOMXPath($html_dom);
$query = '//div[@class="foo"]/div/p';
$my_foos = $xpath->query($query_abstract);
foreach ($my_foos as $my_foo)
{
echo html_entity_decode($my_foos->nodeValue);
die;
}
我如何妥善处理这让我没有得到怪异字符?我尝试没有成功如下:
$html_doc = mb_convert_encoding($html_doc, 'HTML-ENTITIES', 'UTF-8');
$html_dom = new DOMDocument();
$html_dom->resolveExternals = TRUE;
@$html_dom->loadHTML($html_doc);
$xpath = new DOMXPath($html_dom);
$query = '//div[@class="foo"]/div/p';
$my_foos = $xpath->query($query);
foreach ($my_foos as $my_foo)
{
echo html_entity_decode($my_foos->nodeValue);
die;
}
证实它有效。谢谢。 – StackOverflowNewbie