PHP: extracting data from a website
I want to get all the <p> elements of the first joke, so basically I wrote this script:
<?php
$url = "http://sms.hindijokes.co";
$html = file_get_contents($url);

// Parse leniently; the page is not guaranteed to be well-formed HTML.
$doc = new DOMDocument;
$doc->strictErrorChecking = false;
$doc->recover = true;
@$doc->loadHTML("<html><body>" . $html . "</body></html>");

$xpath = new DOMXPath($doc);
$query1 = "//h2[@class='entry-title']/a";       // joke titles
$query2 = "//div[@class='entry-content']/p";    // paragraphs of every joke
$entries1 = $xpath->query($query1);
$entries2 = $xpath->query($query2);

// Title of the first joke
$var1 = $entries1->item(0)->textContent;
echo $var1;
echo "<br>";

// Hard-coded: assumes the first joke has exactly five <p> elements
$f = 5;
for ($i = 0; $i < $f; $i++) {
    echo $entries2->item($i)->textContent . "\n";
}
?>
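This time I know that the first joke contains five <p> elements, but I want the script to work automatically. Sometimes there will be more or fewer than five elements, and a hard-coded count would then produce a mess.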
See http://stackoverflow.com/questions/6366351/getting-dom-elements-by-classname
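
One possible way to avoid the hard-coded count is to narrow the XPath query to the first entry-content div and iterate over whatever it returns. The sketch below is only an illustration under assumptions: the inline sample markup is hypothetical and stands in for the real page, and it assumes each joke sits in its own div with class entry-content, as the queries in the question suggest.

```php
<?php
// Hypothetical sample markup standing in for the real page. The class name
// comes from the queries in the question; the content itself is made up.
$html = <<<HTML
<div class="entry-content"><p>Line one</p><p>Line two</p><p>Line three</p></div>
<div class="entry-content"><p>Another joke</p></div>
HTML;

$doc = new DOMDocument();
libxml_use_internal_errors(true);   // collect parse warnings instead of printing them
$doc->loadHTML($html);
libxml_clear_errors();

$xpath = new DOMXPath($doc);

// Select only the first entry-content div, then every <p> directly inside it.
$paragraphs = $xpath->query("(//div[@class='entry-content'])[1]/p");

$lines = [];
foreach ($paragraphs as $p) {       // runs once per matched <p>, however many there are
    $lines[] = trim($p->textContent);
}

echo implode("\n", $lines), "\n";
```

If the existing for loop is preferred, the upper bound could be taken from the node list itself (`$paragraphs->length`) rather than from a fixed number.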