php - Handling errors in Simple HTML DOM

Question

I have some code that fetches publicly available data from a website:

//Array of params

foreach($params as $par){

    $html = file_get_html('WEBSITE.COM/$par');

    $name = $html->find('div[class=name]');
    $link = $html->find('div[class=secondName]');

    foreach($link as $i => $result2)
    {
        $var = $name[$i]->plaintext;
        echo $result2->href,"<br>";
        //Insert to database
    }
}

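So on each pass through the loop, it requests the given website with a different parameter in the URL. Whenever a 404 comes up or the server is temporarily unavailable, I keep getting errors that break the script. I have tried code that checks the headers first, and code that checks whether $html is an object, but I still get the errors. Is there a way to skip the failing requests, leave them out, and carry on with the script?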

The code I tried for checking the headers:

function url_exists($url){
    if ((strpos($url, "http")) === false) $url = "http://" . $url;
    $headers = @get_headers($url);
    //print_r($headers);
    if (is_array($headers)){
        //Check for http error here....should add checks for other errors too...
        if(strpos($headers[0], '404 Not Found'))
            return false;
        else
            return true;
    }
    else
        return false;
}

The code I tried for checking the object:

if (method_exists($html,"find")) {
 // then check if the html element exists to avoid trying to parse non-html
 if ($html->find('html')) {
      // and only then start searching (and manipulating) the dom

score 1 · Accepted Answer

You need to be more specific: what kind of errors are you getting? Which line fails?

Edit: since you have now specified the errors you are getting, here is what you can do:

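I noticed you are using single quotes around a string that contains a variable. That does not work, because PHP does not substitute variables inside single-quoted strings. Use double quotes instead, i.e.: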

$html = file_get_html("WEBSITE.COM/$par");

Maybe that is the problem?
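
To make the quoting difference concrete, here is a minimal sketch. The value assigned to $par is made up purely for illustration; only the quoting behaviour matters.

```php
<?php
// Hypothetical value, used only to illustrate interpolation.
$par = 'page1';

$single = 'WEBSITE.COM/$par'; // single quotes: $par is kept as literal text
$double = "WEBSITE.COM/$par"; // double quotes: $par is replaced by its value

echo $single, "\n"; // prints WEBSITE.COM/$par
echo $double, "\n"; // prints WEBSITE.COM/page1
```

With single quotes every iteration of the loop would request the same literal URL, which could well explain repeated 404 responses.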

Also, you could use file_get_contents() and check that it did not return false:

if (file_get_contents("WEBSITE.COM/$par") !== false) {
  ...
}
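
The check above could be folded into the loop roughly as follows. This is only a sketch: fetch_body() and fetch_all() are hypothetical helper names, not part of PHP or Simple HTML DOM, and the @ operator is used here simply to silence the warning that file_get_contents() raises when a request fails.

```php
<?php
// Hypothetical helper: returns the page body, or null if the fetch failed.
function fetch_body(string $url): ?string
{
    // @ suppresses the warning file_get_contents() emits on failure
    // (for example a 404 response or an unreachable host).
    $body = @file_get_contents($url);

    // Treat both a failed read and an empty body as "nothing to parse".
    return ($body === false || $body === '') ? null : $body;
}

// Hypothetical helper: fetches every URL and keeps only the successful ones.
function fetch_all(array $urls): array
{
    $pages = [];
    foreach ($urls as $url) {
        $body = fetch_body($url);
        if ($body === null) {
            continue; // skip this one and carry on with the rest
        }
        $pages[$url] = $body;
    }
    return $pages;
}
```

Each body that survives could then be handed to Simple HTML DOM's str_get_html() instead of calling file_get_html() directly, checking that the result is an object before calling find() on it.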
