html - Parsing img from an RSS feed using PHP Simple HTML DOM Parser

Question

I am trying to parse this site (to get the img links): http://statigr.am/feed/parishilton

Here is my code:

include 'parse/simple_html_dom.php';

// Create DOM from URL or file
$html = file_get_html('http://statigr.am/feed/parishilton/');

// Find all images
foreach($html->find('img') as $element)
{
       echo $element->src . '<br>';
}

The script returns nothing! Why is that? I want the img links.

score 0 · Accepted Answer

This is because all the images are inside CDATA sections, which the parser ignores, so the solution is:

$html = file_get_html('http://statigr.am/feed/parishilton/');
$html = str_replace("<![CDATA[","",$html); // strip the CDATA opening markers
$html = str_replace("]]>","",$html); // strip the CDATA closing markers
$html = str_get_html($html); // rebuild the DOM object from the cleaned string
// Loop over every img inside each item's description
foreach($html->find('item description img') as $el)
{
    echo $el->src . "<br />";
}

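Strip all the CDATA markers from the returned content, then use str_get_html to create a DOM object from that string and loop over the images. (Tested and working.)

Output: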

http://distilleryimage3.s3.amazonaws.com/cc25d8562c9611e3a8b922000a1f8ac2_8.jpg
http://distilleryimage7.s3.amazonaws.com/4d8e22da2c8911e3a6a022000ae81e78_8.jpg
http://distilleryimage5.s3.amazonaws.com/ce6aa38a2be711e391ae22000ae9112d_8.jpg
http://distilleryimage3.s3.amazonaws.com/d64ab4c42bc811e39cbd22000a1fafdb_8.jpg
......
......
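
For comparison, the same idea can be sketched with PHP's built-in DOM extension instead of Simple HTML DOM. This is a different technique from the answer above, not a replacement for it: an XML parser exposes the CDATA payload as plain text, and that text can then be parsed as HTML. The sample feed, the example.com URLs, and the function name extract_image_urls are assumptions made up for illustration; the real feed may be structured differently.

```php
<?php
// Hypothetical sample feed standing in for the real RSS response.
$rss = <<<'XML'
<rss version="2.0">
  <channel>
    <item>
      <description><![CDATA[<p><img src="http://example.com/a.jpg" alt=""></p>]]></description>
    </item>
    <item>
      <description><![CDATA[<p><img src="http://example.com/b.jpg" alt=""></p>]]></description>
    </item>
  </channel>
</rss>
XML;

// Hypothetical helper: collect the src of every <img> found inside
// the <description> elements of an RSS string.
function extract_image_urls(string $rss): array
{
    // Collect parse warnings internally instead of printing them.
    libxml_use_internal_errors(true);

    $urls = [];
    $feed = new DOMDocument();
    $feed->loadXML($rss);

    foreach ($feed->getElementsByTagName('description') as $description) {
        // textContent yields the CDATA payload as a plain HTML string.
        $markup = trim($description->textContent);
        if ($markup === '') {
            continue; // loadHTML() rejects empty input
        }

        // Parse that payload as HTML and read each img's src attribute.
        $fragment = new DOMDocument();
        $fragment->loadHTML($markup);
        foreach ($fragment->getElementsByTagName('img') as $img) {
            $urls[] = $img->getAttribute('src');
        }
    }

    libxml_clear_errors();
    return $urls;
}

foreach (extract_image_urls($rss) as $url) {
    echo $url . "<br />\n";
}
```

To try this against a live feed, the string passed to extract_image_urls could come from file_get_contents() on the feed URL, assuming URL fopen wrappers are enabled.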
