php - 如何在 PHP 中使用 snoopy 类？

Question

我是 php 的初学者，我正在制作简单的程序，并且使用了一些抓取网站（不是私人信息）。我期望的结果是 HTML CODE，就像一个

<html><head><title>blabla blabla</title></head>...................

但我检查了结果，屏幕出现了。不是原始代码，例如，

include "Snoopy.class.php";
$snoopy = new Snoopy;

$snoopy->fetch("http://stackoverflow.com/");
echo $snoopy->results;

如何获取 HTML 代码的信息？你有另一个很好的 PHP 解析库吗？（就像 Python 上的 beautifulsoup 和 Java 上的 Jsoup）

**以上代码的结果：不是html代码，而是屏幕**

不是html源代码，而是屏幕。

score 1 · Accepted Answer

要使用浏览器查看源代码而不是渲染 HTML，您的最后一行应该是：

echo htmlspecialchars($snoopy->results);

score 0 · Accepted Answer

如果你想从 URL 中获取 html，你可以通过 php 的file_get_contents函数简单地做到这一点。

$url = 'http://stackoverflow.com/';
$html = file_get_contents($url);
// echo $url -> wrong
echo $html;

score 0 · Accepted Answer

这很简单

     // Add snoopy class and initiate it
     require "snoopy/Snoopy.class.php";
     $snoopy = new Snoopy;         

    // THis fetches the html
    $snoopy->fetch("http://www.php.net/");
                  $text = $snoopy->results;
    // This fetches the text with html tags stripped
    $snoopy->fetchtext("http://www.php.net/");
                  $text = $snoopy->results;
    // This fetches all the links
    $snoopy->fetchlinks('http://www.php.net/');
    $linksarray = $snoopy->results;

史努比对我来说很棒。所以希望有帮助

php - 如何在 PHP 中使用 snoopy 类？

3 回答 3

Related

Reference