0

我制作了一个基本脚本来获取 html 页面和图像。但我想在检索图片时继续保持联系。
- 我对测试脚本所做的一直是在 Web 服务器上询问地址。但它没有用。
那么我该怎么做或者我做错了什么?

基本脚本:

function fetch( $server, $url, $port = 80, &$resp_head )
{
    if ( !$con = @fsockopen($server, $port, $error_number, $error_string, 30) ) return False;
    stream_set_timeout( $con, 8600 );

    // Reguest
    $qry_header = 'GET '.$url." HTTP/1.1\r\n";
    $qry_header .= 'Host: '.$server."\r\n";
    $qry_header .= 'User-Agent: Mihtyom; (+http://'.$_SERVER['HTTP_HOST']."/)\r\n";
    $qry_header .= "Accept: */*\r\n";           //google check the 3 "Accept"
    $qry_header .= "Accept-Language: *\r\n";
    $qry_header .= "Accept-Encoding: *\r\n";
    $qry_header .= "Connection: Close\r\n\r\n";
    fwrite( $con, $qry_header );

    $inheader = True;
    $data = '';
    while (!feof( $con ))
    {
        $line = fgets( $con );
        if (!$inheader)
        {
            $data .= $line;
            continue;
        }
        if ($line == "\n" || $line == "\r\n")
        {
            $inheader = False;
            continue;
        }
        $resp_head[] = trim($line);
    }

    fclose( $con );

    return $data;
}

测试脚本:

function fetch( $server, $urls, $port = 80, &$resp_head )
{
    if ( !$con = @fsockopen($server, $port, $error_number, $error_string, 30) ) return False;
    stream_set_timeout( $con, 8600 );

    $resp_head = array();
    $data = array();
    foreach ($urls as $key => $url)
    {
        $inheader = True;
        $data[$key] = '';

        // Reguest
        $qry_header = 'GET '.$url." HTTP/1.1\r\n";
        $qry_header .= 'Host: '.$server."\r\n";
        $qry_header .= 'User-Agent: Mihtyom; (+http://'.$_SERVER['HTTP_HOST']."/)\r\n";
        $qry_header .= "Accept: */*\r\n";           //google check the 3 "Accept"
        $qry_header .= "Accept-Language: *\r\n";
        $qry_header .= "Accept-Encoding: *\r\n";
        $qry_header .= "Connection: Close\r\n\r\n";
        fwrite( $con, $qry_header );

        while (!feof( $con ))
        {
            $line = fgets( $con );
            if (!$inheader)
            {
                $data[$key] .= $line;
                continue;
            }
            if ($line == "\n" || $line == "\r\n")
            {
                $inheader = False;
                continue;
            }
            $resp_head[$key][] = trim($line);
        }
    }
    fclose( $con );
    return $data;
}

新脚本(21-07-12 12:34):

function fetch( $server, $urls, $port = 80, &$resp_head )
{
    if ( !$con = @pfsockopen($server, $port, $error_number, $error_string, 30) ) return False;
    stream_set_timeout( $con, 8600 );

    $resp_head = array();
    $data = '';
    $i = count($urls);
    foreach ($urls as $key => $url )
    {
        $i--;

        // Reguest
        $qry_header = 'GET '.$url." HTTP/1.1\r\n";
        $qry_header .= 'Host: '.$server."\r\n";
        $qry_header .= 'User-Agent: Mihtyom; (+http://'.$_SERVER['HTTP_HOST']."/)\r\n";
        $qry_header .= "Accept: */*\r\n";
        $qry_header .= "Accept-Language: *\r\n";
        $qry_header .= "Connection: ".($i == 0 ? 'Close' : 'keep-alive')."\r\n\r\n";
        fwrite( $con, $qry_header );
    }

    while (!feof( $con ))
    {
        $line = fgets( $con );
        if ($line == "\r\n\r\n" || $line == "\n\n") break;
        $data .= $line;
    }
    fclose( $con );
    return $data;
}
4

1 回答 1

1

Connection: Close在 HTTP 标头中指定,它基本上告诉服务器:“给我这个文件,然后立即关闭连接”。您实际上是在强迫连接关闭自己。

相反,您可以指定Connection: Keep-Alive,它(明确地)告诉服务器不要关闭 TCP 会话,以便您可以将其重用于另一个请求。

但是,服务器在任何情况下都不能接受 - 它可能会忽略您的请求并使用Connection: Close. 此外,连接的每一端都可以随时关闭连接和/或在(下一个)标题中告诉另一端这样做,通过Connection: Close

为了正确实现,您应该指定Connection: Close循环处理最后一个 URL 的时间。

在HTTP 1.1 / RFC 2068 第 14.10 节HTTP 1.1 / RFC 2616 第 19.6.2节中查找有关持久连接的更多详细信息。

于 2012-07-20T19:30:03.133 回答