php - 如何让 curl 将这些 v 字符解码为 UTF-8 或 ascii

Question

我正在使用cURL从这样的网站中提取数据：

function get_data($url)
{
$ch = curl_init();
$timeout = 7;
$userAgent = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; .NET CLR 1.1.4322)";
curl_setopt($ch, CURLOPT_USERAGENT, $userAgent);
curl_setopt($ch,CURLOPT_URL,$url);
curl_setopt($ch,CURLOPT_RETURNTRANSFER,1);
curl_setopt($ch,CURLOPT_CONNECTTIMEOUT,$timeout);
curl_setopt($ch, CURLOPT_ENCODING, "UTF-8");
$data = curl_exec($ch);
curl_close($ch);
return $data;
}

curl 将确切的字符返回为“ vnenna”，浏览器将显示为“vnenna”。

问题是：如何找到这些字符并重新格式化内容以正确显示？

score 3 · Accepted Answer

这不是 cUrl 问题。您看到的只是 HTML 实体编码的字符。用于html_entity_decode()解码它们。

见http://php.net/manual/en/function.html-entity-decode.php

php - 如何让 curl 将这些 v 字符解码为 UTF-8 或 ascii

1 回答 1

Related

Reference