python - Scrapy gets response status 400, but the browser response is fine?

Translated from: https://stackoverflow.com/questions/27424600 2014-12-11T13:58:25.787

Viewed 1128 times

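I have this strange situation.

I have a link that works in every browser I currently have (Chrome, IE, Firefox). I tried to crawl the same page using scrapy in Python, but I get response.status == 400. I am using tor + polipo to crawl anonymously.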

The response.body is:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html><head>
<title>Proxy error: 400 Couldn't parse URL.</title>
</head><body>
<h1>400 Couldn't parse URL</h1>
<p>The following error occurred while trying to access <strong>https://exmpale.com/blah</strong>:<br><br>
<strong>400 Couldn't parse URL</strong></p>
<hr>Generated Thu, 11 Dec 2014 13:55:38 UTC by Polipo on <em>localhost:8123</em>.
</body></html>

I am just wondering why this happens. How can the browsers get the result when scrapy cannot?
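
One way to narrow this down, as a minimal sketch: the body above is signed by Polipo on localhost:8123, which suggests the 400 page was generated by the proxy and not by the target site. The helper below checks a status code and body for those markers using only the standard library. The name `looks_like_proxy_error` and the marker patterns are assumptions made for illustration; they are not part of Scrapy or Polipo.

```python
import re

# Hypothetical helper (not part of Scrapy): guesses whether an error page
# was generated by an intermediate proxy rather than by the target site.
# The patterns are assumptions based on the Polipo error page shown above.
PROXY_MARKERS = (
    re.compile(r"<title>\s*Proxy error", re.IGNORECASE),
    re.compile(r"Generated .* by Polipo", re.IGNORECASE),
)


def looks_like_proxy_error(status, body):
    """Return True if an error response appears to come from the proxy."""
    if isinstance(body, bytes):
        # Decode leniently so odd bytes in the page do not raise.
        body = body.decode("utf-8", errors="replace")
    if status < 400:
        return False
    return any(pattern.search(body) for pattern in PROXY_MARKERS)


# Shortened copy of the error page from the question.
SAMPLE_BODY = """<html><head>
<title>Proxy error: 400 Couldn't parse URL.</title>
</head><body>
<hr>Generated Thu, 11 Dec 2014 13:55:38 UTC by Polipo on <em>localhost:8123</em>.
</body></html>"""

if __name__ == "__main__":
    print(looks_like_proxy_error(400, SAMPLE_BODY))
```

In a spider callback this could be called as `looks_like_proxy_error(response.status, response.body)`. A positive result would point at the tor + polipo setup, or at how the request URL reaches the proxy, and not at the site itself.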
