python - 如何调用 urllib2 get_header 方法？

Question

尽管RanRag or jterrace建议的方法对我来说效果很好，但我想知道如何使用urllib2.Request.get_header方法来实现相同的效果。所以，我尝试了下面的代码行：

>>> import urllib2
>>> req_info = urllib2.Request('http://mirror01.th.ifl.net/releases//precise/ubuntu-12.04-desktop-i386.iso')
>>> req_info.header_items()
[]
>>> req_info.get_header('Content-Length')
>>>

因为，您可以看到get_header没有返回，也没有header_items。

那么，使用上述方法的正确方法是什么？

score 3 · Accepted Answer

该类urllib2.Request只是“一个 URL 请求的抽象”（http://docs.python.org/library/urllib2.html#urllib2.Request），并不做任何实际的数据检索。您必须使用urllib2.urlopen来检索数据。urlopen要么直接将 url 作为字符串，要么你也可以传递Request对象的实例。

例如：

>>> req_info = urllib2.urlopen('https://www.google.com/logos/2012/javelin-2012-hp.jpg')
>>> req_info.headers.keys()
['content-length', 'x-xss-protection', 'x-content-type-options', 'expires', 'server', 'last-modified', 'connection', 'cache-control', 'date', 'content-type']
>>> req_info.headers.getheader('Content-Length')
'52741'

python - 如何调用 urllib2 get_header 方法？

1 回答 1

Related

Reference