更新用户代理信息的正确方法是什么urllib3
?
如何检查用户代理信息是否确实已更改并正在使用?
例如:
user_agent = {'user-agent': 'Mozilla/5.0 (Windows NT 6.3; rv:36.0) Gecko/20100101 Firefox/36.0'}
http = urllib3.PoolManager(10, headers=user_agent)
r1 = http.request('GET', 'http://example.com/')
if r1.status is 200:
with open('somefile','w+') as f:
f.write(r1.data)
当我创建一个PoolManager
at 时,http
我查看了它dir(http)
,发现它http.headers
默认为空并更新为指定的用户代理信息,但它被使用了吗?有没有无需查看apache
日志即可进行检查?
/var/log/apache2/access.log
并在尝试更新用户代理后进行实际检查:
>>> import urllib3
>>> user_agent = {'user-agent': 'Mozilla/5.0 (Windows NT 6.3; rv:36.0) Gecko/20100101 Firefox/36.0'}
>>> http = urllib3.PoolManager(2, headers=user_agent)
>>> r = http.request('GET','localhost')
>>> with open('/var/log/apache2/access.log','r') as f:
... last_line = f.readlines()[-1]
...
>>> last_line
'127.0.0.1 - - [08/Dec/2014:20:42:04 -0500] "GET / HTTP/1.1" 200 461 "-" "-"\n'