我有一个奇怪的 API,它只接受gbk
参数,我在 中捕获数据Windows
IE browser
,使用以下命令显示数据:
$ cat 12_Request.txt| iconv -f GBK -t UTF-8
GET http://10.202.15.197:20176/?user_id=1&query_type=GEOSPLIT&address=广东省深圳市宝安&ret_splitinfo=1 HTTP/1.1
Accept: text/html, application/xhtml+xml, */*
Accept-Language: zh-CN
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; Trident/7.0; rv:11.0) like Gecko
Accept-Encoding: gzip, deflate
Host: 10.202.15.197:20176
DNT: 1
Connection: Keep-Alive
如您所见,我的数据编码为GBK
. 然后我像这样发送数据netcat
:
$ cat 12_Request.txt| nc 10.202.15.197 20176 | iconv -f GBK -t utf-8 # right
HTTP/1.0 200 OK
Content-Type: application/octet-stream
Connection: close
Content-Length: 222
<?xml version='1.0' encoding='GBK'?>
<addrSplitInfo>
<status>0</status><as_info prop="1" level="1">广东省</as_info>
<as_info prop="1" level="2">深圳市</as_info>
<as_info prop="3" level="18">宝安</as_info>
</addrSplitInfo>
但如果我用 发送数据UTF-8
,我会得到错误的响应:
$ cat 12_Request.txt | iconv -f GBK -t utf-8 | nc 10.202.15.197 20176 # wrong
HTTP/1.0 200 OK
Content-Type: application/octet-stream
Connection: close
Content-Length: 152
<?xml version='1.0' encoding='GBK'?>
<addrSplitInfo>
<status>0</status><as_info prop="6" level="13">广东省深圳市宝安</as_info>
</addrSplitInfo>
我试图这样发送cURL
:
curl -v \
--header 'Accept: text/html, application/xhtml+xml, */*' \
--header 'Accept-Language: zh-CN' \
-A 'Mozilla/5.0 (Windows NT 6.1; WOW64; Trident/7.0; rv:11.0) like Gecko' \
--header 'Accept-Encoding: gzip, deflate' \
--header 'Host: 10.202.15.197:20176' \
--header 'DNT: 1' \
--header 'Connection: Keep-Alive' \
http://10.202.15.197:20176?user_id=1&query_type=GEOSPLIT&address=广东省深圳市宝安&ret_splitinfo=1
这不起作用,它得到如下响应:
<?xml version='1.0' encoding='GBK'?>
<addrSplitInfo>
<status>0</status><as_info prop="6" level="13">广东省深圳市宝安</as_info>
</addrSplitInfo>
然后,与data-urlencode
:
curl \
--header 'Accept: text/html, application/xhtml+xml, */*' \
--header 'Accept-Language: zh-CN' \
-A 'Mozilla/5.0 (Windows NT 6.1; WOW64; Trident/7.0; rv:11.0) like Gecko' \
--header 'Accept-Encoding: gzip, deflate' \
--header 'Host: 10.202.15.197:20176' \
--header 'DNT: 1' \
--header 'Connection: Keep-Alive' \
http://10.202.15.197:20176 --data-urlencode 'user_id=1&query_type=GEOSPLIT&address=广东省深圳市宝安&ret_splitinfo=1'
这不起作用,它会得到这样的响应:
<?xml version='1.0' encoding='GBK'?>
<searchresult><status>1</status><count>0</count>
</searchresult>
我也试过用python,
>>> import requests
>>> url = u"http://10.202.15.197:20176?user_id=1&query_type=GEOSPLIT&address=广东省深圳市宝安&ret_splitinfo=1"
>>> r = requests.get(url.encode('utf-8').decode('gbk'))
>>> print r.text
这不起作用,响应如下:
u'<?xml version=\'1.0\' encoding=\'GBK\'?>\n<addrSplitInfo>\n<status>0</status><as_info prop="6" level="13">\xe9\u015e\x9e\u013a\xb8\xe7\u0179\u02d8\xe9\x90\u015e\xe4\u02dd\u0161\xe7\u0161\x81\xe9\x8d)\x86\u02db\xe7\u0164\u015b\xe7\x80\u0161\u0107\u017c\x86\xe7\x95\xa8</as_info>\n</addrSplitInfo>\n'