0

我在 var1 中有这个字符串

var1 = '$a=1%7Cscroll%20on%20%22Page%3A%20Generator-Sets-Construction%3Fid%3Dci%26s%3DY2l8Tj00Mjk0NzQ4MDY5KzQyOTQ5NjM4OTY%3D%22%7C-%7Cscroll%7C1443616500011%7C1443616500586%7C3774$fId=16440287_806$rId=RID_-62268720$rpId=1762047089$domR=1443616443684$time=1443616500588'

如何将字符串的内容更改为“可读”文本,即非 URL 编码。

根据研究,这是我尝试过的代码,但它仍然保留 URL 编码的项目,例如 %20 等。

import html
print(html.unescape('$a=1%7Cscroll%20on%20%22Page%3A%20Generator-Sets-      Construction%3Fid%3Dci%26s%3DY2l8Tj00Mjk0NzQ4MDY5KzQyOTQ5NjM4OTY%3D%22%7C-%7Cscroll%7C1443616500011%7C1443616500586%7C3774$fId=16440287_806$rId=RID_-62268720$rpId=1762047089$domR=1443616443684$time=1443616500588'))

感谢所有帮助,或者如果有一个现有的模块可以做到这一点。

4

1 回答 1

1

您要做的是取消引用参数字符串而不是取消转义 html。以下应该工作 -

import urllib.parse
print(urllib.parse.unquote('$a=1%7Cscroll%20on%20%22Page%3A%20Generator-Sets-      Construction%3Fid%3Dci%26s%3DY2l8Tj00Mjk0NzQ4MDY5KzQyOTQ5NjM4OTY%3D%22%7C-%7Cscroll%7C1443616500011%7C1443616500586%7C3774$fId=16440287_806$rId=RID_-62268720$rpId=1762047089$domR=1443616443684$time=1443616500588'))
于 2015-10-01T09:30:45.403 回答