python - 从字符串 python 中删除 \xe2\x80\xa6

Question

我有很多 txt 文件，我需要替换其中的一些文本。几乎所有人都有这个non-ascii角色（我以为是"..."，但是……不一样）我试过replace()但我做不到，我需要一些帮助！提前致谢

score 4 · Accepted Answer

如果您使用codecs.open()打开文件，那么您将获得所有字符串为unicodes，这更容易处理。

score 2 · Accepted Answer

2

使用 unicode 类型的字符串。例如，

>>> print u'\xe2'.replace(u'\xe2','a')
a

于 2012-08-17T06:30:01.803 回答

score -1 · Accepted Answer

问题是这些字符无效str，它们是unicode.

import re
re.sub(r'<string to repleace>','',text,re.U)

大多数其他答案也可以

3 回答 3