python - 在Python中替换英文字母之外的任何字符？

Question

如何替换英文字母表之外的任何字符？

例如，用 ' ' 替换的 'abcdükl*m' 将是 'abcd kl m'

score 6 · Accepted Answer

使用正则表达式[^a-zA-Z]：

re.sub(r'[^a-zA-Z]', '', mystring)

一些信息：是a-zA-Z分别表示所有小写和大写字母的字符范围，^字符类开头的插入符号表示否定，例如“除这些之外的任何内容”。

score 2 · Accepted Answer

unicodedata有一种normalize可以优雅地为您降级文本的方法：

import unicodedata
def gracefully_degrade_to_ascii( text ):
    return unicodedata.normalize('NFKD',text).encode('ascii','ignore')

如果您只是尝试去除非 ASCII 字符，那么其他人提到的否定字符集正则表达式就是这样做的方法。

score 1 · Accepted Answer

1

搜索[^a-zA-Z]并替换为“”

于 2012-10-25T01:16:38.983 回答

score 1 · Accepted Answer

>>> import string
>>> print ''.join(x if x in string.ascii_letters else ' ' for x in u'abcdükl*m') 
abcd kl m

4 回答 4