I have a problem converting HTML to plain text. The pages I'm reading contain Unicode characters such as \u00f3 and \u00f1. I want them converted to plain ASCII (o and n rather than ó and ñ).

I've tried a lot in Python, but does anyone know an easy solution?

1 Answer

Take a look at this SO question: What is the best way to remove accents in a Python unicode string?

Two good libraries offer a solution:

Unidecode (may add characters) and unicodedata (truncates)
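As a rough illustration of the unicodedata route, here is a minimal sketch using only the standard library. The function name `strip_accents` is made up for this example, and it assumes the page text has already been decoded into a Python `str`. It decomposes each character with NFKD normalization, then drops everything that is not ASCII.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Return an ASCII-only approximation of text (hypothetical helper)."""
    # NFKD splits an accented letter into its base letter plus combining marks,
    # e.g. "\u00f3" becomes "o" followed by U+0301 (combining acute accent).
    decomposed = unicodedata.normalize("NFKD", text)
    # Encoding to ASCII with errors="ignore" discards the combining marks
    # and any other character that has no ASCII form.
    return decomposed.encode("ascii", "ignore").decode("ascii")


print(strip_accents("canci\u00f3n espa\u00f1ola"))  # prints: cancion espanola
```

Characters that have no decomposition, such as ß, are dropped entirely by this approach, which is presumably what "truncates" refers to above. Unidecode instead substitutes a transliteration, which can make the result longer than the input.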

Answered 2013-07-14T17:05:14.390