python - Unknown Unicode output in Python 2

Question

I am trying to use this code from the NLP UPC research group to retrieve synonyms for some input words. When I run the test method:

def test():
    "tests some functions"
    a=wn.get_words(True)
    print  'length of a: ', len(a)
    print 'a[0]: ', a[0].tostring().decode('utf-8')

the output appears in an unknown encoding:

length of a:  16043
a[0]:  �����

Elsewhere in the same code, the encoding is already declared as:

def _encode(data):
    return data.encode('utf8')

and the platform I am using (NetBeans 7.2.1) is configured to support UTF-8 encoding.

How can I fix this?

score 1 · Accepted Answer

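If your settings are already configured to handle UTF-8, you do not need to decode the string into a Unicode object. When you do decode it, printing the resulting Unicode object makes Python re-encode it with the encoding it detected for sys.stdout, which may not be UTF-8.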

Try it without decoding:

print 'a[0]: ', a[0].tostring()
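
To illustrate the difference, here is a minimal sketch of what happens when a decoded string is printed. The byte string `raw` is a hypothetical stand-in for the value returned by `a[0].tostring()`, assuming those bytes really are UTF-8. The variable names are made up for this example and are not part of the original code.

```python
from __future__ import print_function
import sys

# Hypothetical stand-in for a[0].tostring(), assumed to be UTF-8 bytes.
raw = b'caf\xc3\xa9'

# Decoding yields a Unicode object: 4 characters instead of 5 bytes.
text = raw.decode('utf-8')

# Printing a Unicode object makes Python encode it with the encoding
# detected for sys.stdout. That value can be missing (for example when
# output is piped) or differ from what the IDE is configured to display.
stdout_encoding = getattr(sys.stdout, 'encoding', None) or 'ascii'
for_console = text.encode(stdout_encoding, 'replace')

# Encoding explicitly avoids relying on the detected encoding.
utf8_bytes = text.encode('utf-8')

print('stdout encoding:', stdout_encoding)
print('round trip ok:', utf8_bytes == raw)
```

If the reported stdout encoding is not UTF-8, that would explain why a decoded string prints as garbage even though the IDE itself is set to UTF-8.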

score 0 · Accepted Answer

Thanks for your answer. I switched to this command instead, and it works for me:

print 'a[0]: ', a[0].encode('utf-8')
