python - 为什么我的输出的每一行前都有一个“u”？

Question

只是想知道在我的代码的每一行之前的“u”的意义是什么，以及我如何能够删除它们？我在 python 中工作。

Last login: Mon Jul  1 09:58:27 on ttys000
Samuel-Finegolds-MacBook-Pro:~ samuelfinegold$ /var/folders/jv/9_sy0bn10mbdft1bk9t14qz40000gn/T/Cleanup\ At\ Startup/tutor-394379967.500.py.command ; exit;
{u'company': {u'address': {u'city': u'Chicago',
                           u'contactname': '',
                           u'geo': {u'latitude': u'41.92113',
                                    u'longitude': u'-87.70085'},
                           u'state': u'IL',
                           u'street_address': '',
                           u'zip': u'60647'},
              u'companyname': u'Wyzant',
              u'costtype': '',
              u'description': u'WyzAnt is the leading tutoring marketplace on the web with 67,000+ tutors offering private lessons in hundreds of subjects like math, science, test prep, foreign languages, music, computers and much more.',
              u'email': '',
              u'facebook': u'https://www.facebook.com/WyzAnt',
              u'image': '',
              u'language': '',
              u'linkedin': '',
              u'logo': '',
              u'phone': u'8779992681',
              u'program': {u'costrange': u'[]',
                           u'costtype': '',
                           u'programtype': ''},

score 7 · Accepted Answer

u用于创建 unicode 字符串：

>>> unicode_string = u'my unicode string'
>>> type(unicode_string)
<type 'unicode'>
>>> ascii_string = 'my ascii string'
>>> type(ascii_string)
<type 'str'>

您可以使用以下方法转换 unicode 字符串str：

>>> converted_string = str(unicode_string)
>>> type(converted_string)

但是，只有当您的 unicode 字符串中的字符可以使用 ascii 表示时，这才有可能：

>>> unicode_string = u'ö'
>>> converted_string = str(unicode_string)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeEncodeError: 'ascii' codec can't encode character u'\xf6' in position 0: ordinal not in range(128)

您可以在http://docs.python.org/2/howto/unicode.html阅读有关 Python 的 unicode 字符串的更多信息

score 5 · Accepted Answer

u表示它是一个 unicode 字符串，如果该字符串仅包含 ASCII 字符，则无需转换为正常str，因为：

>>> "foo" == u"foo"
True

但是您不能将 unicode 字符串与包含非 ASCII 字符的字节字符串进行比较：

>>> u'ö' == 'ö'
False
>>> 'ö'       #contains bytes
'\xc3\xb6'
>>> u'ö'      #contains sequence of code-points 
u'\xf6'

只有将字节字符串转换为 unicode（使用正确的编码）才能进行比较：

>>> u'ö' == 'ö'.decode('utf-8')
True

文档：Unicode HOWTO

Ned Batchelder 的 ppt：实用的 Unicode：我如何止痛？

score 4 · Accepted Answer

字符串前面的小写字母u表示它是一个 unicode 字符串。这只是编码，因此根本没有害处。£Unicode 字符串能够表示比普通字符串更广泛的字符（例如），并且u不会在prints 中显示：

>>> print(u'hi')
'hi'

您可以从 python 文档中了解有关 unicode 字符串的更多信息：http: //docs.python.org/3/howto/unicode.html

score 2 · Accepted Answer

要删除 unicode，请使用类型转换。

    >>> x = u'abcd'
    >>> type(x)
    <type 'unicode'>
    >>> y = str(x)
    >>> type(y)
    <type 'str'>

python - 为什么我的输出的每一行前都有一个“u”？

4 回答 4

Related

Reference