0

例如,使用 MediaFileUpload可以参考创建/上传文件到 Google Drive 的基本文档。

但是,虽然我有创建文件的代码,但会从 HTML 转换为 Google Doc 格式。当它们仅包含 ASCII 字符时,它可以完美运行,但是当我添加非 ASCII 字符时,它会失败,并带有以下回溯:

Traceback (most recent call last):
  File "d:\my\py\ckwort.py", line 949, in <module>
    rids, worker_documents = analyze( meta, gd )
  File "d:\my\py\ckwort.py", line 812, in analyze
    gd.mkdir( **iy )
  File "d:\my\py\ckwort.py", line 205, in mkdir
    self.create( **( kw['subop']))
  File "d:\my\py\ckwort.py", line 282, in create
    media_body=kw['media_body'],
  File "D:\my\py\gdrive2\oauth2client\util.py", line 120, in positional_wrapper
    return wrapped(*args, **kwargs)
  File "D:\my\py\gdrive2\apiclient\http.py", line 676, in execute
    headers=self.headers)
  File "D:\my\py\gdrive2\oauth2client\util.py", line 120, in positional_wrapper
    return wrapped(*args, **kwargs)
  File "D:\my\py\gdrive2\oauth2client\client.py", line 420, in new_request
    redirections, connection_type)
  File "D:\my\py\gdrive2\httplib2\__init__.py", line 1597, in request
    (response, content) = self._request(conn, authority, uri, request_uri, method, body, headers, redirections, cachekey)
  File "D:\my\py\gdrive2\httplib2\__init__.py", line 1345, in _request
    (response, content) = self._conn_request(conn, request_uri, method, body, headers)
  File "D:\my\py\gdrive2\httplib2\__init__.py", line 1282, in _conn_request
    conn.request(method, request_uri, body, headers)
  File "C:\Python27\lib\httplib.py", line 958, in request
    self._send_request(method, url, body, headers)
  File "C:\Python27\lib\httplib.py", line 992, in _send_request
    self.endheaders(body)
  File "C:\Python27\lib\httplib.py", line 954, in endheaders
    self._send_output(message_body)
  File "C:\Python27\lib\httplib.py", line 812, in _send_output
    msg += message_body
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 370: ordinal not in range(128)

我没有找到任何参数来指定 MediaFileUpload 应该使用什么文件编码(我的文件使用 UTF-8)。我错过了什么吗?

后来:我突然想到尝试添加

<meta content="text/html; charset=utf-8" http-equiv="Content-Type">

到每个源文件。没有伤害 ASCII 的,但没有帮助那些包含非 ASCII UTF-8 字符的。

稍后:我已经通过对序数大于 127 的所有字符使用 HTML 实体(表示 Unicode 字符的 ASCII 字符序列)来解决这个问题。但这不应该是必需的。

4

0 回答 0