python - Python: What is the gdata method for uploading an image with OCR enabled?

Question

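As this PHP code demonstrates (http://code.google.com/p/gdata-samples/source/browse/trunk/doclist/OCRDemo/ocr.php?r=194),

an image can be uploaded to Google Docs, where it is automatically converted to text. I would like to know how to do this in Python. There is an "Upload" method, but I am confused about how to enable the OCR functionality.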

score 2 · Accepted Answer

Assuming you are starting from here: http://code.google.com/apis/documents/docs/3.0/developers_guide_python.html

and that you have already created an authenticated client object:

import os
import gdata.data

f = open('/path/to/your/test.pdf', 'rb')  # binary mode, since the file is not text
ms = gdata.data.MediaSource(file_handle=f, content_type='application/pdf', content_length=os.path.getsize(f.name))
folder = "https://docs.google.com/feeds/default/private/full"  # folder in Google Docs
entry = client.Upload(ms, f.name, folder_or_uri=folder + '?ocr=true')  # ?ocr=true is the kicker

Passing folder_or_uri with the trailing ?ocr=true parameter is what triggers the conversion.
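If the folder URI might already carry a query string, plain concatenation of '?ocr=true' would produce a malformed URI. A minimal sketch of a safer way to build it follows; with_ocr is a hypothetical helper, not part of gdata, and it uses only the standard library:

```python
try:
    from urllib.parse import urlencode, urlsplit, urlunsplit, parse_qsl
except ImportError:  # Python 2, which the gdata client was written for
    from urllib import urlencode
    from urlparse import urlsplit, urlunsplit, parse_qsl


def with_ocr(uri):
    """Return uri with ocr=true set, keeping any other query parameters."""
    parts = urlsplit(uri)
    # Drop any existing ocr parameter so it is not duplicated.
    query = [(k, v) for k, v in parse_qsl(parts.query) if k != 'ocr']
    query.append(('ocr', 'true'))
    return urlunsplit((parts.scheme, parts.netloc, parts.path,
                       urlencode(query), parts.fragment))
```

The result could then be passed as folder_or_uri=with_ocr(folder) in the Upload call above, in place of folder + '?ocr=true'.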

Once the document has been created, you can export it as a txt document.
