python - 在 Windows 上开始使用 Python OCR？

Question

我以前从未使用过python，我不知道从哪里开始。我的目标是获取数字和多色背景的图像数据，并可靠地识别出正确的字符。我查看了为此所需的工具，我发现了 Anaconda python 发行版，其中包括我可能需要的所有可能的包，以及 tesseract-ocr 和 pytesser。

不幸的是，我不知道如何开始。我正在使用 PyCharm 社区 IDE，只是尝试遵循本指南： http: //www.manejandodatos.es/2014/11/ocr-python-easy/ 来掌握 OCR。

这是我正在使用的代码：

from PIL import Image
from pytesser import *

image_file = 'menu.jpg'
im = Image.open(image_file)
text = image_to_string(im)
text = image_file_to_string(image_file)
text = image_file_to_string(image_file, graceful_errors=True)
print "=====output=======\n"
print text

我相信我正在使用的 Anaconda 发行版有 PIL，但我收到了这个错误：

C:\Users\diego_000\Anaconda\python.exe C:/Users/diego_000/PycharmProjects/untitled/test.py
Traceback (most recent call last):
  File "C:/Users/diego_000/PycharmProjects/untitled/test.py", line 2, in <module>
    from pytesser import *
  File "C:\Users\diego_000\PycharmProjects\untitled\pytesser.py", line 6, in <module>
    import Image
ImportError: No module named Image

Process finished with exit code 1

谁能指出我正确的方向？

score 2 · Accepted Answer

The document you point to says to use

from PIL import Image

except you use

import Image

and so the interpreter properly says:

ImportError: No module named Image

It looks as if you reordered the lines

from PIL import Image
from pytesser import *

and that pytesser has a improperly coded dependency on PIL. but I can't be certain with the code you provided.

python - 在 Windows 上开始使用 Python OCR？

1 回答 1

Related

Reference