问题标签 [pytesser]

问问题

For questions regarding programming in ECMAScript (JavaScript/JS) and its various dialects/implementations (excluding ActionScript). Note JavaScript is NOT the same as Java! Please include all relevant tags on your question; e.g., [node.js], [jquery], [json], [reactjs], [angular], [ember.js], [vue.js], [typescript], [svelte], etc.

102 问题

0 投票

1 回答

2232 浏览

python - Recognize simple digits with pytesser

I'm learning OCR using PyTesser and Tesseract. As the first milestone, I want to write a tool to recognize captcha that simply consists of some digits. I read some tutorials and wrote such a test program.

I tested my code with the image below. But the result is 2(T?770. And I've tested some other similar images as well, in 80% case the results are incorrect.

enter image description here

I'm not familiar with imaging processing. I've two questions here:

Is it possible to tell PyTesser to guess digits only?
I think the image is quite easy for human to read. If it is so difficult for PyTesser to read digits only image, is there any alternatives can do a better OCR?

Any hints are very appreciated.

2014-06-16T16:02:21.530

0 投票

1 回答

3068 浏览

python - 如何使 pytesser (Tesseract) 工作？

我正在尝试让 pytesser（可在此处下载）在我的 mac OS 上运行，但我没有成功。

我安装了 Tesseract、PIL 和所有依赖项。

我在我的 python lib 文件夹中解压缩了 pytesser 并将脚本文件修改为__init__.py init 文件中我按照此处和此处tesseract.exe的建议修改了文件的路径：

这就是我得到的错误：

似乎该模块无法运行 .exe 文件。我试图更改路径，添加扩展名 .exe 但我总是得到同样的错误。

python macos tesseract pytesser

2014-09-28T12:46:52.410

0 投票

1 回答

4226 浏览

python - 使用 PyTesser 破解简单的验证码

我PyTesser用来打破一个captcha. PyTesser 使用tesseractpython ocr 库。在将图像放入 PyTesser 之前，我使用了一些过滤。一步一步我的代码：输入图像是：

应用此代码后，输出为：

现在，

在此代码段输出图像之后是：

在此处输入图像描述

最后，当我应用这个时

我得到输出%/ww

请帮助我找到正确的结果。

如果我尝试使用这些图像，此代码可以成功识别字母。

在此处输入图像描述

python ocr tesseract pytesser

2015-02-10T06:17:08.167

0 投票

1 回答

7454 浏览

python - pytesseract 显示“‘str’对象没有属性‘save’”错误

当我为 pytesseract 运行以下代码时

它显示以下错误

这个错误是什么意思？我该如何纠正这个？

提前致谢

python python-2.7 numpy pytesser

2015-02-15T16:01:43.637

0 投票

1 回答

835 浏览

python - Python 2.7.9 中的 Pytesser - 打印 image_to_string(image) 错误

我正在将 Pytesser 用于 python OCR。我已经安装了 PIL 和 Pytesser 并运行了以下代码：

然后出现这个错误

这意味着什么？我做错了什么？我正在使用 Windows 8 64 位和 Python 2.7.9

python python-2.7 ocr pytesser

2015-03-12T04:44:41.620

0 投票

2 回答

1277 浏览

python-2.7 - 使用 pytesseract 时出错

我正在使用 pytesseract 将图像转换为文本。我使用 pip 命令成功安装了 pytesseract。但是当我运行脚本时，它显示错误：No module named Tesseract。

这些是我的代码：

错误：

python-2.7 pytesser

2015-04-26T04:49:43.823

0 投票

1 回答

1015 浏览

python - pytesser subprocess.Popen 失败

从昨天开始，我试图使用 OCR pytesser。我自己解决了几个问题，但我不知道如何解决这个问题。有错误：

最后一行说“找不到文件”

我如何将 tesseract 放入我的init .py

我真的不明白为什么他不能打开文件。在我的init .py中还有另外两件事。我可以更改我尝试创建的图像文件和 txt 文件并给他路径没有成功，但我认为他自己创建了它们。

这是发送到 Popen 的 3 个文件，所以我想错误就在那里。

我希望我足够清楚，让你们理解我遇到的问题。

编辑：lol.py 中的内容来自该站点，只是修改了网址http://www.debasish.in/2012/01/bypass-captcha-using-python-and.html

python ocr pytesser

2015-05-05T19:01:38.007

0 投票

1 回答

3026 浏览

python - pytesser - next line of text in image?

I'm using pytesser on simple images with plain text. It works great! However, in python it prints each line of text on a new line. But the string it outputs has no "\n" or new line delimiters that I can pull out.

How does it print each new line of the image on a new line in the console? And is there a way I can pull out a particular line? or split them myself?

It's more than likely something very simple i'm missing...

Output:

python pytesser

2015-05-27T13:40:39.647

0 投票

1 回答

475 浏览

python - 如何加载数学 tesseract 模块？

所以我是新手使用 tesseract，我想加载数学输入模块。不幸的是，我不知道如何将它与此链接中的数学模块一起使用。如何正确加载？它会默认加载经过训练的数据吗？我已经将经过训练的数据添加到适当的 tessdata 文件夹中了吗？我无法弄清楚 lang 参数的 isocode 应该是什么？是垫子之类的东西吗？关于这个问题的文档非常有限，我们将不胜感激。

我也在用 pytesseract 进行编码，但如果它不支持更改训练数据集，我对其他模块持开放态度。

python ocr tesseract pytesser

2015-06-27T17:35:09.430

0 投票

1 回答

10062 浏览

python - 在 Windows 上开始使用 Python OCR？

我以前从未使用过python，我不知道从哪里开始。我的目标是获取数字和多色背景的图像数据，并可靠地识别出正确的字符。我查看了为此所需的工具，我发现了 Anaconda python 发行版，其中包括我可能需要的所有可能的包，以及 tesseract-ocr 和 pytesser。

不幸的是，我不知道如何开始。我正在使用 PyCharm 社区 IDE，只是尝试遵循本指南： http: //www.manejandodatos.es/2014/11/ocr-python-easy/ 来掌握 OCR。

这是我正在使用的代码：

我相信我正在使用的 Anaconda 发行版有 PIL，但我收到了这个错误：

谁能指出我正确的方向？

python python-imaging-library anaconda python-tesseract pytesser

2015-06-28T19:37:55.727

1 2 3 4 5 6 7 8 9 10

问题标签 [pytesser]

Reference