python - PIL preprocessing for tesseract OCR

Question

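How can I improve OCR accuracy?

I am using pyocr to call the tesseract binary, wand to convert the PDF to images, and Pillow to process the images for OCR.

All the images are attached.

I think this is the best preprocessing I can do.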

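from PIL import ImageFilter
from PIL.ImageDraw import Draw
import pyocr
import pyocr.builders

tool = pyocr.get_available_tools()[0]  # first OCR tool pyocr finds, e.g. tesseract
# req_image (grayscale PIL images cropped from the PDF) and lang are defined elsewhere

for img in req_image:
    imgf = img.convert('RGB')  # convert so the separator can be drawn
    draw = Draw(imgf)
    x, y = imgf.size
    eX, eY = 20, 800  # separator width and height in pixels
    box = (x/2 - eX/2, y/2 - eY/2, x/2 + eX/2, y/2 + eY/2)
    draw.ellipse(box, fill=0)  # narrow black ellipse in the centre acts as the line
    del draw

    im2 = imgf.filter(ImageFilter.MinFilter(1))  # minimum filter with a 1x1 window
    im2 = im2.filter(ImageFilter.SMOOTH_MORE)
    im2 = im2.filter(ImageFilter.SMOOTH_MORE)

    # OCR the preprocessed image, restricted to digits
    txt = tool.image_to_string(
        im2,
        lang=lang,
        builder=pyocr.builders.DigitBuilder()
    )
    print(txt)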

The image was originally cropped from the PDF, then converted to grayscale, and then processed by the code above.
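The crop-and-grayscale step is not shown in the code above. A minimal sketch of what it might look like with Pillow follows. The helper name `crop_to_grayscale`, the blank stand-in page, and the crop coordinates are all assumptions made for illustration; a real page would come from the wand conversion.

```python
from PIL import Image


def crop_to_grayscale(page, box):
    """Crop `box` (left, upper, right, lower) from a page image
    and return the region as 8-bit grayscale (mode "L")."""
    region = page.crop(box)
    return region.convert("L")


# Stand-in for a page rendered from the PDF (hypothetical size and content).
page = Image.new("RGB", (1000, 1400), "white")

# Hypothetical crop box around the digits of interest.
digits = crop_to_grayscale(page, (100, 200, 900, 1000))
print(digits.mode, digits.size)
```

The resulting image can then be passed to the preprocessing code as `img`.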

I added a line in between and found that it greatly improved accuracy (I had a feeling it would work).
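The "line" in the code above is a 20 x 800 pixel ellipse centred in the image. A small sketch of the same geometry with integer coordinates is shown here. The helper name `centered_box` is hypothetical, and clipping the box to the image bounds is an addition that the original code does not perform.

```python
def centered_box(width, height, box_w=20, box_h=800):
    """Return (left, top, right, bottom) for a box centred in a
    width x height image, clipped so it never leaves the image."""
    left = max(0, (width - box_w) // 2)
    top = max(0, (height - box_h) // 2)
    right = min(width, left + box_w)
    bottom = min(height, top + box_h)
    return left, top, right, bottom


# A 1000 x 1200 image: the separator fits entirely inside it.
print(centered_box(1000, 1200))

# A 100 x 300 image: the 800-pixel height is clipped to the image.
print(centered_box(100, 300))
```

The returned tuple can be passed to `draw.ellipse` or `draw.rectangle` in place of `box`.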

Image 1: accurate. Image 2: accurate. Image 3: inaccurate, returns 6563 8 1 4. Grayscale image from the PDF.

score 1 · Accepted Answer

Microsoft has released a great API called Cognitive Services. You can use it for image recognition.

https://azure.microsoft.com/en-us/services/cognitive-services/
