python - tabula-py does not detect tables in a PDF

Question

I used the following code, but it did not detect anything. I also tried several other PDFs and got the same result.

from tabula import read_pdf, convert_into

# With multiple_tables=True, read_pdf returns a list of DataFrames (one per detected table)
df = read_pdf("1415_048.pdf", output_format="dataframe", encoding="utf-8", java_options=None, multiple_tables=True)

The PDF looks like this: [image not shown]

[] #This is the result I'm getting

score 3 · Accepted Answer

tabula-py is based on tabula-java. It only works with text-based PDFs.

According to the Tabula website, https://tabula.technology/:

Note: Tabula only works on text-based PDFs, not scanned documents.
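One way to check whether a file is text-based before handing it to tabula-py is to try extracting text from it. The sketch below is only a rough heuristic, not part of tabula-py: it assumes the third-party pypdf package is installed, and the helper names `looks_text_based` and `extract_page_texts`, as well as the `min_chars` threshold of 20, are made up for illustration.

```python
from typing import Iterable, List


def looks_text_based(page_texts: Iterable[str], min_chars: int = 20) -> bool:
    """Return True if the page texts hold at least min_chars non-whitespace characters.

    min_chars is an arbitrary, hypothetical threshold; tune it for your files.
    """
    total = sum(len("".join(text.split())) for text in page_texts if text)
    return total >= min_chars


def extract_page_texts(path: str) -> List[str]:
    """Extract the text of every page using pypdf (assumed dependency: pip install pypdf)."""
    from pypdf import PdfReader  # imported lazily so the heuristic above works without pypdf

    reader = PdfReader(path)
    # extract_text() may yield an empty string for image-only pages
    return [page.extract_text() or "" for page in reader.pages]
```

Calling `looks_text_based(extract_page_texts("1415_048.pdf"))` and getting `False` would suggest the file is a scanned document with no text layer, which matches the empty list returned by `read_pdf`. A `True` result does not guarantee that tabula-py will find a table, only that there is text for it to work with.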
