Find centralized, trusted content and collaborate around the technologies you use most.
Teams
Q&A for work
Connect and share knowledge within a single location that is structured and easy to search.
我正在尝试从 python 中的 PDF 中提取文本,但我收到以下警告消息,它限制了提取的每个页面的文本量。有没有人能想到的解决方案来解决这个问题?代码也在下面:
警告:pdfminer.layout:要分组的框(106)太多,跳过。
import slate3k as slate with open("mypdf.pdf",'rb') as f: extracted_text = slate.PDF(f) print(extracted_text)