
I have had a lot of success using Tabula to convert PDFs to CSV files, but this particular one is causing me all kinds of issues. The file can be found here.

It seems the multiple row spans are giving Tabula headaches. I would not expect Tabula to convert the file perfectly, and I expect to do some post-processing cleanup (usually a few sed commands), but I am not even getting close to a CSV file that could serve as a starting point. I have tried the spreadsheet, no-spreadsheet, guess, columns, and area options with no success. Does anyone have any other ideas about what to try?
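To make the post-processing step concrete, here is a minimal Python sketch of the kind of cleanup I have in mind. It assumes the extraction at least produces one CSV row per table row, with cells covered by a row span coming out blank. The helper name `forward_fill_rows` and the sample data are hypothetical and only for illustration. They are not Tabula output from the file in question.

```python
import csv
import io


def forward_fill_rows(rows, fill_columns):
    """Copy the last non-empty value down into blank cells of the given columns.

    Assumption: a cell that was part of a row span shows up as an empty
    string on every row after the first one it covers.
    """
    last_seen = {}
    filled = []
    for row in rows:
        new_row = list(row)
        for col in fill_columns:
            if col >= len(new_row):
                continue  # short row, nothing to fill in this column
            if new_row[col].strip():
                last_seen[col] = new_row[col]
            elif col in last_seen:
                new_row[col] = last_seen[col]
        filled.append(new_row)
    return filled


# Hypothetical sample: "North" and "South" each span two rows in the PDF.
sample = "Region,Item,Count\nNorth,Apples,3\n,Pears,5\nSouth,Plums,2\n,Figs,7\n"
rows = list(csv.reader(io.StringIO(sample)))
cleaned = forward_fill_rows(rows, fill_columns=[0])

# Write the cleaned rows back out as CSV text.
out = io.StringIO()
csv.writer(out, lineterminator="\n").writerows(cleaned)
print(out.getvalue())
```

This only helps once the rows and columns line up roughly right, which is exactly the starting point I cannot get to with this file.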
