<p>在我看来,你有四种可能:</p>
<ul>
<li><p>您可以使用<a href="https://github.com/chezou/tabula-py" rel="nofollow noreferrer">tabula</a></p></li>
<li><p>您可以使用pdf to text将pdf转换为文本,然后使用python解析文本</p></li>
<li><p>您可以使用外部工具,将pdf文件转换为excel或csv,然后使用必需的python模块打开excel/csv文件。</p></li>
<li><p>您还可以将pdf转换为图像文件,然后使用任何最新的OCR软件(自动从图片重建表格)来获取数据</p></li>
</ul>
<p>你的问题与以下类似:</p>
<ul>
<li><p><a href="https://stackoverflow.com/questions/28532770/extract-identify-tables-from-pdf-python">Extract / Identify Tables from PDF python</a></p></li>
<li><p><a href="https://stackoverflow.com/questions/27927880/extracting-tables-from-a-pdf">Extracting tables from a pdf</a></p></li>
<li><p><a href="https://stackoverflow.com/questions/17591426/extract-table-from-a-pdf">Extract table from a PDF</a></p></li>
<li><p><a href="https://stackoverflow.com/questions/25125178/how-to-scrape-tables-in-thousands-of-pdf-files">How to scrape tables in thousands of PDF files?</a></p></li>
<li><p><a href="https://stackoverflow.com/questions/29868541/pdf-data-and-table-scraping-to-excel">PDF Data and Table Scraping to Excel</a></p></li>
<li><p><a href="https://stackoverflow.com/questions/17217194/extracting-table-contents-from-a-collection-of-pdf-files/26110587#26110587">Extracting table contents from a collection of PDF files</a></p></li>
</ul>
<p>问候</p>