<p>One suggestion: check out the <a href="https://pypi.org/project/PyPDF2/" rel="nofollow noreferrer">PyPDF2</a> library. It is very easy to use as long as your PDFs are well-formed. A code example is as simple as this:</p>
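<pre><code>from PyPDF2 import PdfFileReader  # legacy API, removed in PyPDF2 3.0


def extract_information(pdf_path):
    # PDFs must be opened in binary mode
    with open(pdf_path, 'rb') as f:
        pdf = PdfFileReader(f)
        information = pdf.getDocumentInfo()  # title, author, etc.
        number_of_pages = pdf.getNumPages()
    return information, number_of_pages
</code></pre>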
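<p>The snippet above uses the <code>PdfFileReader</code> names, which were removed in PyPDF2 3.0. The following is a minimal sketch of the same idea with the newer <code>PdfReader</code> interface, assuming PyPDF2 3.0 or later is installed. The helper <code>summarize</code> and the keys of the returned dict are made up for illustration and are not part of the library.</p>

```python
def summarize(metadata, number_of_pages):
    """Hypothetical helper: flatten the metadata object into a plain dict.

    `metadata` is expected to expose `title` and `author` attributes,
    as PyPDF2's metadata object does; it may also be None.
    """
    return {
        "title": getattr(metadata, "title", None),
        "author": getattr(metadata, "author", None),
        "pages": number_of_pages,
    }


def extract_information(pdf_path):
    # Imported lazily so that `summarize` stays usable without PyPDF2.
    # Assumption: PyPDF2 >= 3.0, where PdfReader replaces PdfFileReader.
    from PyPDF2 import PdfReader

    reader = PdfReader(pdf_path)
    # `metadata` replaces getDocumentInfo(); len(reader.pages) replaces getNumPages()
    return summarize(reader.metadata, len(reader.pages))
```

<p>Calling <code>extract_information("example.pdf")</code> (the file name is a placeholder) would return a dict with the title, author and page count. A missing title or author comes back as <code>None</code>.</p>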
<p><a href="http://www.unixuser.org/%7Eeuske/python/pdfminer/index.html" rel="nofollow noreferrer">PDFMiner</a> is also a good option.</p>
<p>This article from the <a href="https://realpython.com/pdf-python/#how-to-extract-document-information-from-a-pdf-in-python" rel="nofollow noreferrer">RealPython</a> blog is a bit old, but it is also a good source of information.</p>