擅长:python、mysql、java
<p>以下内容摘自文档(<a href="https://pythonhosted.org/PyPDF2/PageObject.html" rel="nofollow noreferrer">https://pythonhosted.org/PyPDF2/PageObject.html</a>)</p>
<blockquote>
<p>extractText() Locate all text drawing commands, in the order they are
provided in the content stream, and extract the text. This works well
for some PDF files, but poorly for others, depending on the generator
used. This will be refined in the future. Do not rely on the order of
text coming out of this function, as it will change if this function
is made more sophisticated. Returns: a unicode string object.</p>
</blockquote>
<p>因此,这个函数的性能似乎取决于pdf本身。</p>