擅长:python、mysql、java
<p>可以将文件路径作为参数传递到函数中。在</p>
<p>所以:</p>
<pre class="lang-py prettyprint-override"><code>def getDataFromPdf(filePath):
acctNumberRegex = re.compile(r'\d\d\d\d\d-\d\d\d-\d\d\d\d')
pdfFile = open(filePath + 'records.pdf', 'rb')
reader = PyPDF2.PdfFileReader(pdfFile)
for pageNum in range(0,10):
page = reader.getPage(pageNum).extractText()
accounts = acctNumberRegex.findall(page)
for acct in accounts:
if acct not in results:
results.append(acct)
print(len(results))
</code></pre>