擅长:python、mysql、java
<p>你需要知道如何计算这些文件的类型。看看这些库:</p>
<p>PDF:<a href="http://pybrary.net/pyPdf/" rel="nofollow noreferrer">pypdf</a></p>
<p>doc/docx:<a href="https://stackoverflow.com/questions/116139/how-can-i-read-a-word-2007-docx-file">this question</a>,<a href="https://github.com/mikemaccana/python-docx" rel="nofollow noreferrer">python-docx</a></p>
<p>odt:<a href="http://www.linuxjournal.com/article/9347?page=0,1" rel="nofollow noreferrer">examples here</a></p>