擅长:python、mysql、java
<p>您需要修改xpath,因为并不是所有的<code>td</code>元素都有{<cd2>}。
请尝试以下xpath表达式:<code>//td//text()</code>。在</p>
<pre><code>import urllib
from lxml import etree
budgeturl = "http://www.the-numbers.com/movie/budgets/all"
s = urllib.urlopen(budgeturl).read()
htmlpage = etree.HTML(s)
htmltable = htmlpage.xpath("//td//text()")
</code></pre>
<p>输出:
<a href="https://i.stack.imgur.com/vsDW8.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/vsDW8.png" alt="enter image description here"/></a></p>