擅长:python、mysql、java
<p>你可以尝试使用lxml库。在</p>
<p><a href="http://blog.ianbicking.org/2008/12/10/lxml-an-underappreciated-web-scraping-library/" rel="nofollow">lxml article</a></p>
<pre><code>from lxml.html import parse
doc = parse('http://java.sun.com').getroot()
post = doc.cssselect('div#topmenucontainer')
</code></pre>