<p>I suggest <a href="http://docs.python.org/py3k/library/urllib.request.html" rel="nofollow">Python's urllib</a>:</p>
<blockquote>
<p>Fetching Web Pages</p>
<p>Fetching standard Web pages over HTTP is very easy with Python:</p>
<p>import urllib <br/>
f = urllib.urlopen("http://www.python.org") <br/>
s = f.read() <br/>
f.close() <br/></p>
</blockquote>
<p>--<a href="http://www.boddie.org.uk/python/HTML.html" rel="nofollow">this is from here</a></p>
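<p>The quoted snippet uses the Python 2 spelling, <code>urllib.urlopen</code>. In Python 3 the same function lives in <code>urllib.request</code>, which is the module the first link points to. A minimal sketch of the Python 3 equivalent follows; the helper name <code>fetch_page</code> and the UTF-8 fallback are my own assumptions, not part of the quoted tutorial:</p>

```python
import urllib.request


def fetch_page(url):
    """Fetch a URL and return its body as text (fetch_page is a hypothetical helper)."""
    # urlopen returns a response object that works as a context manager,
    # so the connection is closed even if read() raises.
    with urllib.request.urlopen(url) as response:
        raw = response.read()  # bytes, not str, in Python 3
        # Use the charset declared by the server if there is one;
        # falling back to UTF-8 is an assumption, not a guarantee.
        charset = response.headers.get_content_charset() or "utf-8"
    return raw.decode(charset, errors="replace")
```

<p>Calling <code>fetch_page("http://www.python.org")</code> should then return the page source as a string, much like <code>s</code> in the quoted example.</p>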
<p>Then use <a href="http://docs.python.org/py3k/library/html.parser.html" rel="nofollow">Python's html parser</a> to parse the page you fetched.</p>
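<p>As a rough illustration of the parsing step, here is a small sketch built on <code>html.parser.HTMLParser</code> that collects the <code>href</code> of every link in a page. The names <code>LinkCollector</code> and <code>extract_links</code> are made up for this example, and what you actually extract will depend on your page:</p>

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags (hypothetical example class)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value is None for bare attributes.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html_text):
    """Feed HTML text to the parser and return the collected links."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links
```

<p>You would pass the string returned by the fetch step to <code>extract_links</code>. To pull out other data, override <code>handle_data</code> or <code>handle_endtag</code> in the same way.</p>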