擅长:python、mysql、java
<p>我相信你会得到urllib.error.HTTPError:HTTP错误403:禁止错误。在</p>
<p>您可以使用</p>
<pre><code>import lxml.html
import lxml.etree
from urllib.request import Request, urlopen
req = Request('http://www.inquirer.net/', headers={'User-Agent': 'Mozilla/5.0'})
res = urlopen(req).read()
html_content = lxml.html.fromstring(r)
root = html_content.xpath('//*[@id="tgs3_info"]/h2')
print(root)
</code></pre>