擅长:python、mysql、java
<p>如果可以使用urllib2以外的其他方法,<strong>Selenium</strong>可以轻松地执行此类任务,并进行实际的浏览器模拟:</p>
<pre><code>from selenium import webdriver
from bs4 import BeautifulSoup
url = 'http://nametrends.net/name.php?name=Ruby'
driver = webdriver.Firefox()
driver.get(url)
# wait until 'tabular' appears on browser
assert 'tabular' not in driver.page_source
html = BeautifulSoup(driver.page_source)
for table in html.find_all('table'):
print table
</code></pre>