擅长:python、mysql、java
<p>多亏了你的领导,这才是解决问题的办法,我希望有一天它会对某些人有所帮助:</p>
<pre><code>from selenium import webdriver
from bs4 import BeautifulSoup
browser = webdriver.Firefox()
browser.get('http://uk.easyroommate.com/results-room/loc/981238/pag/1')
html_source = browser.page_source
browser.quit()
soup = BeautifulSoup(html_source,'html.parser')
print soup.prettify()
## You are now able to see the HTML generated by javascript code and you
## can extract it as usual using BeautifulSoup
for el in soup.findAll('div', class_="listing-meta listing-meta small"):
print el.find('a').get('href')
</code></pre>
<p>同样在我的例子中,我只想提取这些链接,但是一旦您通过Selenium获得了web页面源代码,那么使用beauthoulsoup并获得所需的每一项都是小菜一碟。在</p>