从HTML中提取/解码CSS到Python

2条回答

网友

1楼 · 编辑于 2024-06-26 13:47:35

广告列表由JavaScript生成。BeautifulSoup为您提供了以下示例：

<ul class="search-results" data-bind="template: { name: 'room-template', foreach: $root.resultsViewModel.Results, as: 'resultItem' }"></ul>

我建议看一下：Getting html source when some html is generated by javascript和{a2}。在

网友

2楼 · 编辑于 2024-06-26 13:47:35

多亏了你的领导，这才是解决问题的办法，我希望有一天它会对某些人有所帮助：

from selenium import webdriver  
from bs4 import BeautifulSoup

browser = webdriver.Firefox()  
browser.get('http://uk.easyroommate.com/results-room/loc/981238/pag/1')  
html_source = browser.page_source  
browser.quit()

soup = BeautifulSoup(html_source,'html.parser')  
print soup.prettify()
## You are now able to see the HTML generated by javascript code and you 
## can extract it as usual using BeautifulSoup

for el in soup.findAll('div', class_="listing-meta listing-meta small"):
    print el.find('a').get('href')

同样在我的例子中，我只想提取这些链接，但是一旦您通过Selenium获得了web页面源代码，那么使用beauthoulsoup并获得所需的每一项都是小菜一碟。在

相关问题更多 >

编程相关推荐

热门问题

热门文章

从HTML中提取/解码CSS到Python

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >