在没有唯一类或标识符的情况下，如何创建webscrape？

import requests from bs4 import BeautifulSoup url = 'https://www.wunderground.com/history/daily/gb/christchurch/EGHH/date/2019-8-11' response = requests.get(url) soup = BeautifulSoup(response.text, 'html.parser')

2条回答

网友

1楼 · 编辑于 2024-10-02 04:33:57

尝试使用：

class="mat-cell cdk-cell cdk-column-temperature mat-column-temperature ng-star-inserted"

网友

2楼 · 编辑于 2024-10-02 04:33:57

如果您想要刮取的元素没有class或id，那么您可以使用xpath获得它

import lxml.html
import requests

url = "https://www.wunderground.com/history/daily/gb/christchurch/EGHH/date/2019-8-11"
path = "/html/body/app-root/app-history/one-column-layout/wu-header/sidenav/mat-sidenav-container/mat-sidenav-content/div/section/div[1]/lib-city-header/div[1]/div/div/a[1]/lib-display-unit/span/span[1]/text()"

response = requests.get(url)
tree = lxml.html.fromstring(response.text)
temperature = tree.xpath(path)

if temperature:
    print(temperature[0])

相关问题更多 >

编程相关推荐

热门问题

热门文章

在没有唯一类或标识符的情况下，如何创建webscrape？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >