用Python抓取网页价格

import requests from bs4 import BeautifulSoup as bs bURL = 'https://www.thewhiskyexchange.com/c/540/taiwanese-whisky' headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.169 Safari/537.36'} r = requests.get('https://www.thewhiskyexchange.com/c/540/taiwanese-whisky') soup = bs(r.content, 'lxml') productlist = soup.find_all('div', class_='item') productlinks = [] for item in productlist: for link in item.find_all('a', href=True): print(link['href'])

1条回答

网友

1楼 · 发布于 2024-09-27 21:24:18

视频发布后，该网站的结构发生了变化

我已修复了以下代码：

import requests 
from bs4 import BeautifulSoup as bs

bURL = 'https://www.thewhiskyexchange.com/c/540/taiwanese-whisky'
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.169 Safari/537.36'} 

r = requests.get(bURL, headers=headers)

soup = bs(r.text, 'html.parser')

for x in soup.find_all('li', {'class':'product-grid__item'}):
    link = x.find('a')
    print(x.text, 'https://www.thewhiskyexchange.com'+link['href'])

相关问题更多 >

编程相关推荐

热门问题

热门文章