Can I use BeautifulSoup to scrape the "value" attribute from an img tag?

webpage = requests.get('https://www.newegg.com/p/pl?Submit=StoreIM&Depa=1&Category=38')
content = webpage.content
soup = BeautifulSoup(content, 'lxml')

containers = soup.find_all("div", class_="item-container")

brand = []

for container in containers:
    cont_brand = container.find_all("div",{"class":"item-info"})
for name_brand in cont_brand:
    brand.append(name_brand.find("img").get("alt"))
print(brand)

1 answer

User

#1 · Posted 2024-09-29 23:15:58

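This happens because your first for loop does iterate over every container, but the second for loop sits outside it. By the time the second loop runs, cont_brand holds only the matches from the last container, so you only ever get the last element. The second loop needs to be nested inside the first.

Try this instead: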

import requests
from bs4 import BeautifulSoup

webpage = requests.get('https://www.newegg.com/p/pl?Submit=StoreIM&Depa=1&Category=38')
content = webpage.content
soup = BeautifulSoup(content, 'lxml')

containers = soup.find_all("div", class_="item-container")

brand = []

for container in containers:
    cont_brand = container.find_all("div", {"class": "item-info"})
    # Nested loop: runs once per container, not only for the last one
    for name_brand in cont_brand:
        brand.append(name_brand.find("img").get("alt"))
print(brand)

Output:

['EVGA', 'MSI', 'ASUS', 'MSI', 'Sapphire Tech', 'EVGA', 'GIGABYTE', 'XFX', 'ASUS', 'ASRock', 'EVGA', 'ASUS', 'EVGA', 'GIGABYTE', 'GIGABYTE', 'GIGABYTE', 'EVGA', 'EVGA', 'MSI', 'ASRock', 'EVGA', 'XFX', 'Sapphire Tech', 'ASRock', 'GIGABYTE', 'ASUS', 'MSI', 'MSI', 'MSI', 'MSI', 'MSI', 'EVGA', 'GIGABYTE', 'EVGA', 'ASUS', 'GIGABYTE']
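
To see why the nesting matters without hitting the site, here is a minimal standalone sketch. The nested lists are hypothetical stand-in data (not real Newegg results): each inner list plays the role of the "item-info" matches found inside one "item-container".

```python
# Hypothetical stand-in data: one inner list per "item-container".
containers = [["EVGA"], ["MSI"], ["ASUS"]]

# Buggy shape: the second loop starts only after the first loop has
# finished, so cont_brand holds the matches from the last container.
flat_brand = []
for container in containers:
    cont_brand = container
for name_brand in cont_brand:
    flat_brand.append(name_brand)

# Fixed shape: the second loop is nested, so it runs once per container.
nested_brand = []
for container in containers:
    cont_brand = container
    for name_brand in cont_brand:
        nested_brand.append(name_brand)

print(flat_brand)    # ['ASUS']
print(nested_brand)  # ['EVGA', 'MSI', 'ASUS']
```

The only difference between the two versions is the indentation of the second loop.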

If you have BeautifulSoup 4.7.1 or later, you can use this CSS selector instead:

import requests
from bs4 import BeautifulSoup

webpage = requests.get('https://www.newegg.com/p/pl?Submit=StoreIM&Depa=1&Category=38')
content = webpage.content
soup = BeautifulSoup(content, 'lxml')

brand = []

# Select every .item-info element nested inside an .item-container
for name_brand in soup.select(".item-container .item-info"):
    brand.append(name_brand.find_next('img').get("alt"))
print(brand)
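
One thing to be aware of: find_next searches forward through the rest of the document, not only inside the current element, so an item without its own image would pick up the next item's image. A possible variation, sketched below, puts the img into the selector itself. The HTML here is hypothetical markup that only mimics the class names used above, and the html.parser backend is chosen just to keep the sketch free of the lxml dependency.

```python
from bs4 import BeautifulSoup

# Hypothetical markup mimicking the structure the selector assumes.
html = """
<div class="item-container">
  <div class="item-info"><a><img alt="EVGA" src="a.png"></a></div>
</div>
<div class="item-container">
  <div class="item-info"><span>no logo here</span></div>
</div>
<div class="item-container">
  <div class="item-info"><img alt="MSI" src="b.png"></div>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# img[alt] matches only <img> tags that carry an alt attribute, so an
# item without a brand image is skipped instead of raising an error.
brand = [img["alt"] for img in soup.select(".item-container .item-info img[alt]")]
print(brand)  # ['EVGA', 'MSI']
```

Whether skipping items without an image is what you want depends on how you plan to line the brands up with other scraped fields.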
