如何使用汤。选择和我做错了什么?

2024-10-01 15:38:17 发布

您现在位置:Python中文网/ 问答频道 /正文

我想让我的程序在亚马逊上获得产品的价格。我试图用soup.select来做这件事,但失败了(还没有用soup.find来做)。 右键单击items price上的inspect可显示以下html代码:

<span id="price_inside_buybox" class="a-size-medium a-color-price">
    ILS&nbsp;53.35
</span>

这是我的代码,我做错了什么,我应该如何正确使用soup.select

import bs4, requests,
amazonlink = "https://www.amazon.com/UNSCENTED-PURITO-Ingredients-sunscreen-Lightweight/dp/B07WQXP3D8/ref=as_li_ss_tl?crid=V29993T4TI1W&dchild=1&keywords=purito+comfy+water+sunblock&qid=1585239706&sprefix=purito+comfy,aps,299&sr=8-2&linkCode=sl1&tag=drdrayzday-20&linkId=7231993512a225ded0b474b852d105bc&language=en_US"

res = requests.get(amazonlink)
soup = bs4.BeautifulSoup(res.text,'html.parser') #creates soup object 
print(soup.select("price_inside_buybox"))
print(soup.find("span",id="price_inside_buybox")) 

编辑:我的输出为空列表或无


Tags: 代码idhtmlfindrequestsselectpricespan
2条回答

您需要添加user_agent并将解析器更改为lxml。 如果没有lxml解析器,则需要使用

pip install lxml

代码

import bs4, requests
amazonlink = "https://www.amazon.com/UNSCENTED-PURITO-Ingredients-sunscreen-Lightweight/dp/B07WQXP3D8/ref=as_li_ss_tl?crid=V29993T4TI1W&dchild=1&keywords=purito+comfy+water+sunblock&qid=1585239706&sprefix=purito+comfy,aps,299&sr=8-2&linkCode=sl1&tag=drdrayzday-20&linkId=7231993512a225ded0b474b852d105bc&language=en_US"
headers = {'User-Agent':'Mozilla/5.0'}
res = requests.get(amazonlink,headers=headers)
soup = bs4.BeautifulSoup(res.text,'lxml')
print(soup.select_one("#price_inside_buybox").text.strip())

这将是一个更好、更干净的解决方案,因为如果amazon出于任何原因更改其html,您的脚本将无法再工作,那么就使用AmazonAPI

https://docs.aws.amazon.com/AWSECommerceService/latest/DG/Welcome.html

相关问题 更多 >

    热门问题