擅长:python、mysql、java
<p>您需要一个选择器,该选择器以src以.jpg结尾的所有img为目标,但还需要排除几个其他早期匹配项。使用<code>:not</code>和前面的<code>.a-row</code>可以做到这一点。最后,您需要使用set来清除唯一项</p>
<pre><code>import requests
from bs4 import BeautifulSoup as bs
from pprint import pprint
r = requests.get('https://www.amazon.es/b/ref=sv_ap_arrow_ce_4_1_1_1?node=934359031', headers = {'User-Agent':'Mozilla/5.0'})
soup = bs(r.content, 'lxml')
images = set(i['src'] for i in soup.select('.a-row img[src$=jpg]:not(.bxc-grid__row:nth-child(1) img[src$=jpg])'))
pprint(images)
</code></pre>