刮某属性靓汤Python

import requests from bs4 import BeautifulSoup page_number = 1 flippa_page = requests.get('https://www.flippa.com/search?filter[property_type]=domain&filter[status]=won&filter[sale_method]=auction&page[number]={}&page[size]=250'.format(page_number)) price_list = [] domain_list = [] for i in range(120): src = flippa_page.content soup = BeautifulSoup(src, 'lxml') for span_tag in soup.find_all('span'): domain_list.append(span_tag.attrs['class']) page_number += 1

3条回答

网友

1楼 · 编辑于 2024-10-03 21:28:05

使用html.parser而不是lxml，我可以找到span标记：

for item in soup.find_all('span'): 
     if (str(item.contents).find('CPCAdvertising.com')) > -1: 
         print(item) 

<span class="Basic___propertyName">CPCAdvertising.com</span>
<span class="Basic___title">CPCAdvertising.com - One Dollar Reserve !!</span>

由于某种原因，我无法用lxml进行解析，如果您能告诉我您使用的lxml库是什么，我可以用它进行检查

网友

2楼 · 编辑于 2024-10-03 21:28:05

因为你的网址不适合我，我用的是同一个网站的不同网址。无论如何，您可以在find_all()命令中指定类，如下所示：

import requests
from bs4 import BeautifulSoup

flippa_page = requests.get('https://flippa.com/10339489-e-commerce-sports-and-outdoor')
src = flippa_page.content
soup = BeautifulSoup(src, 'lxml')

for s in soup.find_all('span', {'class': 'ListingList-itemPrice'}):
    # Print out the text within the tag
    print(s.text.strip())

网友

3楼 · 编辑于 2024-10-03 21:28:05

单词应该在span_tag.string中

相关问题更多 >

编程相关推荐

热门问题

热门文章