如何使用BeautifulSoup刮取超链接标题？

import json from bs4 import BeautifulSoup import requests import csv from datetime import datetime url = 'https://viewyourdeal-gabrielsimone.com' gmaInfo=[] response = requests.get(url, timeout=5) content = BeautifulSoup(response.content, "html.parser") for info in content.findAll('div', attrs={"class" : "wrapper ease-animation"}): gridObject = { "title" : info.find('div', attrs={"class" : "title animation allgrey"}), "price" : info.find('span', attrs={"class":"red-price"}).text } print(gridObject) with open('index.csv', 'w') as csv_file: writer = csv.writer(csv_file) writer.writerow([gridObject])

2条回答

网友

1楼 · 编辑于 2024-09-30 10:33:11

我对我的div类太具体了，我把类改成了简单的标题，效果很好

网友

2楼 · 编辑于 2024-09-30 10:33:11

在下面的代码中，很少有项返回为None。只需提供If条件If元素exists获取文本

from bs4 import BeautifulSoup
import requests
import csv
from datetime import datetime

url = 'https://viewyourdeal-gabrielsimone.com'

gmaInfo=[]
response = requests.get(url, timeout=5)
content = BeautifulSoup(response.content, "html.parser")

for info in content.findAll('div', attrs={"class" : "wrapper ease-animation"}):
   if info.find('div', attrs={"class": "title animation allgrey"}):
     gridObject = {
            "title" : info.find('div', attrs={"class" : "title animation allgrey"}).text.strip(),
            "price" : info.find('span', attrs={"class":"red-price"}).text
            }
     print(gridObject)
     with open('index.csv', 'w') as csv_file:
        writer = csv.writer(csv_file)
        writer.writerow([gridObject])

相关问题更多 >

编程相关推荐

热门问题

热门文章