我想将变量发送到find_all(),这是我的代码:
import requests
from bs4 import BeautifulSoup
url = 'https://www.elheraldo.co'
page = requests.get(url)
soup = BeautifulSoup(page.content, 'html.parser')
new = soup.find_all(class_=['titulo'])
在上面的示例中,我使用了两个常量,url和class_=['titulo'],但考虑到这些数据,我想自动执行此操作:
newspapers = {'https://www.elheraldo.co': "class_=['titulo']",
'https://www.zonacero.com': "'div', class_=['title']",
'https://www.elpilon.com.co': "class_=['land-see-post-title']",
'https://www.eluniversal.com.co': "'div', class_=['headline']",
'https://www.diariodelcesar.com': "'h2', class_=['title']",
'https://www.hoydiariodelmagdalena.com.co': "'h2', class_=['title']",
'https://www.diariodelnorte.net': "'h3', itemprop=['name']"}
这可以像这样自动化:
我建议将字典的值作为CSS选择器,您可以将其放入
soup.select()
方法中例如:
印刷品:
相关问题 更多 >
编程相关推荐