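I am trying to scrape the chance of rain and the temperature/wind speed for each baseball game from rotowire.com. Once I have scraped the data, I want to split it into three columns: rain, temperature, and wind. Thanks to another user I got close to getting the data, but I can't quite get all the way there. I have tried two approaches.
First approach: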
from bs4 import BeautifulSoup
import requests
import pandas as pd
url = 'https://www.rotowire.com/baseball/daily-lineups.php'
r = requests.get(url)
soup = BeautifulSoup(r.text, "html.parser")
weather = []
for i in soup.select(".lineup__bottom"):
    forecast = i.select_one('.lineup__weather-text').text
    weather.append(forecast)
This returns:
['\n100% Rain\r\n 66°\xa0\xa0Wind 8 mph In ', '\n0% Rain\r\n 64°\xa0\xa0Wind 4 mph L-R ', '\n0% Rain\r\n 69°\xa0\xa0Wind 7 mph In ', '\nDome\r\n In Domed Stadium\r\n ', '\n0% Rain\r\n 75°\xa0\xa0Wind 10 mph Out ', '\n0% Rain\r\n 68°\xa0\xa0Wind 9 mph R-L ', '\n0% Rain\r\n 82°\xa0\xa0Wind 9 mph ', '\n0% Rain\r\n 81°\xa0\xa0Wind 5 mph R-L ', '\nDome\r\n In Domed Stadium\r\n ', '\n1% Rain\r\n 75°\xa0\xa0Wind 4 mph R-L ', '\n1% Rain\r\n 71°\xa0\xa0Wind 6 mph Out ', '\nDome\r\n In Domed Stadium\r\n ']
The second approach I tried:
from bs4 import BeautifulSoup
import requests
import pandas as pd
url = 'https://www.rotowire.com/baseball/daily-lineups.php'
r = requests.get(url)
soup = BeautifulSoup(r.text, "html.parser")
#weather = []
for i in soup.select(".lineup__bottom"):
    forecast = i.select_one('.lineup__weather-text').text
    weather.append(forecast)
    #print(forecast)
    rain = i.select_one('.lineup__weather-text b:contains("Rain") ~ span').text
This raises an AttributeError: 'NoneType' object has no attribute 'text'
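You can find the cards that hold the game information and then look for the weather data at the bottom of each card (if there is any):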
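A minimal sketch of that idea, not a definitive implementation. It reuses the `.lineup__bottom` and `.lineup__weather-text` selectors from the question; the `SAMPLE_HTML` markup and the `collect_weather` helper are hypothetical, and the live page's structure may differ. The point is the `None` check: `select_one` returns `None` when nothing matches, and calling `.text` on that is what produces the AttributeError.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup for illustration; the real page may differ.
SAMPLE_HTML = """
<div class="lineup__bottom">
  <div class="lineup__weather-text"><b>0% Rain</b><br>75°&nbsp;&nbsp;Wind 10 mph Out</div>
</div>
<div class="lineup__bottom"></div>
"""


def collect_weather(html):
    """Return one cleaned weather string (or None) per .lineup__bottom block."""
    soup = BeautifulSoup(html, "html.parser")
    results = []
    for bottom in soup.select(".lineup__bottom"):
        node = bottom.select_one(".lineup__weather-text")
        if node is None:
            # No weather element in this card: record None instead of
            # calling .text on None.
            results.append(None)
            continue
        # split() + join collapses newlines, carriage returns and \xa0
        # into single spaces.
        results.append(" ".join(node.get_text(" ").split()))
    return results


weather = collect_weather(SAMPLE_HTML)
print(weather)
```

To run this against the real page, pass `requests.get(url).text` to `collect_weather` in place of `SAMPLE_HTML`.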
Output:
To get all of the data, see this example:
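One possible way to split each forecast into rain, temperature, and wind is a regular expression over the whitespace-normalized text. The sketch below is an assumption about how this could be done, not the original answer's code. It works on three of the raw strings from the first approach's output, and the names `RAW`, `PATTERN`, and `parse_forecast` are made up for illustration. Domed stadiums have no forecast, so they yield `None` in every field.

```python
import re

# Three raw strings taken from the first approach's output in the question.
RAW = [
    '\n100% Rain\r\n 66°\xa0\xa0Wind 8 mph In ',
    '\nDome\r\n In Domed Stadium\r\n ',
    '\n0% Rain\r\n 82°\xa0\xa0Wind 9 mph ',
]

# Matches e.g. "100% Rain 66° Wind 8 mph In" after whitespace is normalized.
PATTERN = re.compile(r"(\d+)% Rain\s+(\d+)°\s+Wind\s+(.+)")


def parse_forecast(text):
    """Split one forecast string into rain (%), temp (degrees) and wind."""
    cleaned = " ".join(text.split())  # collapses \n, \r and \xa0
    match = PATTERN.search(cleaned)
    if match is None:
        # Domed stadium or an unexpected format: no usable forecast.
        return {"rain": None, "temp": None, "wind": None}
    rain, temp, wind = match.groups()
    return {"rain": int(rain), "temp": int(temp), "wind": wind.strip()}


rows = [parse_forecast(text) for text in RAW]
for row in rows:
    print(row)
```

Passing `rows` to `pd.DataFrame(rows)` gives a frame with the three columns `rain`, `temp`, and `wind`.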
Output: