How do I use RandomForestRegressor with scikit-learn and pandas in Python to predict future results?

import pandas as pd
from sportsreference.ncaab.teams import Teams
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

FIELDS_TO_DROP = ['away_points', 'home_points', 'date', 'location',
                  'losing_abbr', 'losing_name', 'winner', 'winning_abbr',
                  'winning_name', 'home_ranking', 'away_ranking']
TARGETS = ['home_points', 'away_points']

# Collect the extended schedule of every team into one DataFrame
dataset = pd.DataFrame()
teams = Teams()
for team in teams:
    dataset = pd.concat([dataset, team.schedule.dataframe_extended])

# Clean features and targets together so that X and y keep the same rows
non_target_drops = [c for c in FIELDS_TO_DROP if c not in TARGETS]
clean = dataset.drop(columns=non_target_drops).dropna().drop_duplicates()
X = clean.drop(columns=TARGETS)
y = clean[TARGETS].values

X_train, X_test, y_train, y_test = train_test_split(X, y)
parameters = {'bootstrap': False, 'min_samples_leaf': 3, 'n_estimators': 50,
              'min_samples_split': 10, 'max_features': 'sqrt', 'max_depth': 6}
model = RandomForestRegressor(**parameters)
model.fit(X_train, y_train)
print(model.predict(X_test).astype(int), y_test)

1 Answer

User

#1 · Posted 2024-09-29 17:23:43

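Think of it this way: to test a model's goodness of fit, you have to know the actual outcomes in advance. Only then can you measure the distance between the model's output and the real results and make the adjustments needed to improve the model's overall performance.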
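As a rough illustration of measuring that distance, the sketch below fits a forest on synthetic data and computes the mean absolute error on a held-out test set. The data, the column names, and the choice of mean_absolute_error as the metric are assumptions made for this example only; they do not come from the question.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split

# Synthetic, made-up data purely for illustration (not real match data).
rng = np.random.default_rng(0)
n = 200
X = pd.DataFrame({
    "Temperature": rng.uniform(5, 35, n),
    "Humidity": rng.uniform(30, 90, n),
})
y = pd.DataFrame({
    "Score_A": 0.1 * X["Temperature"] + rng.normal(0, 0.3, n),
    "Score_B": 6 - 0.05 * X["Humidity"] + rng.normal(0, 0.3, n),
})

# Hold out 25% of the rows; their true outcomes are known but hidden from training.
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)
model = RandomForestRegressor(n_estimators=50, random_state=42)
model.fit(X_train, y_train)

# Compare predictions with the known outcomes: one error value per target column.
y_pred = model.predict(X_test)
mae = mean_absolute_error(y_test, y_pred, multioutput="raw_values")
print(dict(zip(y.columns, mae.round(3))))
```

A lower error on the held-out rows suggests a better fit. This kind of check is only possible because the true scores of the test rows are already known.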

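Once the model is trained, predicting future values (that is, cases where the outcome is not yet known) means giving the model the same features it was trained on, filled in with the data for the case you want to predict. Below is a very basic example that uses two variables to predict the scores of two teams (A and B):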

import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

# Toy data: two features (Temperature, Humidity) and two targets (Score_A, Score_B)
data = {'Temperature': [10, 20, 30, 25], 'Humidity': [40, 50, 80, 65],
        'Score_A': [1, 2, 3, 2], 'Score_B': [6, 3, 1, 2]}
df = pd.DataFrame(data)
print(df)

# Split the columns into features (X) and targets (Y)
X = df[['Temperature', 'Humidity']]
Y = df[['Score_A', 'Score_B']]

# Hold out part of the data for testing, then fit on the training part
X_train, X_test, y_train, y_test = train_test_split(X, Y, random_state=42)
model = RandomForestRegressor(random_state=42)
model.fit(X_train, y_train)

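At this point the model is trained. To make a future prediction, I pass in the same features I used for training (temperature and humidity), but with the values I want a prediction for. Suppose our friend the meteorologist says the temperature and humidity for the next match will be 35 and 70. I then call .predict() with those values: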

print(model.predict([[35, 70]]))

This returns the following output:

[[2.6 1.4]]

If you want the output to look nicer:

prediction = model.predict([[35,70]])
print("Team A will score: ",prediction[0][0])
print("Team B will score: ",prediction[0][1])

Team A will score:  2.6
Team B will score:  1.4
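The same idea extends to several upcoming games at once. The sketch below is a variation on the answer, not the answer's own code: it trains on all four known rows instead of using train_test_split, and it passes the future values as a DataFrame with the same column names as the training features. The `future` values are hypothetical forecasts invented for illustration.

```python
import pandas as pd
from sklearn.ensemble import RandomForestRegressor

# Same toy data as in the answer above.
data = {'Temperature': [10, 20, 30, 25], 'Humidity': [40, 50, 80, 65],
        'Score_A': [1, 2, 3, 2], 'Score_B': [6, 3, 1, 2]}
df = pd.DataFrame(data)
features = ['Temperature', 'Humidity']
targets = ['Score_A', 'Score_B']

# Assumption of this sketch: train on every known game, with nothing held out.
model = RandomForestRegressor(random_state=42)
model.fit(df[features], df[targets])

# Hypothetical forecasts for upcoming games; the values are made up.
future = pd.DataFrame({'Temperature': [35, 15], 'Humidity': [70, 45]})

# Select the feature columns in the training order, then label the predictions.
predictions = pd.DataFrame(model.predict(future[features]),
                           columns=targets, index=future.index)
print(future.join(predictions))
```

Keeping the feature names and their order identical between training and prediction helps make sure each value reaches the column the model expects.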
