在我使用scikit_learn和pandas训练了一个模型之后，我如何预测未来的数据（在我的例子中是降雨）？问题的回答

在我使用scikit_learn和pandas训练了一个模型之后，我如何预测未来的数据（在我的例子中是降雨）？

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我正在训练一个模型来预测未来的降雨数据。我已经完成了模型的培训。我正在使用这个数据集：<a href="https://www.kaggle.com/redikod/historical-rainfall-data-in-bangladesh" rel="nofollow noreferrer">https://www.kaggle.com/redikod/historical-rainfall-data-in-bangladesh</a> 看起来是这样的： <pre><code> Station Yea Month Day Rainfall dayofyear 1970-01-01 1 Dhaka 1970 1 1 0 1 1970-01-02 1 Dhaka 1970 1 2 0 2 1970-01-03 1 Dhaka 1970 1 3 0 3 1970-01-04 1 Dhaka 1970 1 4 0 4 1970-01-05 1 Dhaka 1970 1 5 0 5 </code></pre> 我使用在线找到的代码作为参考，使用训练和测试数据完成了培训。然后我也检查了预测值和真实值 这是密码 <pre><code>import numpy as np import pandas as pd import matplotlib.pyplot as plt import seaborn as sns import tensorflow as tf #data is in local folder df = pd.read_csv("data.csv") df.head(5) df.drop(df[(df['Day']>28) & (df['Month']==2) & (df['Year']%4!=0)].index,inplace=True) df.drop(df[(df['Day']>29) & (df['Month']==2) & (df['Year']%4==0)].index,inplace=True) df.drop(df[(df['Day']>30) & ((df['Month']==4)|(df['Month']==6)|(df['Month']==9)|(df['Month']==11))].index,inplace=True) date = [str(y)+'-'+str(m)+'-'+str(d) for y, m, d in zip(df.Year, df.Month, df.Day)] df.index = pd.to_datetime(date) df['date'] = df.index df['dayofyear']=df['date'].dt.dayofyear df.drop('date',axis=1,inplace=True) df.head() df.size() df.info() df.plot(x='Year',y='Rainfall',style='.', figsize=(15,5)) train = df.loc[df['Year'] <= 2015] test = df.loc[df['Year'] == 2016] train=train[train['Station']=='Dhaka'] test=test[test['Station']=='Dhaka'] X_train=train.drop(['Station','StationIndex','dayofyear'],axis=1) Y_train=train['Rainfall'] X_test=test.drop(['Station','StationIndex','dayofyear'],axis=1) Y_test=test['Rainfall'] from sklearn import svm from sklearn.svm import SVC model = svm.SVC(gamma='auto',kernel='linear') model.fit(X_train, Y_train) Y_pred = model.predict(X_test) df1 = pd.DataFrame({'Actual Rainfall': Y_test, 'Predicted Rainfall': Y_pred}) df1[df1['Predicted Rainfall']!=0].head(10) </code></pre> 在此之后，我尝试实际使用该模型预测未来几天/几个月/几年的降雨量。我使用了一些，比如一些用于股票价格的（在调整代码之后）。但它们似乎都不起作用。因为我已经训练了这个模型，所以我认为预测未来几天是很容易的。假设我用1970-2015年的数据进行培训，用2016年的数据进行测试。现在我想预测2017年的降雨量。差不多吧 我的问题是，我如何以直观的方式做到这一点 如果有人能回答这个问题，我将不胜感激 编辑@Mercury: 这是使用该代码后的实际结果。我怀疑模型是否在运行。。。这是实际结果的图像：<a href="https://i.stack.imgur.com/81Vk1.png" rel="nofollow noreferrer">https://i.stack.imgur.com/81Vk1.png</a>

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

在我使用scikit_learn和pandas训练了一个模型之后，我如何预测未来的数据（在我的例子中是降雨）？

1 个回答

相关Python问题