Python中Float错误的文本无效

train, test = train_test_split(h1, test_size = 0.5, random_state=0) my_features = ['bedrooms', 'bathrooms', 'sqft_living', 'sqft_lot', 'floors', 'zipcode'] trainInp = train[my_features] target = ['price'] trainOut = train[target] regr = LinearRegression() # Train the model using the training sets regr.fit(trainInp, trainOut) print('Coefficients: \n', regr.coef_) testPred = regr.predict(test)

1条回答

网友

1楼 · 发布于 2024-09-27 18:19:14

您的问题是，您正在将模型拟合到整个数据帧中选定的一组特性上（您可以trainInp = train[my_features]），但您试图预测完整的特性集（regr.predict(test)），包括非数字特性，如date。在

因此，与其做regr.predict(test)，不如做regr.predict(test[my_features])。更一般地说，请记住，无论您对训练集应用什么样的预处理（规范化、特征选择、PCA…），您也应该应用于测试集。在

或者，在进行列车测试拆分之前，您可以缩减到感兴趣的特性集：

my_features = ['bedrooms', 'bathrooms', ...]
train, test = train_test_split(h1[my_features], test_size = 0.5, random_state=0)

相关问题更多 >

编程相关推荐

热门问题

热门文章