How does sklearn.linear_model.LinearRegression find a solution on a multicollinear dataset?

1 answer

User

#1 · Posted 2024-09-29 23:17:30

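You could try Ridge regression, which is LinearRegression plus L2 regularization. L2 regularization means that it does not only minimize the squared distance to the targets, but also tries to keep the coefficients small. This makes it more stable on collinear datasets.

From the documentation: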

Ridge regression addresses some of the problems of Ordinary Least Squares by imposing a penalty on the size of the coefficients. The ridge coefficients minimize a penalized residual sum of squares:

$$\min_{w} \|Xw - y\|_2^2 + \alpha \|w\|_2^2$$

The complexity parameter $\alpha \geq 0$ controls the amount of shrinkage: the larger the value of $\alpha$, the greater the amount of shrinkage and thus the coefficients become more robust to collinearity.

See https://scikit-learn.org/stable/modules/linear_model.html#ridge-regression and https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.Ridge.html
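To make the effect of the L2 penalty concrete, here is a minimal sketch on synthetic data. The dataset, the noise levels, and `alpha=1.0` are assumptions chosen for illustration only, not values from the question. Two almost identical features are fitted with `LinearRegression` and with `Ridge`:

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

# Synthetic data (an assumption for illustration): x2 is almost a copy of x1.
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = x1 + 1e-6 * rng.normal(size=n)
X = np.column_stack([x1, x2])
y = 3.0 * x1 + 0.1 * rng.normal(size=n)

# Plain least squares: the two coefficients are only pinned down through
# the tiny difference between x1 and x2, so they can become very large.
ols = LinearRegression().fit(X, y)

# Ridge: the L2 penalty shrinks the coefficients and spreads the weight
# roughly evenly over the two nearly identical features.
ridge = Ridge(alpha=1.0).fit(X, y)

print("OLS coefficients:  ", ols.coef_)
print("Ridge coefficients:", ridge.coef_)
```

In this setup the Ridge coefficients should each land near 1.5, so they add up to roughly the true effect of 3, while the OLS coefficients depend strongly on the noise. In practice `alpha` would be tuned, for example with `RidgeCV`.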

I disagree with the following:

Antoine Dubuis: LinearRegression is using least square and therefore requires having an invertible X.T @ X.

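Maybe it was not meant that way, but technically you do not need an invertible covariance matrix to do linear regression / least squares. You can do it with a QR decomposition, for example. I guess he meant that it is necessary for the implementation in sklearn. But I think it would be wise to do some regularization. Think of it this way: if your problem is that it has many solutions (that is what collinear means), how do you want to resolve that? Do you want to pick just any solution, or do you want to come up with a criterion for which solution is best, for example the most common one?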
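The "many solutions" point can be illustrated with a small sketch. It uses NumPy's `np.linalg.lstsq` as a stand-in least-squares solver, which is an assumption made for illustration and not a claim about what sklearn does internally. The data is synthetic, with two identical columns so that `X.T @ X` is singular:

```python
import numpy as np

# Synthetic data (an assumption for illustration): two identical columns,
# so X.T @ X is singular and cannot be inverted.
rng = np.random.default_rng(0)
x = rng.normal(size=50)
X = np.column_stack([x, x])
y = 2.0 * x

gram_rank = np.linalg.matrix_rank(X.T @ X)
print("rank of X.T @ X:", gram_rank)

# lstsq still returns a least-squares solution for rank-deficient X.
# Among all solutions it picks the one with the smallest Euclidean norm.
w_min_norm, _, rank, _ = np.linalg.lstsq(X, y, rcond=None)
print("minimum-norm solution:", w_min_norm)

# A different coefficient vector that fits the data exactly as well:
# any w with w[0] + w[1] == 2 reproduces y.
w_other = np.array([2.0, 0.0])
print("both fit exactly:",
      np.allclose(X @ w_min_norm, y), np.allclose(X @ w_other, y))
print("norms:", np.linalg.norm(w_min_norm), np.linalg.norm(w_other))
```

Both coefficient vectors reproduce `y`, so the data alone cannot decide between them. "Smallest norm" is one possible criterion for choosing, and Ridge regression is another, closely related one.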
