Answers to: Data Standardization vs Normalization vs Robust Scaler

1 answer

Anonymous, 1 day ago

<blockquote> Am I right to say that also Standardization gets affected negatively by the extreme values as well? </blockquote>

Indeed you are; the scikit-learn <a href="http://scikit-learn.org/0.18/auto_examples/preprocessing/plot_robust_scaling.html" rel="noreferrer">docs</a> themselves clearly warn about this case:

<blockquote> However, when data contains outliers, <a href="http://scikit-learn.org/0.18/modules/generated/sklearn.preprocessing.StandardScaler.html#sklearn.preprocessing.StandardScaler" rel="noreferrer"><code>StandardScaler</code></a> can often be mislead. In such cases, it is better to use a scaler that is robust against outliers. </blockquote>

More or less the same holds for <code>MinMaxScaler</code> as well.

<blockquote> I really can't see how the Robust Scaler improved the data because I still have extreme values in the resulted data set? Any simple -complete interpretation? </blockquote>

Robust does not mean immune or invulnerable, and the purpose of scaling is not to "remove" outliers and extreme values; that is a separate task with its own methods. This is again stated explicitly in the <a href="http://scikit-learn.org/stable/auto_examples/preprocessing/plot_all_scaling.html#robustscaler" rel="noreferrer">relevant scikit-learn docs</a>:

<blockquote> RobustScaler [...] Note that the outliers themselves are still present in the transformed data. If a separate outlier clipping is desirable, a non-linear transformation is required (see below). </blockquote>

where "see below" refers to <a href="http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.QuantileTransformer.html#sklearn.preprocessing.QuantileTransformer" rel="noreferrer"><code>QuantileTransformer</code></a> and <a href="http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.quantile_transform.html" rel="noreferrer"><code>quantile_transform</code></a>.
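
To make the point concrete, here is a minimal pure-Python sketch of the three scalings applied to a single made-up feature with one extreme value. It is not scikit-learn's code: the data and the helper names (`standard_scale`, `minmax_scale`, `robust_scale`) are assumptions for illustration, and the formulas are meant to mirror what `StandardScaler`, `MinMaxScaler` and `RobustScaler` compute with their default settings (population standard deviation, range 0 to 1, median and interquartile range).

```python
import statistics

# Hypothetical feature: five ordinary values and one extreme value.
data = [1.0, 2.0, 3.0, 4.0, 5.0, 100.0]


def standard_scale(xs):
    """(x - mean) / std, using the population standard deviation."""
    mean = statistics.fmean(xs)
    std = statistics.pstdev(xs)
    return [(x - mean) / std for x in xs]


def minmax_scale(xs):
    """(x - min) / (max - min), mapping the data onto [0, 1]."""
    lo, hi = min(xs), max(xs)
    return [(x - lo) / (hi - lo) for x in xs]


def robust_scale(xs):
    """(x - median) / IQR, with linearly interpolated quartiles."""
    median = statistics.median(xs)
    q1, _, q3 = statistics.quantiles(xs, n=4, method="inclusive")
    return [(x - median) / (q3 - q1) for x in xs]


standard = standard_scale(data)
minmax = minmax_scale(data)
robust = robust_scale(data)

for name, scaled in [("standard", standard), ("minmax", minmax), ("robust", robust)]:
    print(f"{name:>8}: {[round(v, 2) for v in scaled]}")
```

With this data, the mean and standard deviation (and the min and max) are dominated by the value 100, so the first two scalings squeeze the five ordinary values into a narrow band: roughly -0.50 to -0.39 for standardization and 0 to 0.04 for min-max. The median and interquartile range barely notice the outlier, so the robust scaling spreads the ordinary values over -1 to 0.6. The outlier itself is still there afterwards, at about 38.6, which is exactly what the docs quoted above warn about.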
