擅长:python、mysql、java
<p>用随机值代替pandas数据帧列中的缺失值很简单。在</p>
<pre><code>mean = df['column'].mean()
std = df['column'].std()
def fill_missing_from_Gaussian(column_val):
if np.isnan(column_val) == True:
column_val = np.random.normal(mean, std, 1)
else:
column_val = column_val
return column_val
</code></pre>
<p>现在只需将上述方法应用于缺少值的列。在</p>
^{pr2}$