擅长:python、mysql、java
<p>我会做如下的事情:</p>
<pre><code>def filter_outliers(df, columns):
masks = (make_mask(df, column) for column in columns)
full_mask = np.logical_or.reduce(masks)
return df[full_mask]
def make_mask(df, column):
standardized = (df[column] - df[column].mean())/df[column].std()
return standardized.abs() >= 2
</code></pre>