<p>将<a href="https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.drop_duplicates.html" rel="nofollow noreferrer">^{<cd1>}</a>和<a href="https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.duplicated.html" rel="nofollow noreferrer">^{<cd2>}</a>与<code>keep=False</code>一起使用:</p>
<p>给定<code>df</code>:</p>
<pre><code> name rs number
0 11 5566 64882
1 41 534326 5345
2 11 5566 3312
3 44 2341 5553
4 1 6223 2333
</code></pre>
<p>使用<code>drop_duplicates</code>:</p>
<pre><code>uniq_df = df.drop_duplicates('rs', False)
print(uniq_df)
name rs number
1 41 534326 5345
3 44 2341 5553
4 1 6223 2333
</code></pre>
<p>使用<code>duplicated</code>:</p>
<pre><code>dup_df = df[df.duplicated('rs', False)]
print(dup_df)
name rs number
0 11 5566 64882
2 11 5566 3312
</code></pre>
<hr/>
<p>或者更简单,只使用<code>df.duplicated('rs', False)</code>:</p>
<pre><code>ind = df.duplicated('rs', False)
print(df[~ind])
name rs number
1 41 534326 5345
3 44 2341 5553
4 1 6223 2333
print(df[ind])
name rs number
0 11 5566 64882
2 11 5566 3312
</code></pre>