擅长:python、mysql、java
<p>类似这样的解决方案可能就是您正在寻找的解决方案:</p>
<pre><code>import pandas as pd
series = [
('a@a.com','Bill', 'Schneider', 123, 321, 20190502),
('a@a.com', 'Damian', 'Schneider', 124, 231, 20190502),
('b@b.com', 'Bill', 'Schneider',164, 313, 20190503)
]
# Create a DataFrame object
df = pd.DataFrame(series, columns=['email', 'first name', 'last name', 'C_ID', 'A_ID', 'CreatedDate'])
# Find duplicate rows
df_duplicates = df[df.email.duplicated()]
print(df_duplicates)
</code></pre>