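<p>Is there a way to identify dummy data in a DataFrame and remove it? In the data below, I need to remove the random strings that appear in each column.</p>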
<pre><code>import pandas as pd
import numpy as np
data = {
    'Name': ['Tom', 'AABBCC', 'Joseph', 'Krish', 'XXXX', 'John', 'U'],
    'Address1': ['High Street', 'uwdfjfuf', '00000', 'Green Lane', 'Kingsway', 'Church Street', 'iwefwfn'],
    'Address2': ['Park Avenue', 'The Crescent', 'ABCXYZ', 'Highfield Road', 'Stanley Road', 'New Street', '1ca2s597'],
}
contact_details = pd.DataFrame(data)
# Code to identify and delete dummy data goes here
print(contact_details)
</code></pre>
<p>Output of the code above:</p>
<pre><code>     Name       Address1        Address2
0     Tom    High Street     Park Avenue
1  AABBCC       uwdfjfuf    The Crescent
2  Joseph          00000          ABCXYZ
3   Krish     Green Lane  Highfield Road
4    XXXX       Kingsway    Stanley Road
5    John  Church Street      New Street
6       U        iwefwfn        1ca2s597
</code></pre>
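<p>Have you inspected your data? Does "good data" always consist of a mix of lowercase and uppercase characters? If so, you could write a function that flags the dummy values, for example:</p>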
<pre><code>def is_dummy(text):
    # A value that is all lowercase or all uppercase (no mix of cases) is treated as dummy
    return text.lower() == text or text.upper() == text
</code></pre>
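<p>To apply that check to the whole DataFrame, something along these lines might work. It is only a sketch under the assumption stated above (good values always mix upper- and lowercase letters); the helper name <code>is_dummy</code> and the variables <code>dummy_mask</code>, <code>masked</code> and <code>cleaned</code> are made up for illustration. It shows two possible clean-ups: blanking only the flagged cells, or dropping every row that contains one.</p>

```python
import pandas as pd


def is_dummy(text):
    # Assumption: a real value mixes upper- and lowercase letters,
    # so anything that is all one case (or has no letters) is flagged.
    return text.lower() == text or text.upper() == text


data = {
    'Name': ['Tom', 'AABBCC', 'Joseph', 'Krish', 'XXXX', 'John', 'U'],
    'Address1': ['High Street', 'uwdfjfuf', '00000', 'Green Lane',
                 'Kingsway', 'Church Street', 'iwefwfn'],
    'Address2': ['Park Avenue', 'The Crescent', 'ABCXYZ', 'Highfield Road',
                 'Stanley Road', 'New Street', '1ca2s597'],
}
contact_details = pd.DataFrame(data)

# Boolean DataFrame: True wherever a cell looks like dummy data
dummy_mask = contact_details.apply(lambda col: col.map(is_dummy))

# Option 1: keep all rows, but replace the flagged cells with missing values
masked = contact_details.mask(dummy_mask)

# Option 2: drop every row that contains at least one flagged cell
cleaned = contact_details[~dummy_mask.any(axis=1)]

print(masked)
print(cleaned)
```

<p>Keep in mind that this heuristic will also flag legitimate values that happen to be all one case or contain no letters at all, such as "USA" or a house number, so check it against a sample of your real data before deleting anything.</p>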