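<p>Try this. The logic is to check whether each string can be encoded using only ASCII characters. A row is kept when exactly one of its two name columns contains non-ASCII characters.</p>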
<pre><code># -*- coding: utf-8 -*-
import pandas as pd

df = pd.DataFrame({
    "FirstName": ["jocovich", "smith", " 西德哈斯", "yuvi", " bsreddy", "sreeja"],
    "LastName": ["nadhal", "pointing", "supreet", "雷迪", "rakshita", "巴尔加维"],
    "blah": ["example", "example", "example", "example", "example", "example"],
})

def isEnglish(s):
    # s is one row holding the FirstName and LastName values
    o = list(s)
    ot = []
    for n in o:
        try:
            # Decoding the UTF-8 bytes as ASCII fails if n has any non-ASCII character
            n.encode(encoding='utf-8').decode('ascii')
        except UnicodeDecodeError:
            ot.append(False)
        else:
            ot.append(True)
    # Keep the row only when exactly one of the two names is ASCII-only
    return ot == [True, False] or ot == [False, True]

output = df[df[["FirstName", "LastName"]].apply(isEnglish, axis=1)]
print(output)
</code></pre>
<p>Output:</p>
<pre><code>   FirstName  LastName     blah
2       西德哈斯   supreet  example
3       yuvi        雷迪  example
5     sreeja      巴尔加维  example
</code></pre>
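<p>As a possible alternative to the row-wise function above, here is a minimal sketch that builds a boolean mask per column instead. It assumes Python 3.7+ (for <code>str.isascii</code>) and that both name columns contain only strings. The names <code>is_ascii</code> and <code>mask</code> are illustrative and not part of the original answer.</p>

```python
import pandas as pd

df = pd.DataFrame({
    "FirstName": ["jocovich", "smith", " 西德哈斯", "yuvi", " bsreddy", "sreeja"],
    "LastName": ["nadhal", "pointing", "supreet", "雷迪", "rakshita", "巴尔加维"],
    "blah": ["example", "example", "example", "example", "example", "example"],
})

# One boolean column per name column: True where the value is ASCII-only.
# Assumption: every cell is a str (no NaN or None values).
is_ascii = df[["FirstName", "LastName"]].apply(
    lambda col: col.map(str.isascii).astype(bool)
)

# XOR is True when exactly one of the two names is ASCII-only,
# which matches the [True, False] / [False, True] check above.
mask = is_ascii["FirstName"] ^ is_ascii["LastName"]

output = df[mask]
print(output)
```

<p>With the sample data this should select the same rows (index 2, 3 and 5). If the columns can contain missing values, they would need to be filled or filtered out first.</p>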