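<p>By iterating over the Unicode characters in each row (using the <a href="https://pypi.org/project/emoji/" rel="nofollow noreferrer">emoji</a> and <a href="https://docs.python.org/3/library/unicodedata.html" rel="nofollow noreferrer">unicodedata</a> packages), you can detect the rows that contain <em>only</em> emojis and replace them with NaN:</p>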
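<pre><code>import unicodedata

import numpy as np
# UNICODE_EMOJI ships with older releases of the emoji package;
# releases from 2.0 on expose emoji.EMOJI_DATA instead.
from emoji import UNICODE_EMOJI

# A plain dict stands in for a DataFrame with a 'Comments' column.
df = {}
df['Comments'] = ["Test", "Hello 😉", "😉😉😉"]

for i in range(len(df['Comments'])):
    pure_emoji = True
    # Check every character of the NFC-normalized comment.
    for unicode_char in unicodedata.normalize('NFC', df['Comments'][i]):
        if unicode_char not in UNICODE_EMOJI:
            pure_emoji = False
            break
    if pure_emoji:
        # Replace emoji-only comments with NaN (np.NaN was removed in NumPy 2.0).
        df['Comments'][i] = np.nan

print(df['Comments'])
</code></pre>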
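<p>A per-character lookup like the loop above may reject emojis that are built from several code points, such as family emojis joined by a zero width joiner, because the joiner on its own is not an emoji. The following is a minimal sketch of a standard-library-only variant, not the approach of the emoji package: it assumes that characters in the Unicode category <code>So</code> (Symbol, other) are a good enough stand-in for emojis, which also lets symbols such as © through and misses keycap emojis. The names <code>is_emoji_like</code> and <code>is_pure_emoji</code> are hypothetical helpers introduced for illustration.</p>

```python
import unicodedata

# Assumption: these code points glue or modify emojis rather than being emojis.
ZWJ = "\u200d"                    # zero width joiner
VARIATION_SELECTOR_16 = "\ufe0f"  # requests emoji presentation


def is_emoji_like(ch):
    """Heuristic check: joiners, skin-tone modifiers, or category 'So'."""
    if ch in (ZWJ, VARIATION_SELECTOR_16):
        return True
    if 0x1F3FB <= ord(ch) <= 0x1F3FF:  # skin-tone modifiers
        return True
    return unicodedata.category(ch) == "So"


def is_pure_emoji(text):
    """Return True if text is non-empty and consists only of emoji-like characters."""
    text = unicodedata.normalize("NFC", text)
    # Require at least one real symbol so empty strings or bare joiners do not count.
    has_symbol = any(unicodedata.category(ch) == "So" for ch in text)
    return has_symbol and all(is_emoji_like(ch) for ch in text)


comments = ["Test", "Hello 😉", "😉😉😉", ""]
# None marks the emoji-only comments; swap in float("nan") if NaN is preferred.
cleaned = [None if is_pure_emoji(c) else c for c in comments]
print(cleaned)
```

<p>With a pandas DataFrame, the same predicate could be passed to <code>df['Comments'].apply(is_pure_emoji)</code> to build a boolean mask, assuming the column holds strings.</p>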