<p>我有一个df,现在看起来像这样:</p>
<pre><code>Car Name Number
Adam Leaf 9
Adamm Leaf 9
Adam Lea NaN
Adam-Leaf NaN
Adam/Leaf 9
Claire-Green NaN
Cliare Green 3
Claire Green 3
Claire Gren NaN
Claire/Green 3
</code></pre>
<p>我正在尝试删除这些变体以实现类似的效果</p>
<pre><code>Car Name Number
Adam Leaf 9
Claire Green 3
</code></pre>
<p>这里有一条从<code>jellyfish</code></p>
<pre><code>import jellyfish
s=df.groupby(df['Car Name'].apply(jellyfish.soundex)).first()
Car Name Number
Car Name
A354 Adam Leaf 9.0
C462 Claire-Green 3.0
</code></pre>