擅长:python、mysql、java
<p>可以使用difflib计算距离</p>
<pre><code>import difflib as dfl
dfl.SequenceMatcher(None,'John Doe', 'John doe').ratio()
</code></pre>
<p>编辑:与熊猫的集成:</p>
<pre><code>import pandas as pd
import difflib as dfl
df = pd.DataFrame({'A': ["john doe", " john doe", 'John'], 'B': [' john doe', 'eddie murphy', 'batman']})
df['VAR1'] = df.apply(lambda x : dfl.SequenceMatcher(None, x['A'], x['B']).ratio(),axis=1)
</code></pre>