I have the following data:
I am trying to use the pyjarowinkler library to find the distance between strings. My hello-world code works:
# Hello World
from pyjarowinkler import distance
d1 = distance.get_jaro_distance("Hello", "hello", winkler=True, scaling=0.1)
d1
When I try to iterate over each row or use apply, my code fails. Can anyone point me in the right direction?
#Import data
import pandas
df = pandas.read_csv('data.csv')
from pyjarowinkler import distance
score=df.apply(distance.get_jaro_distance(df[S1],df[Stores]))
# iterating over rows using iterrows() function
for i, j in df.iterrows():
    print(i, j, distance.get_jaro_distance(i, j, winkler=True, scaling=0.1))
    print()
Error:
JaroDistanceException: Cannot calculate distance from NoneType (int, Series)
The expected output is:
I think you should be able to do this.
Note that the axis=1 argument is passed to .apply, which tells it to operate on df row by row rather than column by column.
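A row-wise call of this kind might look like the minimal sketch below. The column names S1 and Stores come from the question. The sample rows, the helper score_rows, and the scorer stand_in_score are made up for illustration. stand_in_score uses difflib from the standard library and is not Jaro-Winkler; it is only there so the sketch runs without pyjarowinkler installed.

```python
from difflib import SequenceMatcher

import pandas as pd


def stand_in_score(a, b):
    # Hypothetical stand-in scorer (difflib ratio), NOT Jaro-Winkler.
    return SequenceMatcher(None, a, b).ratio()


def score_rows(df, scorer, left="S1", right="Stores"):
    # axis=1 passes each row to the lambda as a Series, so row[left]
    # and row[right] are single cell values rather than whole columns.
    # str() guards against non-string cells such as numbers.
    return df.apply(lambda row: scorer(str(row[left]), str(row[right])), axis=1)


# Made-up sample rows; the question's data.csv is not available here.
df = pd.DataFrame({"S1": ["Hello", "Walmart"], "Stores": ["Hello", "Target"]})
df["score"] = score_rows(df, stand_in_score)

# Equivalent loop: iterrows() yields (index, row) pairs, so the strings
# have to be read out of the row instead of passing index and row directly.
loop_scores = [
    stand_in_score(str(row["S1"]), str(row["Stores"]))
    for _, row in df.iterrows()
]

print(df)
```

To score with pyjarowinkler instead, the scorer passed to score_rows could be lambda a, b: distance.get_jaro_distance(a, b, winkler=True, scaling=0.1), mirroring the hello-world call in the question. The loop in the question fails because i is the row index (an int) and j is the entire row (a Series), which matches the (int, Series) in the error message.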