Selecting the maximum Jaro-Winkler similarity when comparing strings in Python

Posted 2024-10-02 16:29:09


I'm fairly new to Python. Below is the string EmployerName that I want to compare against the data in the DataFrame TData:

import textdistance
import pandas as pd

EmployerName="MIDWEST UNDERGROUND SUPPLY"
TData[['EmployerName','EmploymentStatus']]

    EmployerName                                        EmploymentStatus
0   ups                                                 No Longer Employed
1   midwest underground supply llc                      Inactive
2   us department of veterans affairs-office of fi...   Inactive
3   us department of homeland security                  Inactive
4   towne park, ltd.                                    Separated

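I want to use textdistance.jaro_winkler to compare the string EmployerName with each EmployerName in TData, then select the row with the highest score along with its EmploymentStatus. For a single row this works: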

 textdistance.jaro_winkler(EmployerName,TData['EmployerName'][0])
0.41452991452991456

I think I need a loop over the rows, but I don't know how to write it.

Thanks
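
One possible way to approach this, sketched without third-party packages so it runs anywhere: score every row, then take the row with the highest score. The `jaro_winkler` function below is a hand-written stand-in for `textdistance.jaro_winkler` (standard prefix weight 0.1, prefix capped at 4 characters), and the `rows` list is a hypothetical plain-Python copy of the TData sample shown above. Lowercasing the query is an assumption on my part: the comparison is case-sensitive, and the data appears to be stored in lowercase while the query is uppercase.

```python
def jaro(s1, s2):
    """Jaro similarity between two strings, in the range 0.0 to 1.0."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters only count as matching within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    transpositions = out_of_order // 2
    return (matches / len1 + matches / len2
            + (matches - transpositions) / matches) / 3


def jaro_winkler(s1, s2, prefix_weight=0.1):
    """Jaro similarity boosted for a shared prefix of up to 4 characters."""
    base = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return base + prefix * prefix_weight * (1 - base)


EmployerName = "MIDWEST UNDERGROUND SUPPLY"

# Hypothetical stand-in for TData: (EmployerName, EmploymentStatus) pairs.
# The third name is truncated in the displayed output and is kept as shown.
rows = [
    ("ups", "No Longer Employed"),
    ("midwest underground supply llc", "Inactive"),
    ("us department of veterans affairs-office of fi...", "Inactive"),
    ("us department of homeland security", "Inactive"),
    ("towne park, ltd.", "Separated"),
]

# Assumption: compare case-insensitively by lowercasing the query.
query = EmployerName.lower()

# Score every row, then keep the row with the highest score.
scored = [(jaro_winkler(query, name), name, status) for name, status in rows]
best_score, best_name, best_status = max(scored, key=lambda item: item[0])

print(best_name, best_status, round(best_score, 4))
```

With pandas and textdistance installed, the same idea would probably not need an explicit loop: something like `scores = TData['EmployerName'].apply(lambda s: textdistance.jaro_winkler(EmployerName.lower(), s))` followed by `TData.loc[scores.idxmax()]` should return the best-matching row, including its EmploymentStatus.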

