How to extract a substring from a string in a column that matches another string in a Python list

   col 1  col 2
0     59  538 Walton Avenue, Chester, FY6 7NP
1     62  42 Chesterton Road, Peterborough, FR7 2NY
2    179  3 Wallbridge Street, Essex, 4HG 3HT
3    180  6 Stevenage Avenue, Coventry, 7PY 9NP

   col 1  col 2                                      col3
0     59  538 Walton Avenue, Chester, FY6 7NP        Chester
1     62  42 Chesterton Road, Peterborough, FR7 2NY
2    179  3 Wallbridge Street, Essex, 4HG 3HT        Essex
3    180  6 Stevenage Avenue, Coventry, 7PY 9NP      Coventry

3 Answers

User

#1 · edited 2024-10-02 02:31:08

Rely on a helper function like this:

import pandas as pd

df = pd.DataFrame({'col 1': [59, 62, 179, 180],
                   'col 2': ['538 Walton Avenue, Chester, FY6 7NP',
                             '42 Chesterton Road, Peterborough, FR7 2NY',
                             '3 Wallbridge Street, Essex, 4HG 3HT',
                             '6 Stevenage Avenue, Coventry, 7PY 9NP'
                             ]})

def aux_func(x):

    # split by comma and select the interesting part ([1])
    x = x.split(',')
    x = x[1]

    aux_list = ['Stevenage', 'Essex', 'Coventry', 'Chester']
    for v in aux_list:
        if v in x:
            return v
    return ""

df['col 3'] = [aux_func(name) for name in df['col 2']]
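
The same idea can probably be written without a Python-level loop. The following is only a sketch, assuming the city is always the second comma-separated field; unlike aux_func, it requires that field to equal a list entry exactly rather than merely contain one. The names city_list and city_part are chosen here for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    "col 1": [59, 62, 179, 180],
    "col 2": [
        "538 Walton Avenue, Chester, FY6 7NP",
        "42 Chesterton Road, Peterborough, FR7 2NY",
        "3 Wallbridge Street, Essex, 4HG 3HT",
        "6 Stevenage Avenue, Coventry, 7PY 9NP",
    ],
})
city_list = ["Stevenage", "Essex", "Coventry", "Chester"]

# Take the second comma-separated field and trim surrounding whitespace
city_part = df["col 2"].str.split(",").str[1].str.strip()

# Keep the field only where it is one of the listed cities, otherwise use ""
df["col 3"] = city_part.where(city_part.isin(city_list), "")
```

Rows whose address has no comma get a missing value from `.str[1]`, which `isin` treats as a non-match, so they also end up with an empty string.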

User

#2 · edited 2024-10-02 02:31:08

You can do it like this:

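import difflib

city_list = ["Stevenage", "Essex", "Coventry", "Chester"]

def get_match(row):
    # Split the address into words; adapt this preprocessing to your data
    col_2 = row["col 2"].replace(",", " ").split()
    for c in city_list:
        # get_close_matches(word, possibilities) returns the words in col_2 similar to c
        if difflib.get_close_matches(c, col_2):
            return c
    return ""

# Note: matching is fuzzy, so "Chester" also matches "Chesterton" at the default cutoff
df["col 3"] = df.apply(get_match, axis=1)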
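
A possible variation on the fuzzy approach, offered only as a sketch: compare just the second comma-separated field against the list, with a stricter cutoff, so that street names such as "Stevenage Avenue" or "Chesterton Road" are never considered. The helper name closest_city and the cutoff of 0.8 are assumptions made for illustration, not part of the answer above.

```python
import difflib
import pandas as pd

df = pd.DataFrame({
    "col 1": [59, 62, 179, 180],
    "col 2": [
        "538 Walton Avenue, Chester, FY6 7NP",
        "42 Chesterton Road, Peterborough, FR7 2NY",
        "3 Wallbridge Street, Essex, 4HG 3HT",
        "6 Stevenage Avenue, Coventry, 7PY 9NP",
    ],
})
city_list = ["Stevenage", "Essex", "Coventry", "Chester"]

def closest_city(address, cutoff=0.8):
    """Return the listed city closest to the second address field, or ""."""
    parts = address.split(",")
    if len(parts) < 2:
        return ""  # no city field to compare
    candidate = parts[1].strip()
    # n=1 keeps only the best-scoring city at or above the cutoff
    matches = difflib.get_close_matches(candidate, city_list, n=1, cutoff=cutoff)
    return matches[0] if matches else ""

df["col 3"] = df["col 2"].apply(closest_city)
```

Lowering the cutoff tolerates more spelling variation in the city field, at the cost of more false matches.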

User

#3 · edited 2024-10-02 02:31:08

Take a look at the str.contains function, which tests whether a pattern matches each string in a Series:

import pandas as pd

df = pd.DataFrame([[59, '538 Walton Avenue, Chester,', 'FY6 7NP'],
                   [62, '42 Chesterton Road, Peterborough', '4HG 3HT'],
                   [179, '3 Wallbridge Street, Essex', '4HG 3HT'],
                   [180, '6 Stevenage Avenue, Coventry', '7PY 9NP']])
city_list = ["Stevenage", "Essex", "Coventry", "Chester"]
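# Plain substring test: 'Chester' also matches 'Chesterton', and a later
# city in city_list overwrites an earlier match on the same row
for city in city_list:
    df.loc[df[1].str.contains(city), 'match'] = city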
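
If partial hits such as "Chester" inside "Chesterton" are unwanted, one option is a single regular expression with word boundaries, used with str.extract. This is a sketch rather than a definitive fix: it assumes the city is a whole word followed by a comma or the end of the string, and the names alternatives and pattern are made up here.

```python
import re
import pandas as pd

df = pd.DataFrame({
    "col 1": [59, 62, 179, 180],
    "col 2": [
        "538 Walton Avenue, Chester, FY6 7NP",
        "42 Chesterton Road, Peterborough, FR7 2NY",
        "3 Wallbridge Street, Essex, 4HG 3HT",
        "6 Stevenage Avenue, Coventry, 7PY 9NP",
    ],
})
city_list = ["Stevenage", "Essex", "Coventry", "Chester"]

# Escape each name, then join them into one alternation
alternatives = "|".join(re.escape(city) for city in city_list)

# \b...\b matches whole words only; the lookahead requires the name to end
# its comma-separated field, which skips "Stevenage" in "Stevenage Avenue"
pattern = rf"\b({alternatives})\b(?=\s*(?:,|$))"

# extract returns the first capture-group match per row, NaN where none
df["match"] = df["col 2"].str.extract(pattern, expand=False).fillna("")
```

The lookahead is what separates a city name from a street that happens to share it; without it, str.extract would return the leftmost listed name in each address.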
