在DataFrame列中用子字符串替换字符串

strings matched strings 0 text1C1 text1 1 text2A text2 2 text2 text2 3 text4 text4 4 text4B text4B 5 text4A3 text4

1条回答

网友

1楼 · 发布于 2024-09-30 18:33:38

将生成器与next一起用作匹配第一个值：

s = vals[::-1]
df['matched strings1'] = df['strings'].apply(lambda x: next(y for y in s if y in x))
print (df)
   strings matched strings matched strings1
0  text1C1           text1            text1
1   text2A           text2            text2
2    text2           text2            text2
3    text4           text4            text4
4   text4B          text4B           text4B
5  text4A3           text4            text4

更一般的解决方案，如果可能的话，没有匹配的值与iter和默认参数next：

f = lambda x: next(iter(y for y in s if y in x), 'no match')
df['matched strings1'] = df['strings'].apply(f)

应改进您的解决方案：

for v in vals:
    df.loc[df['strings'].str.contains(v, regex=False), 'matched strings'] = v

相关问题更多 >

编程相关推荐

热门问题

热门文章

在DataFrame列中用子字符串替换字符串

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >