Filter rows based on an element of a string after splitting it (Pandas)

2 answers

User

#1 · Edited 2024-10-02 12:31:58

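You can use the regex (^|;) *Credit(;|$) to make sure the pattern matches a whole element between delimiters: Credit must sit at the start of the string or directly after a ; delimiter (optionally followed by spaces), and it must be followed by another ; or by the end of the string: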

df
   index                                     locations
0  39951                     Credit; Mount Pleasant GO
1  40976  Ajax GO; Whitby GO; Credit; Oshawa GO; Bayly
2  14961             Mount Pleasant GO; Port Credit GO

df.locations.str.contains('(^|;) *Credit(;|$)')
#0     True
#1     True
#2    False
#Name: locations, dtype: bool
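
For anyone who wants to run this end to end, here is a minimal sketch that rebuilds the sample frame shown above and applies the same pattern. The only change is an assumption on my part: the groups are written as non-capturing `(?:...)`, because `str.contains` tends to emit a "pattern has match groups" UserWarning for capturing groups. The matching behavior should be the same. The names `pattern`, `mask` and `filtered` are just illustrative.

```python
import pandas as pd

# Sample data copied from the frame printed in the answer.
df = pd.DataFrame({
    "index": [39951, 40976, 14961],
    "locations": [
        "Credit; Mount Pleasant GO",
        "Ajax GO; Whitby GO; Credit; Oshawa GO; Bayly",
        "Mount Pleasant GO; Port Credit GO",
    ],
})

# Same pattern as in the answer, with non-capturing groups so pandas
# does not warn about unused match groups.
pattern = r"(?:^|;) *Credit(?:;|$)"

# Boolean mask: True where "Credit" is a complete element of the list.
mask = df["locations"].str.contains(pattern)

# Use the mask to keep only the matching rows.
filtered = df[mask]
print(filtered)
```

Note that the pattern allows spaces before Credit but not after it, so an element written as "Credit ;" would not match unless the pattern is extended.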

If you also want to ignore case, add the inline modifier (?i) to the start of the pattern:

df.locations.str.contains('(?i)(^|;) *credit(;|$)')
#0     True
#1     True
#2    False
#Name: locations, dtype: bool
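
As a possible alternative to the inline modifier, `str.contains` also accepts a `case` parameter. The sketch below compares the two on a small made-up Series (the sample values and variable names are hypothetical, chosen only to exercise mixed case); both approaches are expected to give the same mask.

```python
import pandas as pd

# Hypothetical sample values with mixed capitalization.
s = pd.Series([
    "credit; Mount Pleasant GO",
    "Ajax GO; CREDIT",
    "Mount Pleasant GO; Port Credit GO",
])

pattern = r"(?:^|;) *credit(?:;|$)"

# Option 1: inline flag at the very start of the pattern.
inline_flag = s.str.contains("(?i)" + pattern)

# Option 2: let pandas handle case-insensitivity via the case parameter.
case_param = s.str.contains(pattern, case=False)

print(inline_flag.equals(case_param))
```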

User

#2 · Edited 2024-10-02 12:31:58

You can also try this (without a regular expression):

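# Split each string on '; ' and explode so every element gets its own row
# (each exploded row keeps the index label of the row it came from):
m = df['locations'].str.split('; ').explode()
# Keep the elements equal to 'Credit' and collect their unique index labels:
m = m[m.isin(['Credit'])].index.unique()
# Finally, select the matching rows from the original DataFrame:
out = df.loc[m]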

Now, if you print out, you will get the filtered DataFrame.
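
Here is a self-contained sketch of the split-and-explode approach on the sample data from the first answer. It differs slightly from the snippet above, as an assumption about what the data may look like: it splits on ";" alone and strips whitespace afterwards, so elements still match if the spacing around the delimiter is inconsistent. The names `parts` and `matching_index` are illustrative.

```python
import pandas as pd

# Sample data copied from the frame printed in the first answer.
df = pd.DataFrame({
    "index": [39951, 40976, 14961],
    "locations": [
        "Credit; Mount Pleasant GO",
        "Ajax GO; Whitby GO; Credit; Oshawa GO; Bayly",
        "Mount Pleasant GO; Port Credit GO",
    ],
})

# One row per element; exploded rows keep the original index label.
parts = df["locations"].str.split(";").explode().str.strip()

# Index labels of the original rows that contain an exact "Credit" element.
matching_index = parts[parts == "Credit"].index.unique()

# Select those rows from the original frame.
out = df.loc[matching_index]
print(out)
```

Because the comparison is an exact match on each element, "Port Credit GO" is not selected, which is the same result the regex answer gives.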
