Counting across multiple columns using a wildcard

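2 answers

User

Answer #1 · edited 2024-10-02 18:19:49

You can use pandas.Series.str.contains on each contact column and combine the two results with | (element-wise OR). Passing na=False makes missing values count as "no match":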

df_merged['Contact has Email'] = df_merged['Store Contact A'].str.contains('@', na=False)|df_merged['Store B Contact'].str.contains('@', na=False)
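To make the one-liner above easier to try out, here is a minimal self-contained sketch. The column names come from the answer; the sample values (including the `None` entries) are assumptions made up for illustration, chosen to show what `na=False` does.

```python
import pandas as pd

# Illustrative data: column names follow the answer, the values are assumed.
df_merged = pd.DataFrame({
    'Store Contact A': ['aa@bb.cc', None, 'e'],
    'Store B Contact': ['', 'nn@ooo.pp', None],
})

# na=False turns missing values into False instead of NaN,
# so the result is a clean boolean Series that can be OR-ed.
has_email_a = df_merged['Store Contact A'].str.contains('@', na=False)
has_email_b = df_merged['Store B Contact'].str.contains('@', na=False)

# True when at least one of the two contact columns contains '@'.
df_merged['Contact has Email'] = has_email_a | has_email_b

print(df_merged['Contact has Email'].tolist())
```

Note that checking for a bare '@' is only a rough test: any string containing that character counts as an email here.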

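User

Answer #2 · edited 2024-10-02 18:19:49

You can use filter to select every column whose name contains Contact, then apply str.contains with a suitable pattern for an email address. Finally, you want to know whether any column matches in each row, so call any along axis 1: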

import pandas as pd

# sample data
df_merged = pd.DataFrame({'id': [0, 1, 2, 3],
                          'Store A': list('abcd'),
                          'Store Contact A': ['aa@bb.cc', '', 'e', 'f'],
                          'Store B': list('ghij'),
                          'Store B Contact': ['kk@ll.m', '', 'nn@ooo.pp', '']})

# define the email pattern as in the link
pat = r"^[a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+\.[a-zA-Z0-9-.]+$"

# select the 'Contact' columns, test each cell against the pattern,
# then reduce row-wise: True if any contact column matches.
# axis must be passed by keyword; any(1) fails on pandas 2.x.
df_merged['Contact has Email'] = (
    df_merged.filter(like='Contact')
             .apply(lambda x: x.str.contains(pat))
             .any(axis=1)
)

print (df_merged)
   id Store A Store Contact A Store B Store B Contact  Contact has Email
0   0       a        aa@bb.cc       g         kk@ll.m               True
1   1       b                       h                              False
2   2       c               e       i       nn@ooo.pp               True
3   3       d               f       j                              False
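The question title mentions counting, while the column above is only a per-row True/False flag. As a hedged sketch of one way to get counts with the same approach, the boolean matches can be summed instead of reduced with any. The names `matches`, `Email Count`, and `total_emails` are hypothetical, introduced here for illustration only.

```python
import pandas as pd

# Same sample values as in the answer, restricted to the relevant columns.
df = pd.DataFrame({
    'id': [0, 1, 2, 3],
    'Store Contact A': ['aa@bb.cc', '', 'e', 'f'],
    'Store B Contact': ['kk@ll.m', '', 'nn@ooo.pp', ''],
})
pat = r"^[a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+\.[a-zA-Z0-9-.]+$"

# Boolean frame: one column per 'Contact' column, True where the cell matches.
# na=False is an added safeguard so missing values count as non-matches.
matches = df.filter(like='Contact').apply(
    lambda col: col.str.contains(pat, na=False)
)

# Summing booleans counts the True values: per row, then overall.
df['Email Count'] = matches.sum(axis=1)      # hypothetical column name
total_emails = int(matches.to_numpy().sum())  # hypothetical variable name

print(df['Email Count'].tolist())
print(total_emails)
```

Here `filter(like='Contact')` plays the role of the wildcard: it picks up any column whose name contains that substring, so new contact columns are included without changing the code.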
