基于其他列值标记行

2条回答

网友

1楼 · 编辑于 2024-10-01 09:17:21

对于Pandas，最好使用按列计算；apply和自定义函数一起表示一个低效的、Python级别的按行循环。你知道吗

df = pd.DataFrame({'street_name': ['Malborough Road', '123 Fake Road', 'My Street'],
                   'eircode': ['BLT12', None, None]})

cond1 = df['eircode'].isnull()
cond2 = ~df['street_name'].str.split(n=1).str[0].str.isdigit()

df['unique'] = np.where(cond1 & cond2, 'no', 'yes')

print(df)

  eircode      street_name unique
0   BLT12  Malborough Road    yes
1    None    123 Fake Road    yes
2    None        My Street     no

网友

2楼 · 编辑于 2024-10-01 09:17:21

可以使用|操作符提供这些单独的条件，然后将生成的布尔数组映射到yes和no。第一个条件只是查看eircode是否为null，第二个条件使用正则表达式检查street_name是否以数字开头：

df['unique'] = ((~df.eircode.isnull()) | (df.street_name.str.match('^[0-9]'))).map({True:'yes',False:'no'})
>>> df
       street_name eircode unique
0  Malborough Road   BLT12    yes
1    123 Fake Road     NaN    yes
2        My Street     NaN     no

相关问题更多 >

编程相关推荐

热门问题

热门文章

基于其他列值标记行

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >