Python: iterating over the columns of a DataFrame

         week  hour    week_hr  store_code  baskets
    0  201616   106  201616106         505        0
    1  201616   107  201616107         505        0
    2  201616   108  201616108         505        0
    3  201616   109  201616109         505       18
    4  201616   110  201616110         505        0
    5  201616   106  201616108         910        0
    6  201616   107  201616106         910        0
    7  201616   108  201616107         910        2
    8  201616   109  201616108         910        3
    9  201616   110  201616109         910       10

2 answers

User
#1 · edited 2024-09-28 17:21:55

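Do the following:
1. Sort by store code and week/hour.
2. Keep only the rows where baskets is 0.
3. Subtract df['week_hr'][:-1].values from df['week_hr'][1:].values (on the filtered frame), so you can tell whether neighboring rows are consecutive.
4. Now you can assign a group id to each consecutive run and filter as needed.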
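    import numpy as np
    import pandas as pd

    # 1. sort by store code and week/hour
    t1 = df.sort_values(['store_code', 'week_hr'])
    # 2. keep only the rows with zero baskets (copy so a column can be added safely)
    t2 = t1[t1['baskets'] == 0].copy()
    # 3. True where week_hr is exactly 1 more than in the previous row
    continuous = t2['week_hr'][1:].values - t2['week_hr'][:-1].values == 1
    # every break in the sequence starts a new group id
    groups = np.cumsum(np.hstack([False, ~continuous]))
    t2['groups'] = groups
    # 4. count rows per (store, group) and keep the runs longer than 2
    t3 = t2.groupby(['store_code', 'groups'], as_index=False)['week_hr'].count()
    t4 = t3[t3.week_hr > 2]
    print(pd.merge(t2, t4[['store_code', 'groups']]))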
No loop needed!
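To try the steps above end to end, here is a minimal sketch that rebuilds the sample data from the question and wraps the same idea in a function. The name `zero_runs` and its `min_len` parameter are made up for illustration, and `min_len=3` is an assumption that mirrors the `> 2` filter in the answer.

```python
import numpy as np
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "week": [201616] * 10,
    "hour": [106, 107, 108, 109, 110] * 2,
    "week_hr": [201616106, 201616107, 201616108, 201616109, 201616110,
                201616108, 201616106, 201616107, 201616108, 201616109],
    "store_code": [505] * 5 + [910] * 5,
    "baskets": [0, 0, 0, 18, 0, 0, 0, 2, 3, 10],
})


def zero_runs(frame, min_len=3):
    """Zero-basket rows that sit in a run of at least min_len consecutive
    week_hr values within one store (hypothetical helper)."""
    zeros = frame[frame["baskets"] == 0].sort_values(["store_code", "week_hr"]).copy()
    if zeros.empty:
        return zeros
    # A row continues the previous run only if week_hr increased by exactly 1.
    step_is_one = np.diff(zeros["week_hr"].to_numpy()) == 1
    # Every break starts a new run id; the first row always starts run 0.
    zeros["groups"] = np.cumsum(np.hstack([False, ~step_is_one]))
    # Size of the run each row belongs to, counted per store.
    sizes = zeros.groupby(["store_code", "groups"])["week_hr"].transform("count")
    return zeros[sizes >= min_len]


result = zero_runs(df)
print(result)
```

With this data only store 505 has a qualifying run (week_hr 201616106 to 201616108). Grouping on both `store_code` and `groups` keeps a run from spilling across two stores even when their week_hr values happen to be adjacent.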

User
#2 · edited 2024-09-28 17:21:55

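You can solve it like this:
1. Sort by store code and week/hour.
2. Keep only the rows where baskets is 0.
3. Group by store code.
4. Find the consecutive values.
Code: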
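    t1 = df.sort_values(['store_code', 'week_hr'])
    t2 = t1[t1['baskets'] == 0]
    # one list of week_hr values per store
    grouped = t2.groupby('store_code')['week_hr'].apply(lambda x: x.tolist())
    # Series.iteritems() was removed in pandas 2.0; items() works on old and new versions
    for store_code, week_hrs in grouped.items():
        print(store_code, week_hrs)
        # do something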
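The answer leaves the "find the consecutive values" step as a placeholder. One possible way to fill it in, sketched in plain Python: `consecutive_runs` is a hypothetical helper, the per-store lists are typed in by hand to match what the groupby above would yield for the question's data, and the minimum run length of 3 is an assumption borrowed from the other answer.

```python
def consecutive_runs(values):
    """Split a sorted list of integers into lists of consecutive values."""
    runs = []
    for value in values:
        if runs and value == runs[-1][-1] + 1:
            runs[-1].append(value)   # extends the current run
        else:
            runs.append([value])     # gap found: start a new run
    return runs


# Hypothetical per-store lists, shaped like the output of the groupby above.
week_hrs_by_store = {
    505: [201616106, 201616107, 201616108, 201616110],
    910: [201616106, 201616108],
}

# Keep only runs of at least 3 consecutive values (assumed threshold).
long_runs = {
    store: [run for run in consecutive_runs(week_hrs) if len(run) >= 3]
    for store, week_hrs in week_hrs_by_store.items()
}
print(long_runs)
# {505: [[201616106, 201616107, 201616108]], 910: []}
```

Inside the loop from the answer, the same call would be `consecutive_runs(week_hrs)` in place of the `# do something` comment.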
