如何打印数据帧列中不同真值组的第一个和最后一个索引

pos = 0 for column in df: try: colname = faults[df.columns[pos]] print "The fault -" +str (colname)+ "- occurred on:" except Exception: pass try: print df.loc[df[column] == True, 'Date'].iloc[:] except TypeError: pass print pos += 1

1条回答

网友
1楼 · 发布于 2024-09-26 18:08:07

以下是我的建议，请浏览列表以找到开始和结束（如果需要，请添加第一个和最后一个）并压缩它们：
df = pd.DataFrame() df['rule_1'] = [0]*13 df['rule_2'] = [0,0,1,1,1,0,0,0,1,1,1,1,0] df['rule_3'] = [1]*13 df.index = pd.date_range("2017-12-25 00:00", "2017-12-25 03:00", freq='0.25H') for col in df.columns: starts = [i for i,x in list(enumerate(df[col].values))[1:-1] if ((x==1)&(df[col].values[i-1]==0))] ends = [i for i,x in list(enumerate(df[col].values))[1:-1] if ((x==1)&(df[col].values[i+1]==0))] if df[col].values[0]==1: starts = [0]+starts if df[col].values[-1]==1: ends = ends + [-1] print (col) for x in zip(df.index[starts], df.index[ends]): print(x) print()
输出：
规则1
规则2
（时间戳（'2017-12-25 00:30:00'），时间戳（'2017-12-25 01:00:00'））
（时间戳（'2017-12-25 02:00:00'），时间戳（'2017-12-25 02:45:00'））
规则3
（时间戳（'2017-12-25 00:00:00'），时间戳（'2017-12-25 03:00:00'））

相关问题更多 >

编程相关推荐

热门问题

热门文章