Pandas连接上一个当前和下一个tex

Event_id,reportTime,ReportValX,ReportValY,ReportText 1,13_01,13,1,Man Arrived near the car 1,13_02,13,2.2,The Car was fast 1,13_02,13,2.1,The lights were on. 1,13_03,13,3,The man hit the car 2,13_01,13,1,Cat was on the mat 2,13_02,13,2.2,mat was red 2,13_03,13,3.1,Dad is a man 2,13_03,13,3,Dad has a hat

Event_id,reportTime,ReportValX,ReportValY,ReportText_Before,Reprt Text_Current,Report Text Same Time,Report Text Later 1,13_01,13,1,,Man Arrived near the car,Man Arrived near the car,The Car was fast. <NEXT> The lights were on. <NEXT> The man hit the car 1,13_02,13,2.2,Man Arrived near the car,The Car was fast,The Car was fast <NEXT> The lights were on.,The man hit the car

1条回答

网友
1楼 · 发布于 2024-09-28 17:04:57

你需要使用groupby和apply来实现你想要的。你知道吗
首先，创建两个新列'Hour'和'Minute'，以便更容易地标识时间。你知道吗
df["Hour"], df["Minute"] = zip(*df['reportTime'].apply(lambda x : list(map(int, x.split('_')))))
然后编写一个自定义函数来生成新列，并使用groupby和apply来使用它。你知道吗
def makerep(x): res = x[['Event_id','reportTime','ReportValX','ReportValY']] res['ReportText_Before'] = x.apply(lambda el : '<NEXT>'.join(x['ReportText'].loc[x['Minute'] < el['Minute']]), axis=1) res['ReportText_Current'] = x['ReportText'] res['ReportText_SameTime'] = x.apply(lambda el : '<NEXT>'.join(x['ReportText'].loc[x['Minute'] == el['Minute']]), axis=1) res['ReportText_Later'] = x.apply(lambda el : '<NEXT>'.join(x['ReportText'].loc[x['Minute'] > el['Minute']]), axis=1) return res ddf = df.groupby(['Event_id', 'Hour']).apply(makerep)
您可以在'Event_id'和'Hour'上分组，并在连接新列中的字符串时使用'Minute'选择较早、相同或较晚的分钟数。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章