包含与特定值匹配的行的嵌套循环

date name movie 0 2018-01-16 10:33:59 Alicia Titanic 1 2018-01-17 08:49:13 Chandler Avatar 2 2018-01-18 09:29:09 Luigi Glass 3 2018-01-19 09:45:27 Alicia Die Hard 4 2018-01-20 10:08:05 Bouchra Pulp Fiction 5 2018-01-26 10:21:47 Bariza Glass 6 2018-01-27 10:15:32 Peggy Bumbleblee 7 2018-01-20 10:08:05 John Titanic 8 2018-01-26 10:21:47 Bariza Glass 9 2018-01-27 10:15:32 John Titanic

date name movie 0 2018-01-16 10:33:59 Alicia Titanic 7 2018-01-20 10:08:05 John Titanic 9 2018-01-27 10:15:32 John Titanic

alicias = df[df['Name'] == 'Alicia'] df_res = pd.DataFrame(columns=df.columns) for i in alicias.index: df_res = df_res.append(alicias.loc[i], sort=False) df_johns = df[(df['Date'] > alicias['Date'][i]) &(df['Name'] == 'John') &(df['Movie'] == alicias['Movie'][i)] df_res = df_res.append(df_johns, sort=False)

1条回答

网友

1楼 · 发布于 2024-06-26 13:44:50

这里有一个方法。假设您有以下数据帧：

     date      user    movie
0  2018-01-02  Alicia  Titanic
1  2018-01-13    John  Titanic
2  2018-01-22    John  Titanic
3  2018-04-02    John   Avatar
4  2018-04-05  Alicia   Avatar
5  2018-05-19    John   Avatar

正确的解决方案不应该包含第3行，因为Alicia还没有看到Avatar。所以你可以：

df[df.user.eq('Alicia').groupby(df.movie).cumsum()]

     date       user    movie
0  2018-01-02  Alicia  Titanic
1  2018-01-13    John  Titanic
2  2018-01-22    John  Titanic
4  2018-04-05  Alicia   Avatar
5  2018-05-19    John   Avatar

说明：

下面返回True，其中user是Alicia：

df.user.eq('Alicia')

0     True
1    False
2    False
3    False
4     True
5    False
Name: user, dtype: bool

您现在可以做的是^{}电影，并在组上应用^{}，因此只有第一个True之后的行也将变成True：

0     True
1     True
2     True
3    False
4     True
5     True
Name: user, dtype: bool

最后在原始数据帧上使用boolean indexation，以便选择感兴趣的行

相关问题更多 >

编程相关推荐

热门问题

热门文章