使用pandas多次循环数据帧的最快方式

HOME_TEAM AWAY_TEAM GOL_HOME_FT GOL_AWAY_FT ... 1153 Manchester United Swansea City 1 2 1163 Leicester City Everton 2 2 1172 Queens Park Rangers Hull City 0 1 1183 Stoke City Aston Villa 0 1 1193 West Bromwich Albion Sunderland 2 2

for team in list_teams: # list_teams is all the different teams in my df df_team = df.loc[(df.HOME_TEAM.values == team) | (df.AWAY_TEAM.values == team)] lst_bool = df_team.GOL_HOME_FT.values > df_team.GOL_AWAY_FT.values consecutive_true = 0 lst_consecutive_true = [] for i in lst_bool: if i is True: consecutive_true += 1 else: lst_consecutive_true.append(consecutive_true) consecutive_true = 0 max_consecutive_true = max(lst_consecutive_true)

1条回答

网友

1楼 · 发布于 2024-09-29 03:30:40

您可以尝试类似这样的方法，而不是循环。这是我使用的数据

HOME_TEAM                 AWAY_TEAM GOL_HOME_FT GOL_AWAY_FT lst_bool
ManchesterUnited          SwanseaCity   1          2    False
ManchesterCity            Everton       5          2    True
ManchesterParkRangers     HullCity      0          1    False
ManchesterCity            AstonVilla    0          1    False
ManchesterBromwichAlbion  Sunderland    2          2    False
ManchesterUnited          SwanseaCity   3          2    True
ManchesterUnited          SwanseaCity   3          2    True
ManchesterParkRangers     HullCity      2          1    True

我用这个df['lst_bool'] = df['GOL_HOME_FT']> df['GOL_AWAY_FT']得到了最后一列

然后找到连续的真与假，你可以这样做。 (df.groupby(df.lst_bool.ne(df.lst_bool.shift()).cumsum()).agg({'lst_bool': 'count'})).reset_index(drop=True)

这给了我下面的输出

    lst_bool
0   1
1   1
2   3
3   3

相关问题更多 >

编程相关推荐

热门问题

热门文章