如何创建running total并在每次出现NaN时重新启动它？

for i in df['Budget_Expenditure_2012_']: if np.isnan(i) == True: x = pd.Index(df['Budget_Expenditure_2012_']).get_loc(i) print(x) for item in range(0, len(x) - 1, 2): second_list.append([x[item],x[item + 1]]) print(second_list)

2条回答

网友

1楼 · 编辑于 2024-10-02 14:16:49

使用这段代码，您可以在一个名为“总计”的新列上获取每个nan的“运行总计”

total = 0
df['Totals'] = 0 # assign 0 initially to all rows of the new column

for i in range(df.shape[0]): # shape[0] return number of rows

    expenditure = df.loc[i+1, 'Budget_Expenditure_2012_'] # i+1 coz your indexing starts at 1

    if np.isnan(expenditure):
        df.loc[i, 'Totals'] = total
        total = 0
    else:
        total += expenditure

网友

2楼 · 编辑于 2024-10-02 14:16:49

使用shift、isna和cumsum的组合来gropuby，然后transform，最后在列为nan的位置分配结果值

df.loc[df['Budget_Expenditure_2012_'].isna(), 'new_column'] = (
    df.groupby(
        df.Budget_Expenditure_2012_.shift()
                                   .isna()
                                   .cumsum()
    )['Budget_Expenditure_2012_'].transform('sum')
)

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何创建running total并在每次出现NaN时重新启动它？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >