在Python中对行（日期）进行分组并汇总多个列（每个日期的几个测量值）

import psycopg2 as ps import pandas as pd import openpyxl conn = ps.connect(host="host", user="user", password="password", dbname="Python_ueben") cur = conn.cursor() print('connect') """ schema = input("Geben Sie das Schema ein") table = input(" Geben Sie die Tabele ein") """ def load_data(schema, table): sql_command = "SELECT * FROM {}.{};".format(str(schema), str(table)) print (sql_command) # Load the data data = pd.read_sql(sql_command, conn) groub = data.groupby(['date']) # group Date and save in variable print(data.sum(axis=1, skipna=True)) #sum values v00 - v03 print(groub.sum(axis=1, skipna=True)) # group and sum, but not the right result #print(data.groupby(['date']).sum(axis=0, skipna=False)) load_data('public', 'zeitreihe')

1条回答

网友

1楼 · 发布于 2024-06-22 10:32:30

如果Date是列的第一个聚合和，然后是每个axis=1的sum：

df1 = df.groupby('Date').sum().sum(axis=1).reset_index(name='sum')
print (df1)
                  Date  sum
0  2001-01-01 00:00:00  500
1  2001-02-01 00:00:00  160

或者通过Date列创建index，然后对所有行和每个索引的最后一个总和（每个索引的总和）：

df1 = df.set_index('Date').sum(axis=1).sum(level=0).reset_index(name='sum')

如果Date是索引，则上面的解决方案是simplify：

df1 = df.sum(axis=1).sum(level=0).reset_index(name='sum')

df1 = df.sum(level=0).sum(axis=1).reset_index(name='sum')

相关问题更多 >

编程相关推荐

热门问题

热门文章