从旧数据帧创建新数据帧，其中新数据帧包含旧数据帧中不同位置的行平均列

newdf= frame[['sample']].copy() newdf['rep_avg']=frame.iloc[:, [1,5]].mean(axis=1) #average row-wise newdf['repcycle_avg']=frame.iloc[:, [2,6]].mean(axis=1) newdf['set_avg']=frame.iloc[:, [3,7]].mean(axis=1) #average row-wise newdf['setcycle_avg']=frame.iloc[:, [4,8]].mean(axis=1) newdf.columns = ['S', 'Re', 'Rec', 'Se', 'Sec']

1条回答

网友
1楼 · 发布于 2024-10-05 10:21:25

根据您的要求，“我更愿意自动化此过程，因为其他数据文件也会重复此过程” 我能想到的是：
in [1]: frame = pd.read_csv('your path')
结果如下所示，现在您可以看到要求平均值的是第1、5和2、6列，依此类推
out [1]: sample rep repcycle set setcycle rep1 repcycle1 set1 setcycle1 0 66 40 4 5 3 40 4 5 3 1 78 20 5 6 3 20 5 6 3 2 90 50 6 9 4 50 6 9 4 3 45 70 7 3 2 70 7 7 2
因此，我们需要创建两个列表
in [2]: import numpy as np list_1 = np.arange(1,5,1).tolist() in [3]: list_1 out[3]: [1,2,3,4]
这是您希望平均[rep，repcycle，set，setcycle]的前半部分
in [4]: list_2 = [x+4 for x in list_1] in [5]: list_2 out[5]: [5,6,7,8]
这是您想要平均的下半部分[rep1，repcycle1，set1，setcycle1]
in [6]: result = pd.concat([frame.iloc[:, [x,y].mean(axis=1) for x, y in zip(list_1,list_2)],axis=1) in [7]: result.columns = ['Re', 'Rec', 'Se', 'Sec']
现在你得到了你想要的，而且它是自动化的，你所需要做的就是改变上面的两个列表
in [8]: result out[8]: Re Rec Se Sec 0 40.0 4.0 5.0 3.0 1 20.0 5.0 6.0 3.0 2 50.0 6.0 9.0 4.0 3 70.0 7.0 5.0 2.0

相关问题更多 >

编程相关推荐

热门问题

热门文章