How to merge two rows in a dataset in Pandas

Output:

      Borough               Major Category               numCrimes
Year
2008  Barking and Dagenham  Burglary                     82.0
2008  Barking and Dagenham  Burglary                     59.0
2008  Barking and Dagenham  Criminal Damage              79.0
2008  Barking and Dagenham  Criminal Damage              142.0
2008  Barking and Dagenham  Criminal Damage              20.0
...   ...                   ...                          ...
2018  Westminster           Violence Against the Person  386.0
2018  Westminster           Violence Against the Person  0.0
2018  Westminster           Violence Against the Person  41.0
2018  Westminster           Violence Against the Person  38.0
2018  Westminster           Violence Against the Person  109.0

3 Answers

User

Reply #1 · Edited 2024-06-26 18:02:13

I think all you need is a very simple groupby operation:

grouped = df.groupby(['Year','Borough','Major Category']).sum()

# If you need the group keys back as regular columns, keep the result:
grouped = grouped.reset_index()
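
To make this concrete, here is a minimal sketch that rebuilds the rows visible in the question's output and runs the same groupby. The DataFrame construction is an assumption (the original loading code is not shown), and it assumes "Year" is the index, as the printed output suggests.

```python
import pandas as pd

# Rows copied from the question's output; "Year" is assumed to be the index.
df = pd.DataFrame(
    {
        "Borough": ["Barking and Dagenham"] * 5 + ["Westminster"] * 5,
        "Major Category": ["Burglary"] * 2
        + ["Criminal Damage"] * 3
        + ["Violence Against the Person"] * 5,
        "numCrimes": [82.0, 59.0, 79.0, 142.0, 20.0, 386.0, 0.0, 41.0, 38.0, 109.0],
    },
    index=pd.Index([2008] * 5 + [2018] * 5, name="Year"),
)

# groupby accepts index level names ("Year") and column names together.
grouped = df.groupby(["Year", "Borough", "Major Category"]).sum()

# The three keys now form a MultiIndex; reset_index() turns them back into columns.
flat = grouped.reset_index()
print(flat)
```

Rows that share the same Year, Borough and Major Category collapse into one row whose numCrimes is their sum.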

User

Reply #2 · Edited 2024-06-26 18:02:13

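groupby and agg are the right functions to use here, but we should be careful not to lose "Year", which looks like it is the index of df. So

(df.reset_index()
   .groupby(['Year','Borough','Major Category'], as_index=False)
   .agg('sum')   # the string 'sum' avoids the FutureWarning that newer pandas raises for the builtin sum
)

should do it; for your sample data, it produces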


    Year    Borough                 Major Category              numCrimes
0   2008    Barking and Dagenham    Burglary                    141.0
1   2008    Barking and Dagenham    Criminal Damage             241.0
2   2018    Westminster             Violence Against the Person 574.0
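
The following is a small, self-contained sketch of the reset_index approach. The records list is an assumption reconstructed from the rows shown in the question; only the groupby call itself comes from this answer.

```python
import pandas as pd

# Hypothetical reconstruction of the sample rows as (Year, Borough, Major Category, numCrimes).
records = [
    (2008, "Barking and Dagenham", "Burglary", 82.0),
    (2008, "Barking and Dagenham", "Burglary", 59.0),
    (2008, "Barking and Dagenham", "Criminal Damage", 79.0),
    (2008, "Barking and Dagenham", "Criminal Damage", 142.0),
    (2008, "Barking and Dagenham", "Criminal Damage", 20.0),
    (2018, "Westminster", "Violence Against the Person", 386.0),
    (2018, "Westminster", "Violence Against the Person", 0.0),
    (2018, "Westminster", "Violence Against the Person", 41.0),
    (2018, "Westminster", "Violence Against the Person", 38.0),
    (2018, "Westminster", "Violence Against the Person", 109.0),
]
columns = ["Year", "Borough", "Major Category", "numCrimes"]
df = pd.DataFrame(records, columns=columns).set_index("Year")

# Move "Year" out of the index first, then group with as_index=False so the
# keys stay as ordinary columns and the result gets a plain 0..n-1 index.
result = (
    df.reset_index()
      .groupby(["Year", "Borough", "Major Category"], as_index=False)
      .agg("sum")
)
print(result)
```

Because the keys stay as columns, no further reset_index() call is needed on the result.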

User

Reply #3 · Edited 2024-06-26 18:02:13

df.groupby(["Year", "Borough", "Major Category"]).sum()

or some variation of it. Pretty sure groupby is what you're looking for.
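
One possible variation, sketched here only as an illustration: select the numCrimes column and use named aggregation to also record how many rows were merged into each group. The output names total and rows_merged are made up for this example, and the data is just the 2008 rows from the question.

```python
import pandas as pd

# Only the 2008 rows from the question, with "Year" assumed to be the index.
df = pd.DataFrame(
    {
        "Borough": ["Barking and Dagenham"] * 5,
        "Major Category": ["Burglary"] * 2 + ["Criminal Damage"] * 3,
        "numCrimes": [82.0, 59.0, 79.0, 142.0, 20.0],
    },
    index=pd.Index([2008] * 5, name="Year"),
)

# Named aggregation on a single column: each keyword becomes an output column.
# "total" and "rows_merged" are hypothetical names chosen for this sketch.
summary = (
    df.groupby(["Year", "Borough", "Major Category"])["numCrimes"]
      .agg(total="sum", rows_merged="count")
      .reset_index()
)
print(summary)
```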
