在Pandas中有效地分组、编辑和重新加入群组

+----+-------+---+---+---+ | | A | B | C | D | +----+-------+---+---+---+ | 0 | Green | 1 | 4 | 5 | | 1 | Red | 2 | 3 | 2 | | 2 | Red | 1 | 4 | 3 | | 3 | Green | 2 | 2 | 2 | | 4 | Green | 1 | 1 | 1 | | 5 | Blue | 2 | 1 | 5 | | 6 | Red | 2 | 1 | 6 | | 7 | Blue | 7 | 8 | 9 | | 8 | Green | 7 | 6 | 5 | | 9 | Red | 0 | 9 | 0 | | 10 | Blue | 4 | 5 | 4 | +----+-------+---+---+---+

group_list = [] g = df.groupby("A") for i, group in g: ###Perform some weird operation on group that can't really be reduced to a #lambda function applied to each group. group_list.append(group) reconstituted = group_list[0] for i in range(1,len(group_list)): reconstituted = reconstituted.append(group_list[i], ignore_index=True)

2条回答

网友

1楼 · 编辑于 2024-09-29 17:17:20

在不知道函数做什么的情况下，如果您只想将它们连接回去，则可以使用^{}：

df_new = pd.concat(group_list)

MVCE公司：

In [77]: df1
Out[77]: 
   0
0  a
1  b

In [78]: df2
Out[78]: 
   0
0  c
1  d

In [79]: pd.concat([df1, df2], ignore_index=True)
Out[79]: 
   0
0  a
1  b
0  c
1  d

但是，我强烈建议您考虑一种不同的技术，它不涉及明确地拆分组并分别处理它们，这是非常低效的。你知道吗

网友

2楼 · 编辑于 2024-09-29 17:17:20

以下代码可以按A列的值提取值

import pandas as pd

df = pd.DataFrame([{'A': 'Green', 'B': 1}, {'A': 'Red', 'B': 2}, {'A': 'Green', 'B': 3}])

for value in df.A.unique():
    print(df[df.A == value])

如果不想将它们合并回df，可以按列A对值进行排序

df.sort_values("A")

您可以得到以下结果：

       A  B
0  Green  1
2  Green  3
1    Red  2

相关问题更多 >

编程相关推荐

热门问题

热门文章