Pandas groupby with a user-defined function

import numpy as np
import pandas as pd

people = pd.DataFrame(np.random.randn(5, 5),
                      columns=['a', 'b', 'c', 'd', 'e'],
                      index=['Joe', 'Steve', 'Wes', 'Jim', 'Travis'])

def GroupFunc(x):
    # x is an index label (a name), so this groups rows by name length
    if len(x) > 3:
        return 'Group1'
    else:
        return 'Group2'

people.groupby(GroupFunc).sum()

1 Answer

User

#1 · Posted 2024-09-28 21:17:19

To group rows by whether a column's value is > 1, you can define a function like this:

>>> def GroupColFunc(df, ind, col):
...     if df[col].loc[ind] > 1:
...         return 'Group1'
...     else:
...         return 'Group2'
...

Then call it like this:

>>> people.groupby(lambda x: GroupColFunc(people, x, 'a')).sum()
               a         b         c         d        e
Group2 -2.384614 -0.762208  3.359299 -1.574938 -2.65963
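
For a reproducible check of this approach, here is a minimal sketch that uses fixed numbers instead of random data. The values and the snake_case helper name `group_col_func` are illustrative assumptions, not part of the original post. It shows that a function passed to `groupby` receives each index label, which is why the helper needs the DataFrame itself to look up the column value.

```python
import pandas as pd

# Illustrative fixed data (assumed values, not from the original post).
people = pd.DataFrame(
    {"a": [2.0, 0.5, 1.5, -1.0, 0.0], "b": [1.0, 1.0, 1.0, 1.0, 1.0]},
    index=["Joe", "Steve", "Wes", "Jim", "Travis"],
)

def group_col_func(df, ind, col):
    """Return a group name for index label `ind`, based on the value in df[col]."""
    return "Group1" if df.loc[ind, col] > 1 else "Group2"

# groupby calls the key function once per index label ('Joe', 'Steve', ...).
result = people.groupby(lambda label: group_col_func(people, label, "a")).sum()
print(result)
```

With this data, Joe and Wes have `a > 1` and land in Group1; the other three rows land in Group2.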

Or use only an anonymous function:

>>> people.groupby(lambda x: 'Group1' if people['b'].loc[x] > people['a'].loc[x] else 'Group2').sum()
               a         b         c         d         e
Group1 -3.280319 -0.007196  1.525356  0.324154 -1.002439
Group2  0.895705 -0.755012  1.833943 -1.899092 -1.657191

As described in the documentation, you can also group by passing a Series that provides a label -> group name mapping:

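>>> # np.where returns a plain NumPy array in current versions; wrap it in a Series to keep the index labels
>>> mapping = pd.Series(np.where(people['b'] > people['a'], 'Group1', 'Group2'), index=people.index)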
>>> mapping
Joe       Group2
Steve     Group1
Wes       Group2
Jim       Group1
Travis    Group1
dtype: string48
>>> people.groupby(mapping).sum()
               a         b         c         d         e
Group1 -3.280319 -0.007196  1.525356  0.324154 -1.002439
Group2  0.895705 -0.755012  1.833943 -1.899092 -1.657191
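
The mapping approach can also be sketched with fixed numbers so the result is easy to verify by hand. The data values below are assumptions chosen for illustration. In current NumPy, `np.where` returns a plain array; `groupby` should accept either that array (matched to rows by position) or a Series built from it (matched by index label), and both give the same groups here.

```python
import numpy as np
import pandas as pd

# Illustrative fixed data (assumed values, not from the original post).
people = pd.DataFrame(
    {"a": [2.0, 0.5, 1.5, -1.0, 0.0], "b": [1.0, 1.0, 1.0, 1.0, 1.0]},
    index=["Joe", "Steve", "Wes", "Jim", "Travis"],
)

# Vectorized comparison: one group name per row, no per-row Python calls.
labels = np.where(people["b"] > people["a"], "Group1", "Group2")

# Option 1: a Series keyed by the same index (label -> group name).
mapping = pd.Series(labels, index=people.index)
by_series = people.groupby(mapping).sum()

# Option 2: the raw array, aligned with the rows by position.
by_array = people.groupby(labels).sum()

print(by_series)
```

Compared with a key function that calls `.loc` once per row, the comparison here is evaluated for all rows in a single step, which tends to matter on larger frames.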
