查找每个唯一值的多列模式

ID Service1 Service2 Service3 .... Service10 Premium ---------------------------------------------------------------- 1 A B C Z XX 1 B C D Y XY 1 A B C O XX 2 R S T B XX

def servicemode(group): svcs_cols = [group['Service1'], group['Service2'], group['Service3'], group['Service4'], group['Service5'], group['Service6'], group['Service7'], group['Service8'], group['Service9'], group['Service10']] return pd.concat(dx_cols).dropna(inplace=False).agg(lambda x: pd.Series.mode(x)[0]) df.groupby('ID').apply(servicemode)

1条回答

网友

1楼 · 发布于 2024-05-02 18:28:58

我不确定您的原始代码有什么问题，但这里有一个解决方案：

import pandas as pd
from itertools import chain

>>>df
   Service1 Service2 Service3 Service10
ID
1         A        B        C         Z
1         B        C        D         Y
1         A        B        C         O
2         R        S        T         B

df_regsvc = df.groupby(df.index)['Service1','Service2','Service3','Service10'] \
    .apply(lambda x : list(chain.from_iterable([*x.values]))) \
    .apply(lambda x: max(x, key=x.count)).to_frame()

>>>df_regsvc
ID
1    B
2    R
dtype: object

# Join it with the aggregate for the Premium column
df_premium = df.groupby(df.index)['Premium'].agg(lambda x: pd.Series.mode(x)[0]).to_frame()
df_agg = df_regsvc.join(df_premium)

>>>df_agg
    0 Premium
ID
1   B         XX
2   R         XX

相关问题更多 >

编程相关推荐

热门问题

热门文章