使用pandas获取该行中第一个非零值的列名

import pandas as pd df=pd.read_csv('sample.csv') df_union=pd.DataFrame(columns=['cnum','supcol']) for col in df.columns: df1=df.filter(['cnum']).loc[df[col] == 1] df1['supcol']=col df_union=df_union.append(df1) print(df_union)

1条回答

网友

1楼 · 发布于 2024-09-28 03:15:33

似乎您可以在此处使用idxmax：

df.set_index('cnum').idxmax(axis=1).reset_index(drop=True)

0    sup1
1    sup1
2    sup3
3    sup2
dtype: object

df['output'] = df.set_index('cnum').idxmax(axis=1).reset_index(drop=True) 
# Slightly faster,
# df['output'] = df.set_index('cnum').idxmax(axis=1).to_numpy() 

df
         cnum  sup1  sup2  sup3  sup4 output
0   285414459     1     0     1     1   sup1
1   445633709     1     0     0     0   sup1
2   556714736     0     0     1     0   sup3
3  1089852074     0     1     0     1   sup2

另一个带有dot的选项（将提供所有非零列）：

d = df.set_index('cnum') 
d.dot(d.columns + ',').str.rstrip(',').reset_index(drop=True)

0    sup1,sup3,sup4
1              sup1
2              sup3
3         sup2,sup4
dtype: object

或者

(d.dot(d.columns + ',')
  .str.rstrip(',')
  .str.split(',', 1).str[0] 
  .reset_index(drop=True))

0    sup1
1    sup1
2    sup3
3    sup2
dtype: object

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用pandas获取该行中第一个非零值的列名

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >