Rearranging a table with pandas

| Customer | Email1 | Email2 |
| -------- | -------------- | --------------- |
| 1 | jdoe@mail.com | john_d@mail.com |
| 2 | jane1@mail.com | |
| 3 | adam@mail.com | |

2 answers

User

Answer 1 · edited 2024-09-30 01:24:06

You can use cumcount to build a MultiIndex, then reshape the data with unstack and rename the resulting columns with add_prefix:

    # cumcount() numbers each customer's rows from 0, so add 1 to get Email1, Email2, ...
    df = (df.set_index(['Customer', df.groupby('Customer').cumcount().add(1)])['Email']
            .unstack()
            .add_prefix('Email')
            .reset_index())
    print(df)

This gives the output you want.

User

Answer 2 · edited 2024-09-30 01:24:06
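The question's input table is not shown here, so the following is only a minimal, self-contained sketch of the cumcount/unstack approach. It assumes a long-format frame with one row per customer email and columns named `Customer` and `Email` (inferred from the answer's code); the sample rows and the names `position` and `wide` are illustrative.

```python
import pandas as pd

# Assumed long-format input: one row per (Customer, Email) pair.
df = pd.DataFrame({
    "Customer": [1, 1, 2, 3],
    "Email": ["jdoe@mail.com", "john_d@mail.com",
              "jane1@mail.com", "adam@mail.com"],
})

# Position of each email within its customer group, shifted to start at 1
# (cumcount itself starts at 0).
position = df.groupby("Customer").cumcount().add(1)

wide = (
    df.set_index(["Customer", position])["Email"]
      .unstack()             # inner index level becomes the columns 1, 2, ...
      .add_prefix("Email")   # columns become Email1, Email2, ...
      .reset_index()
)
print(wide)
```

Customers with fewer emails than the maximum get NaN in the remaining columns, and the number of `EmailN` columns grows with the largest group.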

You can use duplicated + merge + drop_duplicates:

    import pandas as pd

    # Rows for customers that already appeared earlier, i.e. their second email.
    df_seconds_email = df[df['Customer'].duplicated()]

    # Left-merge the original dataframe ('df') with the rows above. The suffixes
    # parameter names the second email column 'Email2'. Finally, drop duplicate
    # rows, keeping the first one per 'Customer'.
    df = pd.merge(df, df_seconds_email, how='left', on=['Customer'], suffixes=('', '2')).drop_duplicates(subset='Customer')
    print(df)

Output:

       Customer           Email           Email2
    0         1   jdoe@mail.com  john_d@mail.com
    1         2  jane1@mail.com              NaN
    2         3   adam@mail.com              NaN
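For reference, here is a runnable sketch of the same merge-based idea on assumed sample data (the `Customer`/`Email` long-format input is an assumption, since the question's input is not shown). The trailing `reset_index(drop=True)` is an addition, not part of the answer above: `drop_duplicates` keeps the original row labels, so the index may otherwise have gaps.

```python
import pandas as pd

# Assumed long-format input: one row per (Customer, Email) pair.
df = pd.DataFrame({
    "Customer": [1, 1, 2, 3],
    "Email": ["jdoe@mail.com", "john_d@mail.com",
              "jane1@mail.com", "adam@mail.com"],
})

# Rows whose Customer value has already been seen: the "second" emails.
second_emails = df[df["Customer"].duplicated()]

wide = (
    pd.merge(df, second_emails, how="left", on="Customer", suffixes=("", "2"))
      .drop_duplicates(subset="Customer")  # keep the first row per customer
      .reset_index(drop=True)              # added here to get a contiguous index
)
print(wide)
```

Note that this approach yields only `Email` and `Email2`: if a customer has three or more emails, the extras are dropped, whereas the cumcount/unstack approach produces one column per email.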
