Converting categorical columns into a single string column in Python

Posted 2024-06-28 11:52:38


This is a follow-up to a question I asked earlier: "Oneliner to create string column from multiple columns"

I want to combine a subset of a dataframe's columns into a new string column. @Zero gave me this solution to that problem:

import pandas as pd

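# 'code' is listed first so the column order matches the output shown below
# (recent pandas versions keep dict insertion order instead of sorting keys)
df = pd.DataFrame({'code': ['K2000', 'K2000', 'K2001'],
                   'gender': ['m', 'f', 'f']})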


col_names = df.columns
df_str = df[col_names].astype(str).apply('_'.join, axis=1)
df_str
Out[17]: 
0       K2000_m
1       K2000_f
2       K2001_f
dtype: object

However, this fails once I introduce interval data:

[code snippet lost in extraction; judging by the code below, it adds an interval-valued column df['num']]

Ideally, I would also like to convert the data to categorical (this fails as well):

df_cat = pd.concat([df['gender'].astype('category'), \
                    df['code'].astype('category'), \
                    df['num'].astype('category')], axis=1)
df_cat_str = df_cat[col_names].astype(str).apply('_'.join, axis=1)

What is going on here? How can I get the expected output?

0   K2000_m_(0, 5]
1  K2000_f_(5, 10]
2  K2001_f_(5, 10]

As in the previous question, col_names should be a list containing any subset of the columns (not necessarily all of them, as in this example).


1 answer

Answer #1 · Posted 2024-06-28 11:52:38

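You need to do the conversion to str inside the lambda function, so that the values are converted separately instead of calling astype(str) on the whole DataFrame: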

df_str = df[col_names].apply(lambda x: '_'.join(x.astype(str)), axis=1)
print(df_str)
0     K2000_m_(0, 5]
1    K2000_f_(5, 10]
2    K2001_f_(5, 10]
dtype: object
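
For reference, here is a minimal self-contained sketch of the same idea that can be run from scratch. The way the 'num' column is built (pd.cut over the values 1, 6, 9 with bins 0, 5, 10) is an assumption, since the question's own snippet for it is not available, and join_columns is a hypothetical helper name used only for illustration.

```python
import pandas as pd

# Sample data modelled on the question. Building 'num' with pd.cut is an
# assumption; the question only shows the resulting intervals.
df = pd.DataFrame({'code': ['K2000', 'K2000', 'K2001'],
                   'gender': ['m', 'f', 'f']})
df['num'] = pd.cut(pd.Series([1, 6, 9]), bins=[0, 5, 10])


def join_columns(frame, col_names, sep='_'):
    """Hypothetical helper: join the chosen columns row by row into one string."""
    # With axis=1 the lambda receives one row at a time; the str conversion
    # happens on that row, as in the answer above.
    return frame[list(col_names)].apply(
        lambda row: sep.join(row.astype(str)), axis=1)


# All three columns, with 'num' holding interval data
df_str = join_columns(df, ['code', 'gender', 'num'])

# Every column converted to categorical first, then only a subset joined
df_cat_str = join_columns(df.astype('category'), ['code', 'num'])

print(df_str)
print(df_cat_str)
```

Because col_names is passed as a plain list, the same call should work for any subset of the columns, which is what the question asks for.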
