在pandas datafram中将数据和标签拆分为两个单独的列

2条回答

网友

1楼 · 编辑于 2024-09-24 04:26:47

也可以在数据导入过程中执行此操作。您将需要使用正则表达式作为分隔符。表达式正在查找每行后面跟有某个内容的最后一个逗号。以下是一个很好的例子：

import pandas as pd
import io

txt = u"A bunch of text, with commas, punctuations etc.,ham"

with io.StringIO(txt) as f:
    df = pd.read_csv(f,
                     sep=",(?=[^,]+$)",
                     header=None,
                     engine="python",
                     names=['name', 'label']))

print(df)

应产生：

^{pr2}$

我希望这能起到作用。在

网友

2楼 · 编辑于 2024-09-24 04:26:47

假设这是您的原始数据帧：

df

                                                Col1
0  A bunch of text, with commas, punctuations etc...
1                                 test,foo,.bar,spam

使用df.str.rsplit。在,上拆分一次，并将结果展开为两列。df.rename将优雅地重命名您的列。在

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章

在pandas datafram中将数据和标签拆分为两个单独的列

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >