字符串列表中的每个元素都将被spli - 问答 - Python中文网

字符串列表中的每个元素都将被spli

2024-06-28 19:13:10 发布

您现在位置：Python中文网/ 问答频道 /正文

男 | 程序猿一只，喜欢编程写python代码。

我刚刚执行了熊猫系列对象，如下所示：

0        ['str1', 'str2', 'str3', 'str4', ...]
1        ['str5', 'str6', 'str7', 'str8', ...]
2        ['str9', 'str10', 'abcde.fghi', 'str12', ...]
.
.
.

在这里，我想保留格式，并像'abcde.fghi'带分隔符''

“替换”abcde.fghi用“abcde”
在“abcde”后插入“fghi”，后跟“str12”

它需要保持这种形式。我想要的是：

2        ['str9', 'str10', 'abcde', 'fghi', 'str12', ...]

一系列的清单不仅仅是它们。就像300000行一样，所以我需要在迭代中使用split函数。你知道吗

你知道吗+++ 我在应用了nltk word\u tokenize之后得到了这个系列。你知道吗

lists_above=mydataframe['textcolum'].apply(word_tokenize)

但是，由于数据中有许多没有空格的句子，所以我尝试了更多的步骤

Tags：对象 word tokenize abcde str1 str2 str3 str4

2条回答

网友

1楼 · 编辑于 2024-06-28 19:13:10

在pandas中使用lambda列表理解可以很容易地做到这一点：

df['words'] = df['words'].apply(lambda x: [item for sub in x for item in sub.split('.')])

网友

2楼 · 编辑于 2024-06-28 19:13:10

试试这个（第一个如果你的列表是一个字符串，第二个如果你的列表是一个列表）：

df['yourcolumn'].apply(lambda x: "', '".join(x.split('.'))) 

or

df['yourcolumn'].apply(lambda x:  eval("', '".join(str(x).split('.'))))

输出：

0       ['str1', 'str2', 'str3', 'str4', 'str4']
1       ['str5', 'str6', 'str7', 'str8', 'str1']
2    ['str9', 'str10', 'abcde', 'fghi', 'str12']

相关问题更多 >

编程相关推荐

热门问题

热门文章