如何将POS-tag函数应用于另一列pandas的句子中的字符串

from nltk import pos_tag columns = ['col1', 'col2'] data = pd.read_csv('data.csv', delimiter='\t', names=columns) data["col2"] = data.apply(lambda x: x["col1"].replace(x["col1"], pos_tag([x["col2"], x["col1"]])[1][1]), axis=1)

1条回答

网友

1楼 · 发布于 2024-10-02 08:29:12

一个简单的函数就可以完成这个任务。你可以这样做：

## import libraries
from nltk import word_tokenize, pos_tag, pos_tag_sents

## tag the sentece
df['col2'] = df['col2'].apply(word_tokenize).apply(pos_tag)

## this function does the magic 
def get_vals(lst):
    op = [] 
    for i, v in enumerate(lst):
        if i == 0:
            op.append(v[1])
        else:
            op.append(v[0])
    return ' '.join(op)

## apply the function
df['col2'] = df['col2'].apply(get_vals)

print(df)

   col1                      col2
0  aaa1     NNP is a great friend
1  abb2  NN is a very good friend

更新的解决方案：

此解决方案适合于用POS标记替换任何位置的字符串。在

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何将POS-tag函数应用于另一列pandas的句子中的字符串

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >