在列中查找元组列表的第一个元素的第一个单词？

import pandas as pd test = {'text': [ ('tom-mark', 'tom', 'tom is a good guy.'), ('Nick X','nick', 'Is that Nick?') ]}, {'text': [ ('juli', 'juli', 'Tom likes juli so much.'), ('tony', 'tony', 'Steve and Tony listen in as well.') ]}

2条回答

网友

1楼 · 编辑于 2024-05-19 15:04:30

您可以使用dataframe并使用函数映射text列的值以获得第一个名称，然后从该特定列的列表中创建列表

在函数内部，使用正则表达式仅从该列表中的所有元组中提取名字，并返回名字列表

import pandas as pd
import re


def get_first(x):
    return list(map(lambda tup: re.match(r'\w+', tup[0])[0].lower(), x))

test = {'text': [
    ('tom-mark', 'tom', 'tom is a good guy.'),
    ('Nick X','nick', 'Is that Nick?')
]}, {'text': [
    ('juli', 'juli', 'Tom likes juli so much.'),
    ('tony', 'tony', 'Steve and Tony listen in as well.')
]}

data = sum(pd.DataFrame(test).applymap(get_first)['text'].tolist(), [])

print(data)

网友

2楼 · 编辑于 2024-05-19 15:04:30

这样做是否有帮助：

import re

test = {'text': [
    ('tom-mark', 'tom', 'tom is a good guy.'),
    ('Nick X','nick', 'Is that Nick?'),
    ('juli', 'juli', 'Tom likes juli so much.'),
    ('tony', 'tony', 'Steve and Tony listen in as well.')]
}

first_names = []

for names in test['text']:
    name = re.match(r'\w+', names[0])
    first_names.append(name[0].lower())


print(first_names)

['tom', 'nick', 'juli', 'tony']

相关问题更多 >

编程相关推荐

热门问题

热门文章

在列中查找元组列表的第一个元素的第一个单词？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >