从三个lis中查找相关实体

Entities: ['Ashraf', 'Afghanistan', 'Afghanistan', 'Kabul'] Relations: ['Born', 'President', 'employee', 'Capital', 'Located', 'Lecturer', 'University'] sentence_list: ['Ashraf','Born', 'in', 'Kabul', '.' 'Ashraf', 'is', 'the', 'president', 'of', 'Afghanistan', '.', ...]

# read file with open('../data/parse.txt', 'r') as myfile: json_data = json.load(myfile) for i in range(len(json_data)): # the dataset was in json format if json_data[i]['word'] in relation(json_data)[0]: # I extract the relations print(json_data[i]['word']) if json_data[i]['word'] in entities(json_data)[0]: print(json[i]['word'])

json_data2 = [] for i in range(len(json_data)): json2_data.append(json_data[i]['word']) print(json_data2) ''' Now I tried if I can find any element of `Entities` list and `Relations` list in each sentence of `sentence_list`. And then it should store matched entities and relations based on sentence to a list. ''' for line in json_data2: for rel in relation(obj): for ent in entities(obj): match = re.findall(rel, line['word']) if match: print('word matched relations: %s ==> word: %s' % (rel, line['address'])) match2 = re.findall(ent, line['word']) if match2: print('word matched entities: %s ==> word: %s' % (ent, line['address']))

1条回答

网友

1楼 · 发布于 2024-09-29 21:32:56

您可以使用以下list comprehension：

to_match = set(Entities+Relations)
l = [{j for j in to_match if j in i} 
        for i in ' '.join(sentence_list).split('.')[:-1]]

输出

[{'Ashraf', 'Born', 'Kabul'}, {'Afghanistan', 'Ashraf'}]

请注意，我正在返回一个sets列表以避免重复值，例如在EntitiesAfghanistan中出现两次。你知道吗

有用的阅读：

相关问题更多 >

编程相关推荐

热门问题

热门文章