如何将列表中的精确字符串与考虑到空格的较大字符串进行匹配？

example_list = ['pain', 'chestpain', 'headache', 'sickness', 'morning sickness'] example_text = "The patient has kneepain as wel as a headache" emptylist = [] for i in example_text: res = [ele for ele in example_list if(ele in i)] emptylist.append(res)

3条回答

网友

1楼 · 编辑于 2024-10-03 06:21:19

使用PyParsing:

import pyparsing as pp

example_list = ['pain', 'chestpain', 'headache', 'sickness', 'morning sickness']
example_text = "The patient has kneepain as wel as a headache morning sickness"

list_of_matches = []

for word in example_list:
  rule = pp.OneOrMore(pp.Keyword(word))
  for t, s, e in rule.scanString(example_text):
    if t:
      list_of_matches.append(t[0])

print(list_of_matches)

这将产生：

['headache', 'sickness', 'morning sickness']

网友

2楼 · 编辑于 2024-10-03 06:21:19

您应该能够使用使用单词边界的正则表达式

>>> import re
>>> [word for word in example_list if re.search(r'\b{}\b'.format(word), example_text)]
['headache']

这将与'kneepain'中的'pain'不匹配，因为它不是以单词边界开始的。但它会正确地匹配包含空格的子字符串

网友

3楼 · 编辑于 2024-10-03 06:21:19

成员们在这篇文章中已经提供了很好的例子

我对疼痛不止一次的匹配文本进行了挑战。我还想了解更多关于比赛地点的信息。我最终得到了以下代码

我写了下面的句子

"The patient has not only kneepain but headache and arm pain, stomach pain and sickness"

import re
from collections import defaultdict

example_list = ['pain', 'chestpain', 'headache', 'sickness', 'morning sickness']
example_text = "The patient has not only kneepain but headache and arm pain, stomach pain and sickness"

TruthFalseDict = defaultdict(list)
for i in example_list:
    MatchedTruths = re.finditer(r'\b%s\b'%i, example_text)
    if MatchedTruths:
        for j in MatchedTruths:
            TruthFalseDict[i].append(j.start())

print(dict(TruthFalseDict))

上面给出了以下输出

{'pain': [55, 69], 'headache': [38], 'sickness': [78]}

相关问题更多 >

编程相关推荐

热门问题

热门文章