This might be simpler in awk, but how would I express it in Python?

3 Answers

User

Answer 1 · edited 2024-05-18 15:21:11

This solution fits your example rather than your description: only the first letter has to alliterate:

import re
# fin is assumed to hold the text being searched; each pair is (full match, first letter)
pairs = re.findall(r'((.)\w* is for \2\w* \2\w*ing his \2\w*)', fin, re.IGNORECASE)
matches = [p[0] for p in pairs]

To search for cases that match your description instead, replace (.) with (\w+) and remove every instance of \w*.
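As a rough sketch of how the two variants differ, here they are side by side. The `sample` string and the variable names are made up for illustration and are not from the question:

```python
import re

# Hypothetical sample text; the real input would be the contents of the book.
sample = ("B is for Bobby baking his bread. "
          "Z is for Andy eating his pie. "
          "sing is for sing singing his sing.")

# Variant 1: only the first letter has to repeat (alliteration).
first_letter = re.compile(r'((.)\w* is for \2\w* \2\w*ing his \2\w*)', re.IGNORECASE)
alliterations = [m[0] for m in first_letter.findall(sample)]

# Variant 2: the whole word has to repeat, as in the description.
whole_word = re.compile(r'((\w+) is for \2 \2ing his \2)', re.IGNORECASE)
repeats = [m[0] for m in whole_word.findall(sample)]

print(alliterations)
print(repeats)
```

With this sample, the first pattern should pick up both the "Bobby" sentence and the "sing" sentence, while the second keeps only the one where the exact word repeats.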

User
Answer 2 · edited 2024-05-18 15:21:11

import re
# read the book into a variable 'text'
matches = re.findall(r'\w+ is for \w+ \w+ing his \w+', text)
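A small sketch of what this pattern does on a made-up string (the `text` value below is hypothetical). Since the pattern has no backreferences, it matches the sentence shape with any words and does not check that the words share a letter:

```python
import re

# Hypothetical stand-in for the book's contents.
text = "A is for Amy eating his apple. This line does not fit the shape."

# No groups in the pattern, so findall returns the full matched strings.
matches = re.findall(r'\w+ is for \w+ \w+ing his \w+', text)
print(matches)
```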

User
Answer 3 · edited 2024-05-18 15:21:11

You can do this with regular expressions in Python:

import re
pattern = re.compile(r'(?P<word>.*) is for (?P=word) (?P=word)ing his (?P=word)')
words = pattern.findall(text)

This does not match your example, but it will match [word] is for [word] [word-part]ing his [word]. Season to taste. You can find more details in the re module docs.
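One thing worth knowing when adapting this: with a single capturing group, `findall` returns the captured word rather than the whole phrase. A minimal sketch, assuming a made-up `text` and swapping `.*` for `\w+` so the group is limited to one word (that swap is my assumption, not part of the answer above):

```python
import re

# Hypothetical input string.
text = "sing is for sing singing his sing"

# Named group plus (?P=word) backreferences; \w+ used here instead of .*
pattern = re.compile(r'(?P<word>\w+) is for (?P=word) (?P=word)ing his (?P=word)')

# One group in the pattern, so findall yields the group's value.
words = pattern.findall(text)

# finditer gives match objects, so group(0) recovers the full phrase.
phrases = [m.group(0) for m in pattern.finditer(text)]

print(words)
print(phrases)
```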