使用以正则表达式为键的dict进行多个正则表达式替换

import re text = "local foals drink cola" subs_dict = {"a":"w", "l":"co"} subs_regex = re.compile("|".join(subs_dict.keys())) text = re.sub(subs_regex, lambda match: subs_dict[match.group(0)], text) print(text) # "coocwco fowcos drink cocow"

import re text = "local foals drink cola" subs_dict = {"(?<=o)a":"w", "l(?=a)":"co"} subs_regex = re.compile("|".join(subs_dict.keys())) text = re.sub(subs_regex, lambda match: subs_dict[match.group(0)], text) >>> KeyError: "a"

import re text = "local foals drink cola" subs_dict = {"(?<=o)a":"w", "l(?=a)":"co"} subs_regex = re.compile("|".join("("+key+")" for key in subs_dict)) group_index = 1 indexed_subs = {} for target, sub in subs_dict.items(): indexed_subs[group_index] = sub group_index += re.compile(target).groups + 1 text = re.sub(subs_regex, lambda match: indexed_subs[match.lastindex], text) print(text) # "local fowls drink cocoa"

2条回答

网友

1楼 · 编辑于 2024-10-01 11:34:30

您可以通过将密钥保持为预期匹配并将replace和regex存储在嵌套的dict中来实现这一点。鉴于您希望匹配特定的字符，此定义应该有效

subs_dict = {"a": {'replace': 'w', 'regex': '(?<=o)a'}, 'l': {'replace': 'co', 'regex': 'l(?=a)'}}
subs_regex = re.compile("|".join([subs_dict[k]['regex'] for k in subs_dict.keys()]))
re.sub(subs_regex, lambda match: subs_dict[match.group(0)]['replace'], text)

'local fowls drink cocoa'

网友

2楼 · 编辑于 2024-10-01 11:34:30

如果没有要使用的表达式与空字符串匹配（如果要替换，这是一个有效的假设），则可以在|使用表达式之前使用组，然后检查找到匹配项的组：

(exp1)|(exp2)|(exp3)

或者命名组，这样就不必计算子表达式中的子组

替换功能可以查看哪个组匹配，并从列表中选择替换

我提出了这个实现：


import re
def dictsub(replacements, string):
    """things has the form {"regex1": "replacement", "regex2": "replacement2", ...}"""
    exprall = re.compile("|".join("("+x+")" for x in replacements))
    gi = 1
    replacements_by_gi = {}
    for (expr, replacement) in replacements.items():
        replacements_by_gi[gi] = replacement
        gi += re.compile(expr).groups + 1


    def choose(match):
        return replacements_by_gi[match.lastindex]

    return re.sub(exprall, choose, string)


text = "local foals drink cola"
print(dictsub({"(?<=o)a":"w", "l(?=a)":"co"}, text))

打印local fowls drink cocoa

相关问题更多 >

编程相关推荐

热门问题

热门文章