Replacing multiple entities in a text file using regular expressions

import re

def multireplace(Text, Vars):
    # Longest keys first, so a key is never shadowed by one of its own prefixes.
    dictSorted = sorted(Vars, key=len, reverse=True)
    regEx = re.compile('|'.join(map(re.escape, dictSorted)))
    return regEx.sub(lambda match: Vars[match.group(0)], Text)

text = multireplace(text, find_replace_dict)
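A short sketch of why the keys are sorted longest-first: regex alternation tries its branches left to right, so a shorter key that is a prefix of a longer one would otherwise win. The dictionary and sample text below are made up for illustration and are not from the question.

```python
import re

def multireplace(text, replacements):
    # Longest keys first, so "cats" is tried before its prefix "cat".
    keys = sorted(replacements, key=len, reverse=True)
    pattern = re.compile("|".join(map(re.escape, keys)))
    return pattern.sub(lambda match: replacements[match.group(0)], text)

# Hypothetical sample data.
find_replace_dict = {"cat": "dog", "cats": "lions"}

sorted_result = multireplace("cat cats", find_replace_dict)

# For contrast: the same pattern built in insertion order ("cat" before "cats").
unsorted_pattern = re.compile("|".join(map(re.escape, find_replace_dict)))
unsorted_result = unsorted_pattern.sub(
    lambda match: find_replace_dict[match.group(0)], "cat cats"
)

print(sorted_result)    # dog lions
print(unsorted_result)  # dog dogs
```

In the unsorted case the key "cats" is never matched, because "cat" always matches first at the same position.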

1 answer

User

#1 · Posted 2024-10-02 12:25:50

Take a look and read through the comments. Let me know if anything doesn't make sense:

import re

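def replace(text, replacements):
    # Copy each list so that popping from it doesn't modify the caller's data.
    # (dict.copy() alone is shallow and would still share the lists.)
    replacements = {key: list(values) for key, values in replacements.items()}

    # This is essentially what you had already: escaped keys, longest first,
    # joined into a single alternation.
    keys = sorted(replacements, key=len, reverse=True)
    regex = re.compile("|".join(map(re.escape, keys)))

    # In our lambda, we pop the first element from the list. This way, each
    # time we're called with the same match, we get the next replacement.
    return regex.sub(lambda m: replacements[m.group(0)].pop(0), text)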

print(replace("A A B B A B", {"A": ["A1", "A2", "A3"], "B": ["B1", "B2", "B3"]}))

# Output:
# A1 A2 B1 B2 A3 B3
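A note on the copying step: `dict.copy()` is shallow, so the lists stored as values are still shared with the caller's dictionary, and popping from them changes the caller's data too. A minimal sketch of the difference, using made-up data:

```python
# Shallow copy: the new dict holds the very same list objects.
original = {"A": ["A1", "A2"]}
shallow = original.copy()
shallow["A"].pop(0)
after_shallow = list(original["A"])   # the caller's list lost "A1"

# Per-key copy: each list is duplicated, so the caller's data is untouched.
original = {"A": ["A1", "A2"]}
per_key = {key: list(values) for key, values in original.items()}
per_key["A"].pop(0)
after_per_key = list(original["A"])   # the caller's list is intact

print(after_shallow)   # ['A2']
print(after_per_key)   # ['A1', 'A2']
```

This only matters if the same replacements dictionary is reused after the call, for example to process a second file.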

Update

To help with the problem raised in the comments below, try this version. It tells you exactly which string ran out of replacements:

import re

def replace(text, replacements):

    # Use a named function so we can do a little more than the lambda could.
    def make_replacement(match):
        try:
            return replacements[match.group(0)].pop(0)
        except IndexError:
            # Print out debug info about what happened.
            print("Ran out of replacements for {}".format(match.group(0)))
            # Re-raise so the process still exits.
            raise

    # Copy each list so that popping from it doesn't modify the caller's data.
    # (dict.copy() alone is shallow and would still share the lists.)
    replacements = {key: list(values) for key, values in replacements.items()}

    # This is essentially what you had already: escaped keys, longest first,
    # joined into a single alternation.
    keys = sorted(replacements, key=len, reverse=True)
    regex = re.compile("|".join(map(re.escape, keys)))

    # make_replacement pops the first element from the list, so each time the
    # same string matches again, it gets the next replacement.
    return regex.sub(make_replacement, text)

print(replace("A A B B A B A", {"A": ["A1", "A2", "A3"], "B": ["B1", "B2", "B3"]}))

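# Output (the text has four A's but only three replacements for A):
# Ran out of replacements for A
# ...followed by the traceback for "IndexError: pop from empty list"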
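If raising is not what you want, one alternative is to leave the matched text unchanged once its replacements run out. That behavior is an assumption on my part, not something the question asked for, and `replace_with_fallback` is a made-up name. A minimal sketch:

```python
import re

def replace_with_fallback(text, replacements):
    # One independent iterator per key; the caller's lists are never modified.
    pending = {key: iter(values) for key, values in replacements.items()}

    # Escaped keys, longest first, joined into a single alternation.
    keys = sorted(pending, key=len, reverse=True)
    regex = re.compile("|".join(map(re.escape, keys)))

    def make_replacement(match):
        # next() with a default returns the matched text itself once the
        # iterator for this key is exhausted, instead of raising.
        return next(pending[match.group(0)], match.group(0))

    return regex.sub(make_replacement, text)

result = replace_with_fallback(
    "A A B B A B A",
    {"A": ["A1", "A2", "A3"], "B": ["B1", "B2", "B3"]},
)
print(result)

# Output:
# A1 A2 B1 B2 A3 B3 A
```

The trade-off is that a shortage of replacements now passes silently, so the raising version above is the safer choice when every match is expected to be replaced.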
