如何计算正则表达式字符串python 3中的所有匹配项

regmac = re.compile("^(([a-fA-F0-9]{2}-){5}[a-fA-F0-9]{2}|([a-fA-F0-9]{2}:){5}[a-fA-F0-9]{2}|([0-9A-Fa-f]{4}\.){2}[0-9A-Fa-f]{4})?$") regmac1 = "^(([a-fA-F0-9]{2}-){5}[a-fA-F0-9]{2}|([a-fA-F0-9]{2}:){5}[a-fA-F0-9]{2}|([0-9A-Fa-f]{4}\.){2}[0-9A-Fa-f]{4})?$" with open(file, 'r') as i: for line in i.read().split('\n'): #matches = re.findall(regmac1, line) matches = regmac.findall(line) print(matches.count(regmac1))) macmatch = len(matches) macmatch += 1 print(macmatch)

1条回答

网友

1楼 · 发布于 2024-09-30 06:28:56

每次转到新行时，都会重置macmatch。在for循环外部初始化macmatch，然后它就会工作。你的正则表达式中也有很多捕获组，这可能会影响你的比赛计数。可以在括号内使用?:来防止创建捕获组，如下所示：

^((?:[a-fA-F0-9]{2}-){5}[a-fA-F0-9]{2}|(?:[a-fA-F0-9]{2}:){5}[a-fA-F0-9]{2}|(?:[0-9A-Fa-f]{4}\.){2}[0-9A-Fa-f]{4})?$

如果您没有尝试验证MAC地址的准确性，而是只查找看起来像MAC地址的字符串（因此9C:30:5B:BB-66-7B也是可以接受的），您可以显著缩短正则表达式：

^((?:[a-fA-F0-9]{2}[:-]){5}[a-fA-F0-9]{2}|(?:[0-9A-Fa-f]{4}\.){2}[0-9A-Fa-f]{4})?$

然后您可以运行：

with open(file, 'r') as i:
    macmatch = 0
    for line in i.readlines():
        matches = regmac.findall(line)
        macmatch += len(matches)
        # OR: macmatch += (1 if matches else 0)

    print(macmatch)

相关问题更多 >

编程相关推荐

热门问题

热门文章