I need help figuring out why my regular expression seems to be non-deterministic

import re

def getChordMatches(line):
    notes = "[ABCDEFG]"
    accidentals = "(?:#|##|b|bb)?"
    chords = "(?:maj|min|m|sus|aug|dim)?"
    additions = "[0-9]?"
    chordFormPattern = notes + accidentals + chords + additions
    fullPattern = chordFormPattern + r"(?:/%s)?\s" % (notes + accidentals)
    matches = [removeWhitespaces(x) for x in re.findall(fullPattern, line)]
    positions = [x.start() for x in re.finditer(fullPattern, line)]
    return matches, positions

hexdump -s 45 -n 99 input.txt
000002d 20 41 6d 20 20 20 20 20 20 20 20 20 20 41 6d 2f
000003d 47 20 c2 a0 20 20 20 20 20 20 44 37 2f 46 23 20
000004d 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20
000005d 46 6d 61 6a 37 0a 49 20 6c 6f 6f 6b 20 61 74 20
000006d 79 6f 75 20 61 6c 6c 20 73 65 65 20 74 68 65 20
000007d 6c 6f 76 65 20 74 68 65 72 65 20 74 68 61 74 27
000008d 73 20 73
0000090

2 answers

User

Answer 1 · edited 2024-06-26 14:53:43

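I think the problem is that the line you posted does not have a character matching \s after the chord, while the regex requires a whitespace character there. In any case the regex is wrong, because it demands whitespace even after the last chord on the line.

Try using \b instead of \s.

(Edited after comments)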
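To make the difference concrete, here is a small sketch that runs the question's pattern with three different endings on a made-up sample line. The sample line and the variable names are assumptions for illustration only. The lookahead variant is not part of the answer above; it is included because \b cannot sit between "#" and a space (neither is a word character), so a chord such as D7/F# loses its sharp when \b is used.

```python
import re

# Chord pattern assembled exactly as in the question, minus the trailing \s.
NOTES = "[ABCDEFG]"
ACCIDENTALS = "(?:#|##|b|bb)?"
CHORD = (NOTES + ACCIDENTALS + "(?:maj|min|m|sus|aug|dim)?" + "[0-9]?"
         + "(?:/%s)?" % (NOTES + ACCIDENTALS))

# Hypothetical chord line; the last chord has no trailing whitespace.
line = "Am   D7/F#   Fmaj7"

# Original ending: needs a whitespace character, so the final chord is missed.
with_space = re.findall(CHORD + r"\s", line)

# Suggested ending: a word boundary. Finds the final chord, but drops the "#".
with_boundary = re.findall(CHORD + r"\b", line)

# Assumed alternative: require whitespace or end of string without consuming it.
with_lookahead = re.findall(CHORD + r"(?=\s|$)", line)

print(with_space)      # ['Am ', 'D7/F# ']
print(with_boundary)   # ['Am', 'D7/F', 'Fmaj7']
print(with_lookahead)  # ['Am', 'D7/F#', 'Fmaj7']
```

Because the lookahead does not consume the whitespace, the matches need no stripping afterwards.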

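User

Answer 2 · edited 2024-06-26 14:53:43

I suspect the problem is related to these two bytes:

000003d 47 20 c2 a0 20 20 ...

The bytes c2 a0 look like a UTF-8-encoded non-breaking space (U+00A0). I would not be surprised if that is what is tripping up your regex.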
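A quick way to check this is to decode the bytes from the hexdump and see how \s treats them. The sketch below assumes Python 3, and the variable names are illustrative. It shows that \s matches U+00A0 when the pattern and the input are str, but not when both are bytes, so the outcome may depend on how the file was read.

```python
import re

# Bytes copied from the hexdump: "Am/G", a space, c2 a0, then another space.
raw = bytes.fromhex("41 6d 2f 47 20 c2 a0 20")
text = raw.decode("utf-8")

# The two bytes c2 a0 decode to a single non-breaking space.
nbsp_in_text = "\u00a0" in text

# str pattern: \s covers Unicode whitespace, which includes U+00A0.
str_match = re.fullmatch(r"\s", "\u00a0") is not None

# bytes pattern: \s covers ASCII whitespace only, so c2 a0 does not match.
bytes_match = re.search(rb"\s", b"\xc2\xa0") is not None

print(nbsp_in_text, str_match, bytes_match)  # True True False
```

If the non-breaking space is unwanted, replacing "\u00a0" with a plain space before matching is one simple option.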
