Python使用详细正则表达式解析用户输入

text = input("please type somewhat coherently: ") pattern = r'''(?x) # set flag to allow verbose regexps (?:[A-Z]\.)+ # abbreviations, e.g. U.S.A. |\w+(?:[-']\w+)* # permit word-internal hyphens and apostrophes |[-.(]+ # double hyphen, ellipsis, and open parenthesis |\S\w* # any sequence of word characters # |[\d+(\.\d+)?%] # percentages, 82% |[][\{\}.,;"'?():-_`] # these are separate tokens ''' parsed = re.findall(pattern, text) print(parsed)

1条回答

网友

1楼 · 发布于 2024-06-26 08:16:14

如果您只想将百分比作为一个整体进行匹配，那么您真的应该知道regex引擎从左到右分析输入字符串和模式。如果您有一个备选方案，将选择与输入字符串匹配的最左边的备选方案，其余的甚至不会被测试。你知道吗

因此，您需要向上拉可选的\d+(?:\.\d+)?，并且捕获组应该变成非捕获组，否则findall将产生奇怪的结果：

(?x)              # set flag to allow verbose regexps
(?:[A-Z]\.)+                # abbreviations, e.g. U.S.A.
|\d+(?:\.\d+)?%           # percentages, 82%  <  PULLED UP OVER HERE
|\w+(?:[-']\w+)*            # permit word-internal hyphens and apostrophes
|[-.(]+                     # double hyphen, ellipsis, and open parenthesis
|\S\w*                       # any sequence of word characters#
|[][{}.,;"'?():_`-]       # these are separate tokens

见regex demo。你知道吗

另外，请注意，我用[][{}.,;"'?():_`-]替换了[][\{\}.,;"'?():-_`]：大括号不必转义，并且-从冒号（十进制代码58）和下划线（十进制95）形成了一个不必要的范围，匹配;、<、=、>、?、@、所有大写拉丁字母、[、\、]和^。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章