Pyparsing:dblQuotedString在nestedExp中的解析方式不同

QUOTED = QuotedString(quoteChar = '“', endQuoteChar = '”', unquoteResults = False).setParseAction(remove_curlies) WWORD = Word(alphas8bit + printables.replace("(", "").replace(")", "")) WORDS = Combine(OneOrMore(dblQuotedString | QUOTED | WWORD), joinString = ' ', adjacent = False) TERM = OneOrMore(WORDS) NESTED = OneOrMore(nestedExpr(content = TERM)) query = '(dog* OR boy girl w/3 ("girls n dolls" OR friends OR "best friend" OR (friends w/10 enemies)))'

1条回答

网友

1楼 · 发布于 2024-09-30 18:14:59

nestedExpr接受一个可选的关键字参数ignoreExpr，以接受一个表达式，nestedExpr应使用该表达式忽略否则将被解释为嵌套的开始符或闭包符的字符，默认值是pyparsing的quotedString，它被定义为sglQuotedString | dblQuotedString。这是为了处理如下字符串：

(this has a tricky string "string with )" )

由于默认的ignoreExpr是quotedString，引号中的“）”不会被误解为右括号。在

但是，content参数也与dblQuotedString匹配。前引号字符串由nestedExpr在内部匹配，方法是跳过可能包含“（）”s的带引号的字符串，然后匹配内容，这也匹配带引号的字符串。可以使用NoMatch来抑制nestedExpr的ignore表达式：

^{pr2}$

现在应该可以给你：

[['dog* OR boy girl w/3',
 ['"girls n dolls" OR friends OR "best friend" OR', ['friends w/10 enemies']]]]

您可以在https://pythonhosted.org/pyparsing/pyparsing-module.html#nestedExpr找到更多细节和示例

相关问题更多 >

编程相关推荐

热门问题

热门文章