Wrong line number in parse exception

1 answer

User

#1 · Posted 2024-10-06 19:27:07

This behavior is a feature of pyparsing, not a bug, and it needs to be handled (or worked around) with some care.

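When pyparsing fails to match somewhere inside a complex expression, it unwinds its parse stack back to the last expression that matched completely. You know that once "component" has matched, anything that goes wrong after it must be an error in the component definition, but pyparsing does not. So when a failure occurs after the opening keyword, pyparsing backs up and reports that the whole keyword expression, keyword included, failed to match, and the reported location is where that expression started rather than where the real error is.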

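With a command-style grammar like this one, the keywords are usually unambiguous. For example, after matching "component", anything other than an identifier followed by a brace-delimited body is an error. By replacing the '+' operator with the '-' operator, you tell pyparsing that it should not back up past "component".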
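To make the difference concrete, here is a minimal sketch, not the grammar from the question. The names body_plus, body_minus, and error_line are invented for this illustration, and the component body is deliberately reduced to empty braces. It parses the same broken input with '+' and with '-' and returns the line number carried by the exception. Note that '-' raises ParseSyntaxException, which is not a subclass of ParseException, so the sketch catches ParseBaseException.

```python
from pyparsing import (Keyword, ParseBaseException, StringEnd, Suppress,
                       Word, ZeroOrMore, alphas)

LBRACE, RBRACE, SEMI = map(Suppress, "{};")
COMPONENT = Keyword("component")
name = Word(alphas)

# Same toy expression twice; the only difference is '+' versus '-'
# after the keyword.
body_plus = COMPONENT + name + LBRACE + RBRACE + SEMI
body_minus = COMPONENT - name + LBRACE + RBRACE + SEMI

# The stray word on line 3 is the real error.
text = """\
component publish
{
    oops
};
"""


def error_line(component_expr, source):
    """Return the line number reported by the parse failure, or None."""
    grammar = ZeroOrMore(component_expr) + StringEnd()
    try:
        grammar.parseString(source)
    except ParseBaseException as err:
        return err.lineno
    return None


print("with '+':", error_line(body_plus, text))
print("with '-':", error_line(body_minus, text))
```

With '+', ZeroOrMore silently gives up on the component and the failure surfaces at the start of the input (line 1). With '-', the failure stays at the stray word (line 3).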

Looking at your grammar, I would take a step back and write out a short BNF first (always good practice):

communications    ::= 'communications' '{' communicationList* '}' ';'
language          ::= 'language' ('cpp' | 'python') ';'
componentContents ::= communications | language | gui | options
component         ::= 'component' identifier '{' componentContents+ '}' ';'
CDSL              ::= idslImports component

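When a grammar contains keywords, I always recommend Keyword or CaselessKeyword over Literal or CaselessLiteral. The Literal classes do not enforce word boundaries, so if I used Literal("no") as part of a grammar, it could match the leading "no" of "not", "none", "nothing", and so on.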
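A small sketch of that word-boundary difference follows. The helper name matches is invented for this illustration.

```python
from pyparsing import Keyword, Literal, ParseException


def matches(expr, text):
    """Return True if expr matches at the start of text."""
    try:
        expr.parseString(text)
    except ParseException:
        return False
    return True


# Literal accepts the leading "no" of a longer word.
print(matches(Literal("no"), "nothing"))
# Keyword rejects it, because "no" is followed by another identifier character.
print(matches(Keyword("no"), "nothing"))
# Keyword accepts "no" when it stands alone as a word.
print(matches(Keyword("no"), "no thing"))
```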

Here is how I would implement this BNF. (I use the shortcut form of setResultsName, which I find keeps the grammar itself less cluttered.)
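For readers who have not seen the shortcut: calling an expression with a string, as in expr("name"), is equivalent to expr.setResultsName("name"). A minimal sketch with a made-up key/value expression:

```python
from pyparsing import Word, alphas, nums

# Explicit form.
long_form = (Word(alphas).setResultsName("key")
             + Word(nums).setResultsName("value"))

# Shortcut form: call the expression with the results name.
short_form = Word(alphas)("key") + Word(nums)("value")

long_result = long_form.parseString("width 42")
short_result = short_form.parseString("width 42")

# Named results are available by key or by attribute.
print(short_result["key"], short_result.value)
print(long_result.asDict() == short_result.asDict())
```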

from pyparsing import (CaselessKeyword, Group, Suppress, ZeroOrMore,
                       pyparsing_common)

LBRACE, RBRACE, SEMI = map(Suppress, "{};")
identifier = pyparsing_common.identifier

# keywords - extend as needed
(IMPORT, COMMUNICATIONS, LANGUAGE, COMPONENT, CPP, 
 PYTHON, REQUIRES, IMPLEMENTS) = map(CaselessKeyword, """
    IMPORT COMMUNICATIONS LANGUAGE COMPONENT CPP PYTHON 
    REQUIRES IMPLEMENTS""".split())

# keyword-leading expressions, use '-' operator to prevent backtracking once significant keyword is parsed
communicationItem = Group((REQUIRES | IMPLEMENTS) - identifier + SEMI)
communications = Group( COMMUNICATIONS.suppress() - LBRACE + ZeroOrMore(communicationItem) + RBRACE + SEMI)
language = Group(LANGUAGE.suppress() - (CPP | PYTHON) + SEMI)

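# gui and options are not defined in this answer; once they are, add them
# here with '&' in the same way, e.g. ... & gui('gui') & options('options')
componentContents = communications('communications') & language('language')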
component = Group(COMPONENT - identifier("name") + Group(LBRACE + componentContents + RBRACE)("properties") + SEMI)

# idslImports comes from the grammar in the question and is not defined here
# CDSL = idslImports("imports") + component("component")

Parsing the sample component with:

sample = """\
Component publish
{
    Communications
    {
        requires test;
        implements test;
    };
    language python;
};
"""

component.runTests([sample])

gives:

[['COMPONENT', 'publish', [[['REQUIRES', 'test'], ['IMPLEMENTS', 'test']], ['PYTHON']]]]
[0]:
  ['COMPONENT', 'publish', [[['REQUIRES', 'test'], ['IMPLEMENTS', 'test']], ['PYTHON']]]
  - name: 'publish'
  - properties: [[['REQUIRES', 'test'], ['IMPLEMENTS', 'test']], ['PYTHON']]
    - communications: [['REQUIRES', 'test'], ['IMPLEMENTS', 'test']]
      [0]:
        ['REQUIRES', 'test']
      [1]:
        ['IMPLEMENTS', 'test']
    - language: ['PYTHON']

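(By the way, I like using the '&' operator, which builds pyparsing's Each class, for unordered matching of the different contents. I think it makes the parser friendlier and more robust. It turns out that Each conflicts slightly with the '-' operator, which I will have to resolve in the next release.)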
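As a rough illustration of unordered matching with '&', here is a reduced sketch. The gui rule and its "gui Qt;" syntax are hypothetical, since the answer does not define gui. It uses '+' after the keywords rather than '-' to stay clear of the Each conflict mentioned above.

```python
from pyparsing import Keyword, Suppress, Word, alphas

SEMI = Suppress(";")

language = (Keyword("language").suppress()
            + (Keyword("cpp") | Keyword("python"))("language")
            + SEMI)

# Hypothetical rule, only here to give Each a second alternative.
gui = Keyword("gui").suppress() + Word(alphas)("gui") + SEMI

# '&' builds an Each: both parts are required, in any order.
settings = language & gui

first = settings.parseString("language python; gui Qt;")
second = settings.parseString("gui Qt; language python;")

print(first["language"], first["gui"])
print(second["language"], second["gui"])
```

Both orderings should yield the same named results, so code that consumes the parse results does not need to care how the user ordered the sections.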
