how to report grammar ambiguity in antlr4

a 夏天 提交于 2019-12-11 05:29:47

问题


According to the antlr4 book (page 159), and using the grammar Ambig.g4, grammar ambiguity can be reported by:

grun Ambig stat -diagnostics

or equivalently, in code form:

parser.removeErrorListeners();
parser.addErrorListener(new DiagnosticErrorListener());
parser.getInterpreter().setPredictionMode(PredictionMode.LL_EXACT_AMBIG_DETECTION);

The grun command reports the ambiguity properly for me, using antlr-4.5.3. But when I use the code form, I dont get the ambiguity report. Here is the command trace:

$ antlr4 Ambig.g4   # see the book's page.159 for the grammar
$ javac Ambig*.java
$ grun Ambig stat -diagnostics < in1.txt # in1.txt is as shown on page.159
    line 1:3 reportAttemptingFullContext d=0 (stat), input='f();'
    line 1:3 reportAmbiguity d=0 (stat): ambigAlts={1, 2}, input='f();'
$ javac TestA_Listener.java
$ java TestA_Listener < in1.txt   # exits silently

The TestA_Listener.java code is the following:

import org.antlr.v4.runtime.*;
import org.antlr.v4.runtime.atn.*; // for PredictionMode
import java.util.*;
public class TestA_Listener {
    public static void main(String[] args) throws Exception {
        ANTLRInputStream input = new ANTLRInputStream(System.in);
        AmbigLexer lexer = new AmbigLexer(input);
        CommonTokenStream tokens = new CommonTokenStream(lexer);
        AmbigParser parser = new AmbigParser(tokens);
        parser.removeErrorListeners(); // remove ConsoleErrorListener
        parser.addErrorListener(new DiagnosticErrorListener());
        parser.getInterpreter().setPredictionMode(PredictionMode.LL_EXACT_AMBIG_DETECTION);
        parser.stat();
    }
}

Can somebody please point out how the above java code should be modified, to print the ambiguity report?

For completeness, here is the code Ambig.g4 :

grammar Ambig;

stat: expr ';'        // expression statement
    | ID '(' ')' ';'  // function call statement
    ;

expr: ID '(' ')'
    | INT
    ;

INT :   [0-9]+ ;
ID  :   [a-zA-Z]+ ;
WS  :   [ \t\r\n]+ -> skip ;

And here is the input file in1.txt :

f();

回答1:


Antlr4 is a top-down parser, so for the given input, the parse match is unambiguously:

stat -> expr -> ID -> ( -> ) -> stat(cnt'd) -> ;

The second stat alt is redundant and never reached, not ambiguous.

To resolve the apparent redundancy, a predicate might be used:

stat: e=expr {isValidExpr($e)}? ';'   #exprStmt
    | ID '(' ')' ';'                  #funcStmt
    ;

When isValidExpr is false, the function statement alternative will be evaluated.




回答2:


I waited for several days for other people to post their answers. Finally after several rounds of experimenting, I found an answer:

The following line should be deleted from the above code. Then we get the same ambiguity report as given by grun.

parser.removeErrorListeners(); // remove ConsoleErrorListener


来源:https://stackoverflow.com/questions/39827313/how-to-report-grammar-ambiguity-in-antlr4

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!