Why parsing failed #1808

PavelDymkov · 2022-06-16T08:32:08Z

PavelDymkov
Jun 16, 2022

Code for playground

(function jsonGrammarOnlyExample() {
  const { createToken, EmbeddedActionsParser, Lexer } = chevrotain;

  const LCurly = createToken({ name: "LCurly", pattern: /{/, label: "{" });
  const RCurly = createToken({ name: "RCurly", pattern: /}/, label: "}" });
  const Comma = createToken({ name: "Comma", pattern: /,/, label: "," });
  const Word = createToken({ name: "Word", pattern: /\w+/ });
  const Space = createToken({ name: "Space", pattern: /\s+/ });

  const jsonTokens = [RCurly, LCurly, Comma, Word, Space];
  const JsonLexer = new Lexer(jsonTokens);

  class JsonParser extends EmbeddedActionsParser {
    constructor() {
      super(jsonTokens)

      const $ = this;

      $.RULE("members", () => {
        $.CONSUME(LCurly);

        this.MANY(() => {
          $.OPTION(() => $.CONSUME1(Space));

          $.CONSUME(Word);

          $.OPTION1(() => $.CONSUME2(Space));

          $.CONSUME(Comma);
        });

        $.OPTION2(() => {
          $.OPTION3(() => $.CONSUME3(Space));

          $.CONSUME1(Word);
        });

        $.OPTION4(() => $.CONSUME4(Space));

        $.CONSUME(RCurly);

        return "SUCCESS";
      });

      this.performSelfAnalysis();
    }

  }

  return {
    lexer: JsonLexer,
    parser: JsonParser,
    defaultRule: "members"
  };
}())

Parsing succeed for:

{ X,Y }
{ X, }
{X }
{ X}

But failed for:

{ X, Y }
{ X }

Looks like the problem in OPTION3. But whats wrong?

Answered by msujew

Jun 16, 2022

Hi @PavelDymkov,

When entering the MANY call, Chevrotain will try to identify whether it should actually parse it using it's lookahead algorithm. So it basically compares the next tokens in the input with the possible next tokens in the MANY call. The issue is that the tokens in the MANY call match exactly with the tokens after that. The tokens { ,X, " " can be parsed using either:

this.MANY(() => {
  $.OPTION(() => $.CONSUME1(Space));
  $.CONSUME(Word);
  $.OPTION1(() => $.CONSUME2(Space));
  $.CONSUME(Comma);
});

or

$.OPTION2(() => {
  $.OPTION3(() => $.CONSUME3(Space));
  $.CONSUME1(Word);
});
$.OPTION4(() => $.CONSUME4(Space));

Because Chevrotain will always take the first match for p…

View full answer

NaridaL · 2022-06-16T09:05:51Z

NaridaL
Jun 16, 2022
Collaborator

You can tell chevrotain to ignore the whitespace token, that should make your grammar a lot simpler... https://chevrotain.io/docs/tutorial/step1_lexing.html#skipping-tokens

0 replies

msujew · 2022-06-16T09:08:20Z

msujew
Jun 16, 2022
Collaborator

Hi @PavelDymkov,

When entering the MANY call, Chevrotain will try to identify whether it should actually parse it using it's lookahead algorithm. So it basically compares the next tokens in the input with the possible next tokens in the MANY call. The issue is that the tokens in the MANY call match exactly with the tokens after that. The tokens { ,X, " " can be parsed using either:

this.MANY(() => {
  $.OPTION(() => $.CONSUME1(Space));
  $.CONSUME(Word);
  $.OPTION1(() => $.CONSUME2(Space));
  $.CONSUME(Comma);
});

or

$.OPTION2(() => {
  $.OPTION3(() => $.CONSUME3(Space));
  $.CONSUME1(Word);
});
$.OPTION4(() => $.CONSUME4(Space));

Because Chevrotain will always take the first match for parsing, it will always parse any string which starts with { X using the MANY call and doesn't even try to enter the OPTION2 call. However, since the MANY call expects a Comma at the end, it fails, since there's none in the input stream.

Here's an improved version of this rule:

$.RULE("members", () => {
  $.CONSUME(LCurly);
  $.OPTION(() => {
    $.OPTION1(() => $.CONSUME1(Space));
    $.CONSUME(Word);
    $.OPTION2(() => $.CONSUME2(Space));
    $.MANY(() => {
      $.CONSUME(Comma);
      $.OPTION3(() => $.CONSUME3(Space));
      $.CONSUME1(Word);
      $.OPTION4(() => $.CONSUME4(Space));
    });
    $.CONSUME(RCurly);
  });
return "SUCCESS";

Also please use ignored whitespace tokens, as @NaridaL suggests, it makes grammars a lot simpler 👍

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why parsing failed #1808

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Why parsing failed #1808

PavelDymkov Jun 16, 2022

Replies: 2 comments

NaridaL Jun 16, 2022 Collaborator

msujew Jun 16, 2022 Collaborator

PavelDymkov
Jun 16, 2022

NaridaL
Jun 16, 2022
Collaborator

msujew
Jun 16, 2022
Collaborator