package cairn

A derivation explorer and logger for menhir parser

Sources

cairn-1.0.tbz
sha256=1ec8a04fa25fbd0aeef319aecd4ced16cac14b16456a812a69282db415e54258
sha512=c01dce833ac9155b732930e87dc799a161ee2313f48746afa5848c7f0c8eef47dc63f0508aad72240c0a7226dad298dc11a90d8ad86fb854c4143bbe70511d1f

Description

A small library that takes a parser generated by menhir, logs its execution and provides an interactive terminal user interface to explore it.

Published: 02 Aug 2023

README

Cairn - A derivation explorer and logger for Menhir parser

Cairn is a small library meant to be used together with the Menhir parser generator. It provides a step-by-step log of the execution of the LR(1) automaton on its input, so that you can visualize the sequence of partial derivations produced, up to the final derivation. It is intended as a teaching (or self-teaching) tool for understanding how LR(1) parsers behave.
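To make "sequence of partial derivations" concrete, here is a minimal hand-written sketch, independent of Cairn and Menhir. It assumes a toy grammar E -> E + n | n and a naive shift-reduce loop with no automaton states, no lookahead and no error handling; every name in it is hypothetical. It only illustrates the kind of step-by-step trace being described, not how Cairn produces it.

```ocaml
(* Toy shift-reduce trace for the grammar  E -> E + n | n.
   Hand-written for illustration; it does not use Menhir or Cairn. *)

type tree = Leaf of string | Node of string * tree list

(* Render a partial derivation tree as text. *)
let rec show = function
  | Leaf s -> s
  | Node (name, kids) ->
    name ^ "(" ^ String.concat " " (List.map show kids) ^ ")"

(* One logged step: the action taken and the stack afterwards,
   printed with the bottom of the stack first. *)
let log_step action stack =
  action ^ ": " ^ String.concat " | " (List.rev_map show stack)

(* The stack is a list whose head is the top of the stack.
   Reduce whenever the top of the stack matches a right-hand side,
   otherwise shift the next token. Malformed input is simply shifted.
   Returns the log in execution order. *)
let rec run stack input log =
  match stack, input with
  | Leaf n :: Leaf "+" :: (Node ("E", _) as e) :: rest, _ when n <> "+" ->
    let stack = Node ("E", [e; Leaf "+"; Leaf n]) :: rest in
    run stack input (log_step "reduce E -> E + n" stack :: log)
  | [Leaf n], _ when n <> "+" ->
    let stack = [Node ("E", [Leaf n])] in
    run stack input (log_step "reduce E -> n" stack :: log)
  | _, tok :: input ->
    let stack = Leaf tok :: stack in
    run stack input (log_step ("shift " ^ tok) stack :: log)
  | _, [] -> List.rev log

let trace = run [] ["1"; "+"; "2"] []
let () = List.iter print_endline trace
```

Each printed line is one step: the stack holds the partial derivation trees built so far, and the last line holds the final derivation.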

It can either write its results to log files or launch a small terminal explorer of the derivation (in UTF-8), built with lambda-term.

It is not meant to be run on very long inputs: the trees would be too large to be displayed (and understood) properly, and because the tool stores every step of the parser's execution, a long run would be heavy on memory. It has been tested on inputs of several hundred tokens without memory issues.

Install

From source - globally

  • dune build

  • dune install

This will install Cairn in your current opam installation. You can then use the cairn library in your projects.

From source - locally

Simply copy the src directory into your project and add the cairn library wherever you want to use it.

Usage

To use Cairn, you must first write a Menhir grammar. The grammar must be compiled with the options --table --inspection --cmly (to produce the incremental API and the cmly file that Cairn needs as input). The option --compile-errors may also be used to produce the file containing the error messages displayed by the parser (which Cairn also uses).

Then, in the file where you want to use your parser, you must create two modules: one of type parser_decorated (see Parsing.mli), which contains the modules produced by Menhir plus some options, and one of type MenhirSdk.Cmly_api.GRAMMAR, obtained by reading the cmly file produced by Menhir (see MenhirSdk for more details on that).

You then create the Cairn module using the functor Parsing.Make or Parsing.MakeWithDefaultMessage. This module provides parsing functions that work like Menhir's parsing functions, but that log and/or display the execution of your parser.

Cairn also lets you get the log as an abstract object (ParserLog.configuration list), which you can then pass to the functions of ParserLog if you need more customized behavior.

Cairn recognizes two attributes that you can add to the Menhir grammar file:

  • short, which lets you give a shorter or more explicit name to a terminal or non-terminal to improve the readability of the trees.

  • backtrack, which is used by the naive error recovery mechanism. When an error occurs, the parser pops the stack until it reaches an element carrying the backtrack attribute, then discards input tokens until it can shift one, at which point parsing resumes.
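The two phases of this recovery (pop, then skip) can be sketched as a toy model over plain lists. This is not Cairn's implementation: the elt record, the can_shift predicate and all function names are hypothetical, and the sketch assumes that the element marked backtrack stays on the stack.

```ocaml
(* Toy model of a "pop, then skip" recovery strategy.
   All names here are hypothetical; this is not Cairn's implementation. *)

(* A stack element, flagged when its symbol carries the backtrack attribute. *)
type elt = { name : string; backtrack : bool }

(* Pop elements (head = top of stack) until one marked backtrack is on top.
   Assumption: the marked element itself is kept.
   Returns None when no such element exists. *)
let rec pop_to_backtrack = function
  | [] -> None
  | e :: _ as stack when e.backtrack -> Some stack
  | _ :: rest -> pop_to_backtrack rest

(* Drop input tokens until one satisfies [can_shift]. *)
let rec skip_until can_shift = function
  | tok :: _ as input when can_shift tok -> input
  | _ :: rest -> skip_until can_shift rest
  | [] -> []

(* Combine both phases: the configuration from which parsing would resume.
   [can_shift top tok] says whether [tok] can be shifted on top of [top]. *)
let recover ~can_shift stack input =
  match pop_to_backtrack stack with
  | Some (top :: _ as stack) -> Some (stack, skip_until (can_shift top) input)
  | Some [] | None -> None

(* Example: an error occurs while parsing an expression inside a statement. *)
let result =
  recover
    ~can_shift:(fun top tok -> top.name = "stmt" && tok = ";")
    [ { name = "expr"; backtrack = false };
      { name = "+"; backtrack = false };
      { name = "stmt"; backtrack = true };
      { name = "block"; backtrack = false } ]
    [ "*"; ")"; ";"; "x" ]
```

In this example the two expression-level elements are popped, the tokens "*" and ")" are discarded, and parsing would resume with "stmt" on top of the stack and ";" as the next token.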

Known limitations and issues

  • Cairn has only been developed and tested on Linux. It might fail on other systems.

  • The error recovery mechanism (activated with the value PopFirst for error_strategy) is very basic and might be of little use as is. It might be improved in the future.

  • The lookahead display shows only the text parsed, not the corresponding token. This is due to a limitation in Menhir: there is no way to get the terminal corresponding to a token, and only terminals, not tokens, can be converted to their names.

  • Handling the cmly file is quite tricky, as it must be generated by menhir and then used by the program at execution time. One way to deal with this properly is shown in the examples (with ocaml-crunch), but other solutions might exist. To my knowledge, it is not possible to generate the Grammar module directly from the menhir invocation, which would be easier.

Possible improvements

  • Improving display (e.g. navigating stacks and state items, etc.).

  • Showing LR1 objects instead of LR0 ones.

  • Improving the error recovery mechanism and introducing other ones (limited by Menhir not providing a way to get the terminal from a token).

  • Others you might suggest.

Authors

  • Vincent Penelle, University of Bordeaux

Dependencies (9)

  1. zed >= "3.0.0"
  2. crunch
  3. lambda-term
  4. menhir >= "20230608"
  5. ocaml >= "4.08"
  6. dune >= "2.9"
  7. ocaml
  8. menhirLib
  9. menhirSdk

Dev Dependencies (1)

  1. odoc with-doc

Used by

None

Conflicts

None