Interfaces: More flexible abstract syntax binding instead of `define`?

Open andreasabel opened this issue 6 years ago • 1 comments

This is an issue to contemplate a more flexible handling of LBNF rules in the backend. Currently there is a define pragma that allows rule names to be first-order functions over other rule names. However, this could be obsolete if we produced only an interface for abstract syntax (with a standard implementation) that could be implemented in different ways. The interface would defined methods to construct abstract syntax and a view on abstract syntax for the pretty printer.

For example, consider the (ambiguous) LBNF grammar:

EInt.   Exp ::= Integer ;
EMinus. Exp ::= Exp "-" Exp ;
EStm.   Exp ::= "{" Stm ";" Exp "}" ;

SExp.   Stm ::= Exp ";" ;
SWhile. Stm ::= "while" "(" Exp ")" Stm ;

In Haskell, this could be represented by the following abstract syntax:

{-# LANGUAGE LambdaCase #-}
{-# LANGUAGE MultiParamTypeClasses #-}
{-# LANGUAGE FunctionalDependencies #-}

-- * Model class

class Exp s e | e -> s where  -- FunctionalDependency needed
  eInt   :: Integer -> e
  eMinus :: e -> e -> e
  eStm   :: s -> e -> e

class Stm s e where -- | s -> e where  -- FunctionalDependency not needed
  sExp   :: e -> s
  sWhile :: e -> s -> s

-- ** View

class ViewExp s e where
  viewE  :: e -> ExpView s e

class ViewStm s e where
  viewS  :: s -> StmView s e

data ExpView s e
  = EInt Integer
  | EMinus e e
  | EStm s e

data StmView s e
  = SExp e
  | SWhile e s

-- * Initial syntax

data IExp
  = IEInt Integer
  | IEMinus IExp IExp
  | IEStm IStm IExp

data IStm
  = ISExp IExp
  | ISWhile IExp IStm

-- ** Constructors

instance Exp IStm IExp where
  eInt   = IEInt
  eMinus = IEMinus
  eStm   = IEStm

instance Stm IStm IExp where
  sExp   = ISExp
  sWhile = ISWhile

-- ** Views

instance ViewExp IStm IExp where
  viewE = \case
    IEInt i      -> EInt i
    IEMinus e e' -> EMinus e e'
    IEStm s e    -> EStm s e

instance ViewStm IStm IExp where
  viewS = \case
    ISExp e     -> SExp e
    ISWhile e s -> SWhile e s

One could also make an instance that directly evaluates programs:

instance Exp () Integer where
  eInt   = id
  eMinus = (-)
  eStm _ = id

instance Stm () Integer where
  sExp _ = ()
  sWhile _ _ = ()

Basically, this is generating a standard interface to what are semantic actions of conventional parser generators.

Advantages:

subsumes define (see also #266)
subsumes the undocumented views feature
allows to plug a custom implementation of parts of the abstract syntax, like the booleans of the host language for a type of booleans (see #231), or a different implementation of lists, etc.
allows any semantic actions in the parser (would need a monadic interface for effectful actions in Haskell)

Issue:

semantic actions are physically separated from grammar (in other file)

This can be an issue, but also an advantage, since the grammar can be studied by itself without being interleaved with the semantic actions. This is in the spirit of BNFC.

Disadvantage:

If one uses a custom implementation of AST and changes the grammar, one has to replay the changes on ones custom AST implementation. This is not an issue with define.

Nov 13 '19 10:11 andreasabel

I love when one solution addresses many needs!

Would this be an alternative to the GADT and functor implementations? Or would those become alternative default implementations under this design?

I presume this design will allow for position information, is that correct?

In terms of ease of use (edit: and/or error handling, printing, etc.) how do you think it compares with, say, using the current design and splitting the interpreter/compiler up into two parts:

A function that converts from AST to desired data structure/types.
Another that converts from that structure to…
- (compiler) generated code, or
- (interpreter) function taking input and producing output or actions

Oct 19 '21 12:10 ScottFreeCode