AST Rewriting

20 Nov 2012

      Hi Everyone,

I am trying to build a function to rewrite and AST. I have and AST which is
designed to represent a computation graph. I will present a simplified
version here designed to illustrate the problem. I have tried numerous ways
of rewriting it including uniplate, recursion and Edward Kmett's
implementation of plate in his lens package.

My AST is defined using GADTs as follows:

class (ReflectDescriptor a, Typeable a, Wire a) => ProtoBuf a

data Expression a b where
  OpenTable :: (ProtoBuf b) => Int -> Table -> Expression () b
  OpenFile :: (ProtoBuf b) => Int -> String -> Expression () b
  WriteFile :: (Typeable a, ProtoBuf b) => Int -> String -> Expression a b
-> Expression b ()
  WriteTable :: (Typeable a, ProtoBuf b) => Int -> Table -> Expression a b
-> Expression b ()
  Map :: (ProtoBuf a, ProtoBuf b, ProtoBuf c) => Int -> (a -> b) ->
Expression c a -> Expression a b
  LocalMerge :: (ProtoBuf a) => Int -> [Expression c a] -> Expression c a

The user can create code inside a Monad Transformer like so:

q <- query $ do
         table <- openTable myTable
         transform <- map someFunc table
         writeTable otherTable transform

As part of this language the compiler I am building would need to for
instance transform OpenTable into a series OpenFile nodes with a LocalMerge
to merge the results together.

So uniplate cannot work over GADTs if I recall correctly.

I exchanged emails with Edward and he explained that for the lens case I
would need something like an indexed lens family from his indexed package
which is not implemented yet but which may be in the future.

The issue with recursion is that as you recurse through the AST the a b on
the Expression change and GHC cannot compile it because it wants the a b to
be the same on each recursive call.

My question to the Haskell community is how might one develop AST rewriting
functionality. One possible solution is stripping the types away from GHC
and doing all the type checking myself. That doesn't seem very good.

Another possibility that I have looked at was using hoopl. It seems very
compatible given that it is built for describing and optimizing data flow
which I am doing however the learning curve looks quite steep. I have been
reluctant so far to invest the time in it.

Has anyone developed something similar? What recommendations do you have?

Thanks.

Steve

Steve Severance

Alexander Solla

Emil Axelsson

Sean Leather

Steve Severance

Gershom Bazerman

tags

participants (5)