[Haskell-cafe] Efficient matrix multiply using accelerate