Hmm, that's an interesting idea, to just make benchmarks a different
testsuite protocol. Certainly we could prototype it like this.
We should try to separate concreate .cabal syntax from implementation. We could definitely try to share much of the implementation, I'm not so sure we want to share the concrete syntax.