
Here's a summary of the state of Unicode support in GHC and other compilers. There are several aspects:
- Can the Char type hold the full range of Unicode characters? This has been true in GHC for some time, and is now true in Hugs. I don't think it's true in nhc98 (please correct me if I'm wrong).
- Do the character class functions (isUpper, isAlpha etc.) work correctly on the full range of Unicode characters? This is true in Hugs. It's true with GHC on some systems (basically we were lazy and used the underlying C library's support here, which is patchy).
- Can you use (some encoding of) Unicode for your Haskell source files? I don't think this is true in any Haskell compiler right now. Well, even if hbc is mostly dead I must point out that it has supported
Simon Marlow wrote: this since Unicode was first added to Haskell. As well as the point above, of course. If the GHC implementors feel lazy they can always borrow the Unicode (plane 0) description table from HBC. It is a 64k file. -- Lennart