
On Sun April 27 2008 2:02:25 pm Don Stewart wrote:
zefria:
In GHC there's a GHC.Unicode library, but for a string such as " *AIOO", a GHC compiled program prints it as a string of unknown characters, and in the interpreter, the string evaluates to a string of escape sequences instead of displaying properly.
Is there a way to get/activate unicode support in GHC?
GHC supports unicode internally, and String and Char are all unicode.
To do unicode IO however, you need to use the utf8-string package:
http://hackage.haskell.org/cgi-bin/hackage-scripts/package/utf8-string
Just import and use IO functions from System.IO.UTF8, and that's it!
That's a wonderful interface, but unfortunately it appears to assume that your Unicode I/O is always UTF-8, and never UTF-16. I happen to deal with more UTF-16 data than UTF-8 over here at the moment. (Did I mention UTF-7 or UTF-EBCDIC? horrors...) There is a Haskell binding for iconv for those that need UTF-16, but it doesn't appear to have as convenient an interface. -- John