
From: libraries-bounces@haskell.org
If someone is interested in suppourting UTF8, I recommend to use oniguruma.
http://www.geocities.jp/kosako3/oniguruma/
Oniguruma also supports UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE, etc .... And it is portable, it's available both on Unix and Windows.
So I think it is best regex C library to choose backend.
Sorry, I didn't explain this so well. I mean an decoder to marshal a C-string that I know is UTF8 into a Haskell String (i.e. [Char]). An FFI call out to C might be convenient, but will have overhead. It's not that hard to write a UTF8 decoder (and encoder) in Haskell; I just wanted to avoid wasted work. Alistair ***************************************************************** Confidentiality Note: The information contained in this message, and any attachments, may contain confidential and/or privileged material. It is intended solely for the person(s) or entity to which it is addressed. Any review, retransmission, dissemination, or taking of any action in reliance upon this information by persons or entities other than the intended recipient(s) is prohibited. If you received this in error, please contact the sender and delete the material from any computer. *****************************************************************