
9 Jul
2007
9 Jul
'07
7:43 a.m.
On Sun, 8 Jul 2007, Eric Mertens wrote:
I'd like to announce that I have posted a UTF-8 encoding/decoding library to hackage. This library also includes replacements for most of the System.IO namespace under System.IO.UTF8. This library detects overlong sequences, and replaces invalid code-points and invalid encodings with the replacement character '\xfffd'.
utf8-string can be found on hackage: http://hackage.haskell.org/cgi-bin/hackage-scripts/package/utf8-string-0.1
HXT returns a list of warnings for invalid UTF-8 byte sequences: http://www.fh-wedel.de/~si/HXmlToolbox/hdoc_arrow/Text-XML-HXT-DOM-Unicode.h... Is your decoder lazy?