
27 Sep
2007
27 Sep
'07
7:15 p.m.
In message
On Thu, 27 Sep 2007, Ross Paterson wrote:
Combining characters are not an issue here, just the surrogate pairs, because we're discussing representations of sequences of Chars (Unicode code points).
I dislike referring to unicode code points as "characters" because that tends to imply a lot of invalid simplifications.
Just to be pedantic, Ross did say Char not character. A Char is defined in the Haskell report as a Unicode code point. As you say, that does not directly correspond to what many people think of as a character due to combining characters etc. Duncan