
27 Sep
2007
27 Sep
'07
6:55 a.m.
On Wed, Sep 26, 2007 at 11:25:30AM +0100, Tony Finch wrote:
On Wed, 26 Sep 2007, Aaron Denney wrote:
It's true that time-wise there are definite issues in finding character boundaries.
UTF-16 has no advantage over UTF-8 in this respect, because of surrogate pairs and combining characters.
Combining characters are not an issue here, just the surrogate pairs, because we're discussing representations of sequences of Chars (Unicode code points).