30 Jan
2005
30 Jan
'05
7:31 a.m.
The various UTF encodings do not have this particular problem; if a UTF string is valid, then it is a unique representation of a unicode string. However, decoding is still a partial function and can fail.
And while it is partly true, it is qualified by the problems relative to canonicalization (an "é" in Unicode can both be represented as "é" or as two chars (an e and an accent) and they should (ideally) compare equal). Stefan