
On Wed, Oct 20, 2010 at 5:11 PM, Tyson Whitehead
I read the wikipedia article on code points, but still do not feel I have a firm grasp as to what exactly you are referring to.
If you have a few minutes, would you mind providing a short example to clarify this with a specific example (e.g., a specific code point that gives issues with a 1:1 model and what those issues are).
Have a look at combining characters: http://en.wikipedia.org/wiki/Combining_character For example, a Danish user would consider the single Unicode code point A-RING the same as the two code points A + COMBINING RING. If you process a string code point by code point you might mistakenly confuse a plain "a" (A) with a "å" (A-RING *or* A + COMBINING RING). Johan