"Stuart" == Stuart Cook  writes:
Stuart> On Thu, Aug 20, 2009 at 4:28 PM, Colin Paul
    Stuart> Adams wrote:
    >> But how do you get Latin-1 bytes from a Unicode string? This
    >> would need a transcoding process.
    Stuart> The first 256 code-points of Unicode coincide with
    Stuart> Latin-1. Therefore, if you truncate Unicode characters
    Stuart> down to 8 bits you'll effectively end up with Latin-1 text
    Stuart> (except that any code points above U+00FF will give
    Stuart> strange results).
    Stuart> If your terminal then interprets these bytes as UTF-8 (or
    Stuart> anything else, really), the result will be gibberish or
    Stuart> worse.
Yes, but surely this will work both ways. The same bytes on input
should come back on output, shouldn't they?
-- 
Colin Adams
Preston Lancashire