Re: UTF-8 decoding

20 Feb 2006

      Marcin 'Qrczak' Kowalczyk wrote:
...
John Meacham  writes:
...
Another possibility is quasi-utf8 encoding. where it passes through any
invalid utf8 sequences as latin1 characters. in practice, this works
very well as interpreting both latin1 and utf8 transparently but is
more than somewhat hacky.
It would not be reliable. I'm strongly against that: it gives an
illusion that Latin1 works, but it breaks in very rare cases.
I tend to agree with Marcin here - that doesn't sound like a good solution.

Incedentally, we do ignore encoding errors in comments (more by accident 
than by design, though :-).

Cheers,
	Simon

Re: UTF-8 decoding

Simon Marlow