
20 Feb
2006
20 Feb
'06
10:23 a.m.
Marcin 'Qrczak' Kowalczyk wrote:
John Meacham
writes: Another possibility is quasi-utf8 encoding. where it passes through any invalid utf8 sequences as latin1 characters. in practice, this works very well as interpreting both latin1 and utf8 transparently but is more than somewhat hacky.
It would not be reliable. I'm strongly against that: it gives an illusion that Latin1 works, but it breaks in very rare cases.
I tend to agree with Marcin here - that doesn't sound like a good solution. Incedentally, we do ignore encoding errors in comments (more by accident than by design, though :-). Cheers, Simon