UTF-8 decoding

Mon Feb 20 10:23:55 EST 2006

Marcin 'Qrczak' Kowalczyk wrote:
> John Meacham <john at repetae.net> writes:
> 
> 
>>Another possibility is quasi-utf8 encoding. where it passes through any
>>invalid utf8 sequences as latin1 characters. in practice, this works
>>very well as interpreting both latin1 and utf8 transparently but is
>>more than somewhat hacky.
> 
> 
> It would not be reliable. I'm strongly against that: it gives an
> illusion that Latin1 works, but it breaks in very rare cases.

I tend to agree with Marcin here - that doesn't sound like a good solution.
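
To make the failure mode concrete, here is a minimal sketch in Python of the
"quasi-utf8" idea as John describes it. This is only an illustration, not
anything GHC does: the name quasi_utf8_decode is hypothetical, and the sketch
assumes that "invalid sequence" means each offending byte is passed through
on its own as a Latin-1 character.

```python
def quasi_utf8_decode(data: bytes) -> str:
    """Decode UTF-8, passing each invalid byte through as a Latin-1 character.

    Hypothetical helper for illustration only.
    """
    out = []
    i = 0
    while i < len(data):
        try:
            # Try to decode everything that remains as strict UTF-8.
            out.append(data[i:].decode("utf-8"))
            break
        except UnicodeDecodeError as err:
            # err.start is the offset of the first bad byte within the slice.
            bad = i + err.start
            # Everything before the bad byte is valid UTF-8.
            out.append(data[i:bad].decode("utf-8"))
            # Treat the bad byte as a Latin-1 character (same code point).
            out.append(chr(data[bad]))
            i = bad + 1
    return "".join(out)


# Typical Latin-1 text is recovered, because a lone 0xE9 is not valid UTF-8.
latin1_ok = quasi_utf8_decode("été".encode("latin-1"))

# Real UTF-8 input is decoded as usual.
utf8_ok = quasi_utf8_decode("été".encode("utf-8"))

# The rare case: the Latin-1 text "Ã©" is the byte pair C3 A9, which is
# also the valid UTF-8 encoding of "é", so it is silently misread.
misread = quasi_utf8_decode("Ã©".encode("latin-1"))
```

Here latin1_ok and utf8_ok both come out as "été", while misread comes out as
"é" rather than the "Ã©" that was written. Nothing signals the difference,
which is the sense in which the scheme only gives an illusion that Latin1
works.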

Incidentally, we do ignore encoding errors in comments (more by accident 
than by design, though :-).

Cheers,
	Simon