[Haskell-cafe] very strange utf8 problem

David Menendez dave at zednenem.com
Mon Feb 1 12:22:07 EST 2010

2010/2/1 Günther Schmidt <gue.schmidt at web.de>:
> Hi all,
> I know this sounds daft but I do have good reason to ask.
> Is it possible that GHC's core itself has a problem with a particular Umlaut
> only?
> HDBC-ODBC won't read in data from an SQLite database as soon as it comes
> accross a *lowercase* U-Umlaut ("ü") ghci crashes. Other Umlauts ("ä", "ö"
> and "ß") pass however.
> This is the error message:
>  readUTF8Char: illegal UTF-8 character 252
> As I said, other Umlauts do pass.

I suspect something is trying to read ISO-Latin-1 data as UTF-8. 252
is the Unicode and Latin-1 code point for "ü", but in UTF-8 it's
written in two bytes as 0xC3BC.

Dave Menendez <dave at zednenem.com>

More information about the Haskell-Cafe mailing list