UTF-8 decoding error

Christian Maeder maeder at tzi.de
Thu Sep 21 05:49:47 EDT 2006


currently haddock correctly translates latin1 chars, ie. äöü to
äöü

So it would be nice if also ghc-6.6 could remain backward compatible by
supporting latin1 sources.

Christian


Ross Paterson schrieb:
> On Thu, Sep 21, 2006 at 10:15:45AM +0200, Christian Maeder wrote:
>> How does haddock handle characters in comments?
> 
> Section 3.8.3 of the Haddock manual:
> 
> 	3.8.3. Character references
> 
> 	Although Haskell source files may contain any character from the
> 	Unicode character set, the encoding of these characters as bytes
> 	varies between systems, so that only source files restricted to
> 	the ASCII character set are portable. Other characters may be
> 	specified in character and string literals using Haskell character
> 	escapes. To represent such characters in documentation comments,
> 	Haddock supports SGML-style numeric character references of the
> 	forms &#D; and &#xH; where D and H are decimal and hexadecimal
> 	numbers denoting a code position in Unicode (or ISO 10646).
> 	For example, the references λ, λ and λ all
> 	represent the lower-case letter lambda.
> 
> Not pretty, but it is portable and not limited to the Latin-1 subset.


More information about the Glasgow-haskell-users mailing list