UTF-8 decoding error
Ross Paterson
ross at soi.city.ac.uk
Thu Sep 21 04:36:01 EDT 2006
On Thu, Sep 21, 2006 at 10:15:45AM +0200, Christian Maeder wrote:
> How does haddock handle characters in comments?
Section 3.8.3 of the Haddock manual:
3.8.3. Character references
Although Haskell source files may contain any character from the
Unicode character set, the encoding of these characters as bytes
varies between systems, so that only source files restricted to
the ASCII character set are portable. Other characters may be
specified in character and string literals using Haskell character
escapes. To represent such characters in documentation comments,
Haddock supports SGML-style numeric character references of the
forms &#D; and &#xH; where D and H are decimal and hexadecimal
numbers denoting a code position in Unicode (or ISO 10646).
For example, the references λ, λ and λ all
represent the lower-case letter lambda.
Not pretty, but it is portable and not limited to the Latin-1 subset.
More information about the Glasgow-haskell-users
mailing list