UTF-8 decoding error

Ross Paterson ross at soi.city.ac.uk
Thu Sep 21 04:36:01 EDT 2006


On Thu, Sep 21, 2006 at 10:15:45AM +0200, Christian Maeder wrote:
> How does haddock handle characters in comments?

Section 3.8.3 of the Haddock manual:

	3.8.3. Character references

	Although Haskell source files may contain any character from the
	Unicode character set, the encoding of these characters as bytes
	varies between systems, so that only source files restricted to
	the ASCII character set are portable. Other characters may be
	specified in character and string literals using Haskell character
	escapes. To represent such characters in documentation comments,
	Haddock supports SGML-style numeric character references of the
	forms &#D; and &#xH; where D and H are decimal and hexadecimal
	numbers denoting a code position in Unicode (or ISO 10646).
	For example, the references &#x3BB;, &#x3bb; and &#955; all
	represent the lower-case letter lambda.

Not pretty, but it is portable and not limited to the Latin-1 subset.
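
To illustrate, here is a minimal sketch of how this might look in practice.
The names escapeNonAscii and identity are made up for this example and are
not part of Haddock; the helper only shows one way to produce the &#xH; form
from a string, and the comment on identity shows a reference used inside a
documentation comment so that the source file stays pure ASCII.

```haskell
module Main where

import Data.Char (isAscii, ord)
import Numeric (showHex)

-- | Replace every non-ASCII character with a hexadecimal character
-- reference of the form &#xH;. ASCII characters pass through unchanged.
-- (Hypothetical helper for illustration, not a Haddock function.)
escapeNonAscii :: String -> String
escapeNonAscii = concatMap escape
  where
    escape c
      | isAscii c = [c]
      | otherwise = "&#x" ++ showHex (ord c) ";"

-- | The identity function, written &#955;x. x in lambda notation.
-- The reference stands in for the lambda character, so this comment
-- contains only ASCII bytes.
identity :: a -> a
identity x = x

main :: IO ()
main = putStrLn (escapeNonAscii "\955-calculus")
-- prints: &#x3bb;-calculus
```

The string literal uses the Haskell escape \955 for the same reason: the
source stays ASCII whatever encoding the system assumes.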



More information about the Glasgow-haskell-users mailing list