[Haskell-cafe] Re: String vs ByteString

Colin Paul Adams colin at colina.demon.co.uk
Tue Aug 17 07:51:32 EDT 2010


>>>>> "Ivan" == Ivan Lazar Miljenovic <ivan.miljenovic at gmail.com> writes:

> Char is not an encoding, right?

    Ivan> No, but in GHC at least it corresponds to a Unicode codepoint.

I don't think this is right, or at least it shouldn't be. Surely a Char
stands for a character. Unicode code points include ones that are not
characters, such as the surrogate code points (U+D800 to U+DFFF) that
UTF-16 uses to encode each non-BMP code point as a pair of 16-bit code
units.

I don't think you ought to be able to see a surrogate codepoint as a Char.
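
To make the concern concrete, here is a minimal sketch, assuming GHC and
the Data.Char module from base. It shows that a surrogate code point can
currently be held in a Char, and what a constructor restricted to Unicode
scalar values might look like. The names isSurrogate and scalarChr are
hypothetical helpers made up for this illustration; they are not part of
base.

```haskell
import Data.Char (GeneralCategory (..), chr, generalCategory, ord)

-- True for code points in the UTF-16 surrogate range U+D800..U+DFFF.
-- (Hypothetical helper, defined here via the general category.)
isSurrogate :: Char -> Bool
isSurrogate c = generalCategory c == Surrogate

-- Hypothetical stricter constructor: accepts only Unicode scalar values,
-- i.e. any code point in 0..0x10FFFF except the surrogates.
scalarChr :: Int -> Maybe Char
scalarChr n
  | n < 0 || n > 0x10FFFF      = Nothing
  | n >= 0xD800 && n <= 0xDFFF = Nothing
  | otherwise                  = Just (chr n)

main :: IO ()
main = do
  let c = chr 0xD800            -- accepted by GHC: Char covers all code points
  print (ord c)                 -- 55296
  print (isSurrogate c)         -- True
  print (scalarChr 0xD800)      -- Nothing
  print (scalarChr 0x1D11E == Just '\x1D11E')  -- True: a non-BMP scalar value
```

Whether Char itself ought to reject surrogates is the design question
here; the sketch only shows that the check is cheap to express.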
-- 
Colin Adams
Preston Lancashire
()  ascii ribbon campaign - against html e-mail
/\  www.asciiribbon.org   - against proprietary attachments

