[Haskell-cafe] Re: String vs ByteString
Colin Paul Adams
colin at colina.demon.co.uk
Tue Aug 17 07:51:32 EDT 2010
>>>>> "Ivan" == Ivan Lazar Miljenovic <ivan.miljenovic at gmail.com> writes:
> Char is not an encoding, right?
Ivan> No, but in GHC at least it corresponds to a Unicode codepoint.
I don't think this is right, or shouldn't be right, anyway.. Surely it
stands for a character. Unicode codepoints include non-characters such
as the surrogate codepoints used by UTF-16 to map non-BMP codepoints to
pairs of 16-bit codepoints.
I don't think you ought to be able to see a surrogate codepoint as a Char.
--
Colin Adams
Preston Lancashire
() ascii ribbon campaign - against html e-mail
/\ www.asciiribbon.org - against proprietary attachments
More information about the Haskell-Cafe
mailing list