[Haskell-cafe] Re: String vs ByteString

Johan Tibell johan.tibell at gmail.com
Sun Aug 15 03:40:02 EDT 2010


Hi Colin,

On Sun, Aug 15, 2010 at 9:34 AM, Colin Paul Adams
<colin at colina.demon.co.uk> wrote:

> But UTF-16 (apart from being an abomination for creating a hole in the
> codepoint space and making it impossible to ever extend it) is slow to
> process compared with UTF-32 - you can't get the nth character in
> constant time, so it seems an odd choice to me.
>

Aside: Getting the nth character isn't very useful when working with Unicode
text:

* Most text processing is linear.
* What we consider a character and what Unicode considers a character
differ a bit, e.g. because Unicode uses combining characters.
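
To illustrate the combining-character point, here is a small sketch using
only Data.Char from base. The names precomposed, decomposed and
approxVisibleLength are made up for this example, and dropping combining
marks is only a rough stand-in for proper grapheme cluster segmentation,
not a full implementation of it:

```haskell
import Data.Char (isMark)

-- "é" as a single precomposed code point (U+00E9).
precomposed :: String
precomposed = "\x00E9"

-- The same visible "é" as 'e' followed by COMBINING ACUTE ACCENT (U+0301).
decomposed :: String
decomposed = "e\x0301"

-- Rough approximation of user-perceived characters: count the code points
-- that are not combining marks. This is an assumption made for the
-- example; real segmentation has more rules than this.
approxVisibleLength :: String -> Int
approxVisibleLength = length . filter (not . isMark)

main :: IO ()
main = do
  print (length precomposed)             -- 1
  print (length decomposed)              -- 2
  print (decomposed !! 1 == '\x0301')    -- True: index 1 is the accent alone
  print (approxVisibleLength decomposed) -- 1
```

Even with one code point per array slot, as in UTF-32 or a Haskell String,
indexing gives you the nth code point, which here is a bare accent rather
than something a reader would call a character.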

Cheers,
Johan

