[Haskell-cafe] Re: String vs ByteString

Johan Tibell johan.tibell at gmail.com
Tue Aug 17 07:08:47 EDT 2010


On Tue, Aug 17, 2010 at 12:39 PM, Bulat Ziganshin <bulat.ziganshin at gmail.com
> wrote:

> Hello Tom,
>
> Tuesday, August 17, 2010, 2:09:09 PM, you wrote:
>
> > In the first iteration of the Text package, UTF-16 was chosen because
> > it had a nice balance of arithmetic overhead and space.  The
> > arithmetic for UTF-8 started to have serious performance impacts in
> > situations where the entire document was outside ASCII (i.e. a Russian
> > or Arabic document), but UTF-16 was still relatively compact
>
> i don't understand what you mean. are you support all 2^20 codepoints
> in Data.Text package?
>

Yes, UTF-16 can represent all Unicode code points, using surrogate pairs.

-- Johan
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.haskell.org/pipermail/haskell-cafe/attachments/20100817/c839ca69/attachment.html


More information about the Haskell-Cafe mailing list