[Haskell-cafe] Re: Bytestrings and [Char]

Heinrich Apfelmus apfelmus at quantentunnel.de
Tue Mar 23 05:32:16 EDT 2010


Ivan Miljenovic wrote:
> Johan Tibell wrote:
>> A sequence of bytes is not the same thing as a sequence of Unicode
>> code points. If you want to replace String by something more efficient
>> have a look at Data.Text.
> 
> Though Data.Text still has the disadvantage of not being as nice to
> deal with as String, since you can't pattern match on it, etc.
> 
> Whilst it may degrade performance, treating String as a list of
> characters rather than an array provides you with greater flexibility
> of how to deal with it.

Indeed. In particular,  [Char]  and  Data.Text.Text  have different
semantics and performance.

For instance,

    cons :: Char -> Text -> Text

is O(n) whereas  (:)  is O(1). (Not sure whether stream fusion for  cons
 can change that at times.)

Furthermore, certain programs relying on laziness, like

    ahh = 'a' : ahh

    fibs = ' ' : zipWith plus fibs (tail fibs)
        where plus x y = toEnum $ fromEnum x + fromEnum y

are not possible with  Data.Text.Text . (Whether you really need these
is another question, of course.)


Regards,
Heinrich Apfelmus

--
http://apfelmus.nfshost.com



More information about the Haskell-Cafe mailing list