[Haskell-cafe] Data.Binary suboptimal instance
Henning Thielemann
lemming at henning-thielemann.de
Sat May 23 12:00:25 EDT 2009
Khudyakov Alexey schrieb:
> On Friday 22 May 2009 23:34:50 Henning Thielemann wrote:
>>> So lately I've been working on a little program to generate trippy
>>> graphics. (Indeed, some of you may remember it from a few years back...)
>>> Anyway, getting to the point, I just restructured my program. As part of
>>> the restructuring, I got rid of all the jiggery-pokery with
>>> Data.Array.Storable and so forth and decided to use Data.Binary.
>>>
>>> My first problem was quite simple. I want to write a program that reads
>>> an item from the input file, processes it, writes it to the output file,
>>> and repeats until the input file is empty. Data.Binary doesn't appear to
>>> provide any obvious way to do this. You can't be in the Get and Put
>>> monads simultaneously, and I can't figure out how to interleave them.
>> You can! - It's again time to point out that Put shouldn't be a monad, but
>> a monoid. But as it is, Put is a Writer monad on top of the Builder
>> monoid. Better use that Builder monoid directly.
>
> Could you elaborate? I didn't quite understand.
>
>
> Anyway I had similar problem and simply wrote few functions. They
> encode/decode values of same type element by element. It's lazy enough so code
> could be written in following style:
>
>> process :: [Foo] -> [Bar]
>>
>> foo = readFile name
>> >>= writeFile out . encodeStream . process . decodeProcess
>
> There is a code. It is fast and worked for me without a problem.
>
>> -- | Decode records in repetition
>> decodeStream :: Binary a => ByteString -> [a]
>> decodeStream = runGet (getStream get)
>>
>> -- | Encode list of records as bytestring
>> encodeStream :: Binary a => [a] -> ByteString
>> encodeStream = runPut . putStream put
I think the list should be avoided at all costs, because it is so slow.
I don't know if it is fused away by clever optimizer rules in the binary
package. Anyway, you can treat a Builder like a list. Just replace (++)
by 'mappend' and [] by 'mempty'.
It should work like this:
do a <- getWord8
b <- getWord8
return $ Builder.singleton a `mappend` Builder.singleton b
More information about the Haskell-Cafe
mailing list