[Haskell-cafe] Data.Binary stack overflow with Data.Sequence String
Gwern Branwen
gwern0 at gmail.com
Tue Mar 3 23:30:24 EST 2009
So recently I've been having issues with Data.Binary & Data.Sequence;
I serialize a 'Seq String'
You can see the file here: http://code.haskell.org/yi/Yi/IReader.hs
The relevant function seems to be:
-- | Read in database from 'dbLocation' and then parse it into an 'ArticleDB'.
readDB :: YiM ArticleDB
readDB = io $ (dbLocation >>= r) `catch` (\_ -> return empty)
where r x = fmap (decode . BL.fromChunks . return) $ B.readFile x
-- We read in with strict bytestrings to guarantee the
file is closed,
-- and then we convert it to the lazy bytestring
data.binary expects.
-- This is inefficient, but alas...
My current serialized file is about 9.4M. I originally thought that
the issue might be the recent upgrade in Yi to binary 0.5, but I
unpulled patches back to past that, and the problem still manifested.
Whenever yi tries to read the articles.db file, it stack overflows. It
actually stack-overflowed on even smaller files, but I managed to bump
the size upwards, it seems, by the strict-Bytestring trick.
Unfortunately, my personal file has since passed whatever that limit
was.
I've read carefully the previous threads on Data.Binary and Data.Map
stack-overflows, but none of them seem to help; hacking some $!s or
seqs into readDB seems to make no difference, and Seq is supposed to
be a strict datastructure already! Doing things in GHCi has been
tedious, and hasn't enlightened me much: sometimes things overflow and
sometimes they don't. It's all very frustrating and I'm seriously
considering going back to using the original read/show code unless
anyone knows how to fix this - that approach may be many times slower,
but I know it will work.
--
gwern
More information about the Haskell-Cafe
mailing list