[Haskell-cafe] XML parser recommendation?

Neil Mitchell ndmitchell at gmail.com
Mon Oct 22 05:52:33 EDT 2007


Hi Ketil,

> I'm struggling to get my HXT-based parser to parse a largish file
> (<300MB), even after breaking into reasonably-sized chunks.  The
> culprit appears to be parsing one element comprising 25K lines of
> text, which apparently requires more memory than the 2Gb my computer
> is equipped with.

You can try TagSoup (http://www-users.cs.york.ac.uk/~ndm/tagsoup/)
which isn't really a complete XML parser, but may do what you want.
The other option is HaXml which Malcolm has been adding lazy parsing
to - I'm not sure if that is in a released variant or not.

Thanks

Neil


More information about the Haskell-Cafe mailing list