[Haskell-cafe] Lazy HTML parsing with HXT, HaXML/polyparse, what else?

Henning Thielemann lemming at henning-thielemann.de
Fri May 11 08:54:08 EDT 2007


On Fri, 11 May 2007, Neil Mitchell wrote:

> Depending on exactly what you want, TagSoup may be of interest to you.
> It is lazy, but it doesn't return a tree. It is very tollerant of
> errors, and will simply never "fail to parse" something.
>
> http://www-users.cs.york.ac.uk/~ndm/tagsoup/

That's an interesting option. It could be used as a lexer for a full-blown
HTML parser. Sometimes I need the tree structure. But why does this simple
piece of code needs -fglasgow-exts?


Thanks for the package and the hint!
 Henning


More information about the Haskell-Cafe mailing list