[Haskell-cafe] HTML library with DOM?

Gregory Collins greg at gregorycollins.net
Thu Oct 7 08:41:19 EDT 2010

Michael Snoyman <michael at snoyman.com> writes:

> As far as I know, Neil Mitchell's tagsoup[1] parses according to the
> HTML 5 parsing rules, but it just generates a list of Tags[2], so
> you'd have to build the DOM tree up from there. I personally have had
> great experience with tagsoup. It's even the core of HTML-scraping
> technology powering searchonce[3].

Yep, someone else wrote to me privately to say the same thing (that
tagsoup respects the HTML5 lexing rules). So I'll be using it as the
basis of an HTML5 DOM parser. Stay tuned!
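
To illustrate the step described above (turning a flat list of tags into
a tree), here is a minimal sketch. Note the assumptions: the Tag and Node
types and the buildForest function are hypothetical stand-ins defined
here for illustration and are not tagsoup's actual API. The nesting logic
is a naive open/close matcher, not the HTML5 tree-construction algorithm.

```haskell
-- Hypothetical stand-in for a flat token stream such as a tag lexer
-- might produce. This is NOT the real tagsoup Tag type.
data Tag
  = TagOpen String [(String, String)]  -- element name and attributes
  | TagClose String                    -- element name
  | TagText String
  deriving (Eq, Show)

-- Hypothetical tree node for the resulting DOM-like structure.
data Node
  = Element String [(String, String)] [Node]
  | Text String
  deriving (Eq, Show)

-- Parse a run of sibling nodes. Stops at the end of input or at the
-- first close tag that no element in this run has claimed, and returns
-- the unconsumed tags (which are empty or start with that close tag).
siblings :: [Tag] -> ([Node], [Tag])
siblings [] = ([], [])
siblings ts@(TagClose _ : _) = ([], ts)
siblings (TagText s : ts) =
  let (rest, remaining) = siblings ts
  in (Text s : rest, remaining)
siblings (TagOpen name attrs : ts) =
  let (children, afterChildren) = siblings ts
      -- Consume the close tag only if it matches this element;
      -- otherwise treat the element as implicitly closed.
      afterClose = case afterChildren of
        (TagClose n : more) | n == name -> more
        other -> other
      (rest, remaining) = siblings afterClose
  in (Element name attrs children : rest, remaining)

-- Build a forest from the whole token list, dropping any close tag
-- that has no matching open tag at the top level.
buildForest :: [Tag] -> [Node]
buildForest ts = case siblings ts of
  (nodes, []) -> nodes
  (nodes, _ : rest) -> nodes ++ buildForest rest

main :: IO ()
main = print (buildForest
  [ TagOpen "p" [("class", "x")]
  , TagText "hi"
  , TagOpen "b" []
  , TagText "!"
  , TagClose "b"
  , TagClose "p"
  ])
```

A real HTML5 DOM builder would also need to handle void elements such
as <br> (which this sketch would let swallow the following siblings),
implied end tags, and the other error-recovery rules in the spec.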

Gregory Collins <greg at gregorycollins.net>
