[Haskell-cafe] Is XHT a good tool for parsing web pages?

Peter Robinson thaldyron at gmail.com
Tue Apr 27 10:26:08 EDT 2010


On 27 April 2010 16:22, John Creighton <johns243a at gmail.com> wrote:
>> Subject: Is XHT a good tool for parsing web pages?
>> I looked a little bit at XHT and it seems very elegant for writing
>> concise definitions of parsers by forms but I read that it fails if
>> the XML isn't strict and I know a lot of web pages don't use strict
>> XHTML. Therefore I wonder if it is an appropriate tool for web pages.

I don't know about XHT but tagsoup [1] does a pretty good job parsing web pages.

  Peter

[1] http://hackage.haskell.org/package/tagsoup


More information about the Haskell-Cafe mailing list