[Haskell-cafe] looking for tagsoup-parsec examples
José Romildo Malaquias
j.romildo at gmail.com
Mon Nov 22 15:53:06 EST 2010
On Mon, Nov 22, 2010 at 02:20:22PM -0200, José Romildo Malaquias wrote:
>
> I am looking for some examples of code using Text.HTML.TagSoup.Parsec,
> from the tagsoup-parsec package.
In an attempt to learn how to use tagsoup together with parsec in order
to do web scraping, I rewrote the application that displays the
Haskell.org hit count, explained in the "Drinking TagSoup by Example"
tutorial [1]. The source code is attached.
First I tried to use tagsoup-parsec [2], but it was not too
helpful. Therefore I wrote a few parser combinators myself (inspired by
tagsoup-parsec and the "TagSoup, meet Parsec!" blog post [3]).
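To illustrate the kind of combinators meant here, a minimal sketch
follows. It is not the attached parsehtml.hs. It assumes parsec 3 and
the tagsoup package, and the names TagParser, tagEater, satisfyTag,
tagOpen, tagClose, tagText, anyTag, skipTo and paragraph are chosen for
illustration only (tagEater merely borrows its name from the unexported
tagsoup-parsec function and is defined here from scratch).

```haskell
module Main where

import Text.HTML.TagSoup (Tag (..), isTagCloseName, isTagOpenName, parseTags)
import Text.Parsec (Parsec, parse, tokenPrim, try, (<|>))
import Text.Parsec.Pos (incSourceColumn)

-- A parser whose input stream is a list of tags rather than characters.
type TagParser = Parsec [Tag String] ()

-- Primitive: consume one tag if the function returns Just.
-- The "column" is simply advanced by one per tag consumed.
tagEater :: (Tag String -> Maybe a) -> TagParser a
tagEater = tokenPrim show (\pos _ _ -> incSourceColumn pos 1)

-- Consume one tag that satisfies a predicate.
satisfyTag :: (Tag String -> Bool) -> TagParser (Tag String)
satisfyTag p = tagEater (\t -> if p t then Just t else Nothing)

-- Match an opening or closing tag with the given name.
tagOpen, tagClose :: String -> TagParser (Tag String)
tagOpen name = satisfyTag (isTagOpenName name)
tagClose name = satisfyTag (isTagCloseName name)

-- Match a text node and return its contents.
tagText :: TagParser String
tagText = tagEater f
  where
    f (TagText s) = Just s
    f _ = Nothing

-- Match any single tag.
anyTag :: TagParser (Tag String)
anyTag = satisfyTag (const True)

-- Skip tags until the given parser succeeds.
skipTo :: TagParser a -> TagParser a
skipTo p = try p <|> (anyTag *> skipTo p)

-- Hypothetical example: the text inside a <p>...</p> element.
paragraph :: TagParser String
paragraph = tagOpen "p" *> tagText <* tagClose "p"

main :: IO ()
main =
  print $
    parse (skipTo paragraph) "example" $
      parseTags "<html><body><p>hello</p></body></html>"
```

Everything is built on the single primitive tagEater, which is why
having such a function exported from a library matters: without it, new
tag-level parsers cannot be defined outside the library.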
I am posting the program here so that other Haskell programmers can
comment on it.
I would also like to make some suggestions to the author of
tagsoup-parsec:
a) export more functions, like tagEater, which may be needed in order to
define new parsers or parser combinators; I needed them, but they were
not usable because they are not exported;
b) add more basic parsers and parser combinators (at least the ones I
have defined in my program);
c) add some examples;
d) use parsec version 3.
[1] http://community.haskell.org/~ndm/darcs/tagsoup/tagsoup.htm
[2] http://hackage.haskell.org/package/tagsoup-parsec
[3] http://therning.org/magnus/archives/367
Regards,
Romildo
-------------- next part --------------
A non-text attachment was scrubbed...
Name: parsehtml.hs
Type: text/x-haskell
Size: 1124 bytes
Desc: not available
Url : http://www.haskell.org/pipermail/haskell-cafe/attachments/20101122/54d8be2b/parsehtml.bin