Forum: Ruby Documentation for HTMLParser

Eli Bendersky (eliben)
on 2007-04-25 05:35
(Received via mailing list)

I have a few questions about parsing HTML:

1) The default rdoc documentation for HTMLParser (the one that ships
with the Win32 binary distribution of Ruby) is very poor. Where can I
find good documentation for the module or, better yet, a tutorial or
examples?

2) Is HTMLParser modeled on Perl's HTML::Parser?

3) Can someone suggest the best parser for tokenizing an HTML document
and building a tree from it? Hpricot looks like a nice parser and is
well documented, but I'm not sure it's suitable for this.

Thanks in advance