Re: Decent HTML Parser?

daveh · July 12, 2006, 3:12am

On Jul 11, 2006, at 17:55, Kevin Weller wrote:

Ooooh, thanks, that might be just what the doctor ordered…especially
if it handles a single text line of an HTML document. Right now I have
a temporary solution that involves using ruby-htmltools to parse the
entire document, then finding the part that I want with an XPath query.
However, Rubyful Soup might turn out to be a better performer if it
does what I want. Thanks so much!

Yup, sounds like what you want.

I would have preferred to just parse the whole page in my app, but Soup
is so slow that I had to regex a bunch of the noise out of the way. I
throw the remnants to Soup; after which everything’s really easy.