Hi, here’s Hpricot 0.5.
gem install hpricot --source http://code.whytheluckystiff.net
Hpricot reads HTML pages and works hard to fix them up and give you
everything you need to wind your way around them and hack them up!
Inspired by John Resig’s JQuery and Tanaka A.'s HTree.
- Hpricot is standalone. It’s dependant on no other libs, just
Ruby. - Hpricot is fast, its parser is written in C with help of the
wonderful Ragel state machine compiler. - However, Hpricot also works hard to fix up HTML and pays a small
penalty to get it right. - How hard does Hpricot work? My rule is: if Firefox parses it,
Hpricot should too.
This release has a number of really nice features. The new
to_original_html
method will try to preserve as much of the
original HTML as possible (including its mistakes) while still
merging in your changes. Also, you can test text nodes with syntax
like: //a[text()='Click Me!']
.
Should appear on Rubyforge soon enough. Thank you to all the
ticketeers and patchistadores out there, especially Leslie Wu who’s
been punching that commit button like she’s doin the turtle trap!!
_why