Html parses

Hello

Does anyone knows about an html parser in ruby?
One that separate all tags…

thx

On Dec 8, 2005, at 12:24 PM, Marcio Francisco wrote:

Hello

Does anyone knows about an html parser in ruby?
One that separate all tags…

Sure. Here’s one possibility:

James Edward G. II

Marcio Francisco wrote:

Hello

Does anyone knows about an html parser in ruby?
One that separate all tags…

thx


Posted via http://www.ruby-forum.com/.

I’m time trialling python and ruby parser right now, so I’m going to
spew you a list of links for:

html-parser-2
htree
ymHTML module
htmltools: requires patched html-parser (gem)

rubyful soup:
WWW::Mechanize # built on htmltools, xmltree,
htmltokenizer: handles mismatched tags (gem)

REXML: Tree & stream parsing
(Yeah, that’s a lot of libs)

Here’s the spew:

http://raa.ruby-lang.org/project/html-parser-2/

http://cvs.m17n.org/~akr/htree/

http://www.yoshidam.net/Ruby.html#ymHTML

http://ruby-htmltools.rubyforge.org/
http://ruby-htmltools.rubyforge.org/doc/
http://bike-nomad.com/ruby/

http://rubyforge.org/projects/wee/

http://rubyforge.org/projects/htmltokenizer/

http://www.germane-software.com/software/rexml/