Hpricot - parse html

hi @all

I would like to parse HTML code and remove all tags that start with

How can I remove these tags with a regex? I used the gsub! function to
manipulate the string.

Thanks for helping…

Try this…

C:\temp>irb
irb(main):001:0> mystring = "xxx yy zz"
=> "xxx yy zz"
irb(main):002:0> mystring.gsub(//,'')
=> "xxx yy zz"

Regards,
Jim
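
The tag and the regex in the posts above did not survive, so here is only a minimal sketch of the gsub approach. The helper name strip_tag and the tag name "font" are assumptions for illustration, not the original poster's code.

```ruby
# Hypothetical helper: remove opening and closing tags with the given name,
# keeping the text between them. The tag name is supplied by the caller.
def strip_tag(html, name)
  # Matches <name ...> and </name>. The class [^>]* also matches newlines,
  # so a tag that is split across lines is still removed.
  pattern = /<\/?#{Regexp.escape(name)}\b[^>]*>/i
  html.gsub(pattern, '')
end

# "font" is only an example tag name.
sample = %(<p>xxx <font color="red">yy</font> zz</p>)
puts strip_tag(sample, 'font')
```

Note that gsub returns a new string, while gsub! changes the string in place and returns nil when nothing was replaced, so its return value should not be chained.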

You should also handle the \n and \r characters.

So I think the regex should be “”.

On Jan 3, 2008 4:37 AM, sishen wrote:

> You should also process the \n, \r char.
>
> So I think the regex should be “”.

Don’t forget about the multiline option. It’s easy: just add an ‘m’
after the regexp.

Daniel Brumbaugh K.
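
To illustrate the multiline point, here is a small sketch. The tag name "font" and the sample string are assumptions, since the original regex is missing from the thread. In Ruby, "." does not match a newline unless the regexp has the m option.

```ruby
# Sample input where the opening tag is split across two lines (made up).
html = "xxx <font\n  color=\"red\">yy</font> zz"

# Without /m, "." stops at "\n": only the closing tag is removed and the
# opening tag that spans two lines is left in place.
without_m = html.gsub(/<\/?font.*?>/, '')

# With /m, "." also matches "\n", so both tags are removed.
with_m = html.gsub(/<\/?font.*?>/m, '')

p without_m
p with_m
```

A negated character class such as [^>]* matches newlines on its own, so a pattern written that way does not need the m option.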