Sanitizing html tags (content)

Hello,

I have a variable which holds the following html code:


some textlink

Now I’d like to process it so that everything would be stripped except
object/param/embed tags (span tag and its content should be removed).

What is the best way to compliment that?

I’ve installed sanitize gem but it wont allow me to strip/remove content
inside html tags as well.

I though of regex but I’m unsure about this.

Many thanks for help!

On Oct 22, 2009, at 12:03 PM, Aljaz F. wrote:

type=“application/x-shockwave-flash” allowscriptaccess=“always”
content
inside html tags as well.

I though of regex but I’m unsure about this.

Many thanks for help!

Have a look at nokogiri. It will parse the HTML for you and you can
then tinker with it as you please. After you’re done, just write it
back out as a string.

Blessings,
TwP

On Thu, Oct 22, 2009 at 5:15 PM, Tim P. [email protected] wrote:

src=“- YouTube
I’ve installed sanitize gem but it wont allow me to strip/remove content
inside html tags as well.

I though of regex but I’m unsure about this.

Please don’t use regexes. Think of the children!

Instead, try Loofah, which is based on Nokogiri and has some nice
sanitization helpers:

http://loofah.rubyforge.org/

Good luck!