Regexp simple question

arunvoip · May 11, 2009, 4:51pm

Hi,
I’m using the following regexp to capture a particular string from a
japanese website content.

/

?
([^<]

The following is the match result.

å¥³æ€§ /

Is there a way I can remove the slash(’/’) from my result by modifying
the above regular expression.

N. B. gsub can be used but I want to know whether there it can be
achieved by modifying the above regexp

Please help.

Thanks
Arun

arunvoip · May 11, 2009, 5:43pm

In words, describe what just the regex part does.

arunvoip · May 11, 2009, 5:45pm

7stud – wrote:

In words, describe what just the regex part does.

I mean the part between the

tags.

arunvoip · May 11, 2009, 5:46pm

Your boss doesn’t like gsub?

Try

/

?
([^</]

That should work, but it won’t work for a case where you have /
separating
something in the inner text.

Jayanth

arunvoip · May 11, 2009, 9:41pm

That bugle’s been blown to death mate.

Jayanth

arunvoip · May 11, 2009, 6:06pm

Arun K. wrote:

I’m using the following regexp to capture a particular string from a
japanese website content.

/

.?
([^<]
?)</li>/m

Parsing HTML with Regexp makes certain baby dieties cry.

Use Nokogiri, with an XPath of ‘/ul[ @id = “ownerProfile” and @class =
“owner” ]’. Then pull out the .text and you are done!