Just got bitten by some functionality of split w/ grouped patterns:
(in ruby 1.8.6…)
“foo bar baz”.split(/bar/)
=> ["foo “, " baz”]
“foo bar baz”.split(/(bar)/)
=> ["foo “, “bar”, " baz”]
The 1.8 docs are silent on this, but the 1.9 docs at least say
"If pattern contains groups, the respective matches will be returned in
the
array as well. "
This “feature” surprised the heck out of me… Anyone know if this
functionality is intentional (from the enduser standpoint), or rather a
byproduct of implementation details?
Anyhoo, if anyone else gets weird behavior for string split including
the
pattern, or with groups in the pattern, maybe you’ll remember this or
maybe
google will tell you about this.
The 1.8 docs are silent on this, but the 1.9 docs at least say
"If pattern contains groups, the respective matches will be returned in
the
array as well. "
pickaxe2 p.619:
split
…
…
If pattern includes groups, these groups will be included in the
returned values.
…
This “feature” surprised the heck out of me… Anyone know if this
functionality is intentional (from the enduser standpoint), or rather a
byproduct of implementation details?
It can be a really handy feature. python’s regex split() function works
the same way.
Anyhoo, if anyone else gets weird behavior for string split including
the
pattern, or with groups in the pattern, maybe you’ll remember this or
maybe
google will tell you about this.
Anyhoo, if anyone else gets weird behavior for string split including
the
pattern, or with groups in the pattern, maybe you’ll remember this or
maybe
google will tell you about this.
This “feature” surprised the heck out of me… Anyone know if this
functionality is intentional (from the enduser standpoint), or rather a
byproduct of implementation details?
Anyhoo, if anyone else gets weird behavior for string split including the
pattern, or with groups in the pattern, maybe you’ll remember this or maybe
google will tell you about this.
It’s quite useful. Note that you can always use non-capturing groups
(the (?:expr) construct) if you don’t want them :
“one, two and three”.split(/\s*(and|or|,)?\s+/)
=> [“one”, “,”, “two”, “and”, “three”]
“one, two and three”.split(/\s*(?:and|or|,)?\s+/)
=> [“one”, “two”, “three”]
Fred
This forum is not affiliated to the Ruby language, Ruby on Rails framework, nor any Ruby applications discussed here.