A simple command that splits up a string into numbers and letters

nephish · February 5, 2009, 6:40am

Hey all,

i am looking for an easy way to split a string into letters and numbers.
so if i had a string ‘34JKBY103’ i could get [‘34’, ‘JKBY’, ‘103’]

I could write up something, but thought that if there was already
something
out there that i hav’nt found, it would probably be cleaner.
thanks

sk

nephish · February 5, 2009, 7:05am

On Feb 5, 2009, at 12:39 AM, shawn bright wrote:

sk

irb> ‘34JKBY103’.scan(/\d+|\D+/)
=> [“34”, “JKBY”, “103”]

that looks like an easy way

-Rob

Rob B. http://agileconsultingllc.com
[email protected]

nephish · February 5, 2009, 7:15am

shawn bright wrote:

thanks

sk

I don’t know if there’s anything out there now (there might be), but it
seems pretty simple. Just split on either \d+ or \D+, depending
(unless I’m missing something in your requirement)?

nephish · February 5, 2009, 7:47am

I got it on the first try with this split!

irb> ‘34JKBY103dfd878dsf78s78s’.split(/([a-zA-Z]+)(?=[0-9])/)
=> [“34”, “JKBY”, “103”, “dfd”, “878”, “dsf”, “78”, “s”, “78s”]

the first set of parens is capturing while the second set of parents is
a zero width positive look ahead

nephish · February 5, 2009, 7:34am

sorry, how do i split on a \d+ ?
sk

nephish · February 5, 2009, 7:53am

This is pretty close to what you want except for the first null

‘34JKBY103dfd878dsf78s78s’.split(/(\d+)/)
=> ["", “34”, “JKBY”, “103”, “dfd”, “878”, “dsf”, “78”, “s”, “78”, “s”]

nephish · February 5, 2009, 7:56am

Tim G. wrote:

something out there that i hav’nt found, it would probably be
Someone posted this already, but:

irb(main):010:0> s = “34JKBY103”
=> “34JKBY103”
irb(main):011:0> s.scan(/\d+|\D+/)
=> [“34”, “JKBY”, “103”]
irb(main):012:0>

Also, remember, there are several ways to do this. You can use an
actual split() function with a regular expression.

See: class String - RDoc Documentation

nephish · February 5, 2009, 7:55am

shawn bright wrote:

[post.]

sk

I don’t know if there’s anything out there now (there might be), but
it
seems pretty simple. Just split on either \d+ or \D+, depending
(unless I’m missing something in your requirement)?
<please don’t quote signatures>

Someone posted this already, but:

irb(main):010:0> s = “34JKBY103”
=> “34JKBY103”
irb(main):011:0> s.scan(/\d+|\D+/)
=> [“34”, “JKBY”, “103”]
irb(main):012:0>

nephish · February 5, 2009, 8:03am

way cool, thanks, all
-sk

nephish · February 5, 2009, 8:19am

I sort of got hung up on doing this with split. Using scan might be
easier than split. However it is worth noting that when you are
splitting on a pattern you can keep what the pattern matches in the
results array by capturing with a set of ()s. Compare these two
statements:

irb> ‘a,b,c’.split(/(,)/)
=> [“a”, “,”, “b”, “,”, “c”]

irb> ‘a,b,c’.split(/,/)
=> [“a”, “b”, “c”]