I found that, unlike Ruby 1.8, the word character class in Ruby 1.9
regexes does not match german umlauts (or any other letters other than
ASCII). According to the oniguruma documentation
(http://www.geocities.jp/kosako3/oniguruma/doc/RE.txt), it should match
everything from the unicode “letter” category, which includes umlauts.
test.rb (also attached):
s = “Ã¼”
Result with ruby 1.8:
Result with ruby 1.9.2:
Is that a bug, or is there any reason behind this behavior?