Re: unicode in ruby

Have you tried ruby-oniguruma
(http://www.geocities.jp/kosako3/oniguruma/)?

Best regards,

Axel

On 3/8/06, [email protected] wrote:

> Have you tried ruby-oniguruma
> (http://www.geocities.jp/kosako3/oniguruma/)?

No. I want a Unicode string processing library, not a Unicode regexp
library. I am currently working on a program that needs to handle
Czech text.
Unicode is not supported in current Ruby (1.8), and I can work around
that by using ISO-8859-2. I chose the workaround because I need ease
of installation, and the Latin-2 encoding is still widely used.
With Ruby 2, Oniguruma should be part of the interpreter, but it still
does not give me downcase in UTF-8. At least it should be possible to
split a string into characters using Oniguruma and process the
characters myself.
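
To illustrate the "split into characters and process them myself" idea,
here is a minimal sketch. It uses Ruby's built-in regexp with the /u
(UTF-8) flag as a stand-in for Oniguruma. The names CZECH_DOWNCASE,
split_chars and czech_downcase are hypothetical helpers made up for this
example, and the mapping table only covers the Czech accented capitals,
so treat it as an assumption-laden starting point rather than a complete
Unicode downcase.

```ruby
# encoding: utf-8

# Hypothetical lookup table: Czech accented capitals => lowercase.
# Not part of Ruby or Oniguruma; extend it for other alphabets.
CZECH_DOWNCASE = {
  "Á" => "á", "Č" => "č", "Ď" => "ď", "É" => "é", "Ě" => "ě",
  "Í" => "í", "Ň" => "ň", "Ó" => "ó", "Ř" => "ř", "Š" => "š",
  "Ť" => "ť", "Ú" => "ú", "Ů" => "ů", "Ý" => "ý", "Ž" => "ž"
}

# Split a UTF-8 string into characters. The /u flag makes "." match
# one whole UTF-8 character; /m lets "." match newlines as well.
def split_chars(str)
  str.scan(/./mu)
end

# Downcase character by character: use the table for accented
# letters and fall back to String#downcase for everything else.
def czech_downcase(str)
  split_chars(str).map { |ch| CZECH_DOWNCASE[ch] || ch.downcase }.join
end

puts czech_downcase("ČEŠTINA")
```

The table keeps the example free of external dependencies, which matters
when ease of installation is the goal; the cost is that every letter
outside the table is left to whatever String#downcase does.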

Thanks

Michal

