Forum: Ruby UnicodeUtils 1.4.0 - case conversion, normalization and more

Posted by Stefan Lang (Guest)
on 2012-09-30 23:23
(Received via mailing list)
New in this release:
====================

* Updated to Unicode 6.2.0.

* UnicodeUtils.debug accepts a single Integer code point.

* New method UnicodeUtils.white_space_char?

Usage
=====

Ruby 1.9.1 or higher is required.

  $ gem install unicode_utils

  require "unicode_utils/display_width"
  UnicodeUtils.display_width("にっき") # => 6

  $ irb -r unicode_utils/u

  irb(main):001:0> U.debug 0x20ba
   Char | Ordinal | Sid               | General Category | UTF-8
  ------+---------+-------------------+------------------+----------
   "₺"  |    20BA | TURKISH LIRA SIGN | Currency_Symbol  | E2 82 BA

  irb(main):003:0> U.casefold("Straße") == U.casefold("STRASSE")
  => true

  irb(main):004:0> U.titlecase "willkommen österreich"
  => "Willkommen Österreich"

  irb(main):005:0> U.nfkc "ﬁnland"
  => "finland"
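The values in the session above can be cross-checked without the gem. The following is only a sketch in core Ruby, not UnicodeUtils itself, and it assumes a much newer interpreter than the 1.9.1 minimum: String#unicode_normalize needs Ruby 2.2 or later and String#downcase(:fold) needs Ruby 2.4 or later. The variable names are made up for illustration.

```ruby
# Core-Ruby cross-check of the irb session; does not load UnicodeUtils.
# Assumption: Ruby 2.4 or later.

# UTF-8 bytes of U+20BA, formatted like the UTF-8 column of U.debug.
lira = [0x20BA].pack("U")
utf8_hex = lira.bytes.map { |b| format("%02X", b) }.join(" ")

# NFKC replaces the ligature U+FB01 with the two letters "f" and "i".
nfkc = "\uFB01nland".unicode_normalize(:nfkc)

# Full case folding maps U+00DF (sharp s) to "ss", so both sides agree.
folded_equal = "Stra\u00DFe".downcase(:fold) == "STRASSE".downcase(:fold)

puts utf8_hex
puts nfkc
puts folded_equal
```

The escapes \uFB01 and \u00DF are used instead of literal characters so the ligature and the sharp s survive copy and paste.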

Documentation & Source
======================

 http://unicode-utils.rubyforge.org
 http://github.com/lang/unicode_utils

Issues
======

It should work on all implementations of Ruby 1.9.1 or higher,
independently of operating system. If it does not, please report
it at http://github.com/lang/unicode_utils/issues

All tests pass with jruby-1.7.0.RC1. Not all tests pass with
MRI 1.9.3p194 due to unexpected behaviour of String#<< with
UTF-16 strings (http://bugs.ruby-lang.org/issues/7090).
As long as you use only UTF-8, there's no problem.
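If input arrives as UTF-16, one way to stay on the UTF-8 path mentioned above is to transcode before handing the text to the library. This is a minimal sketch in core Ruby; to_utf8 is a hypothetical helper, not part of UnicodeUtils, and that it sidesteps the failing tests is inferred only from the statement above.

```ruby
# Hypothetical helper (not part of UnicodeUtils): transcode any
# string to UTF-8 before passing it to the library.
def to_utf8(str)
  str.encode(Encoding::UTF_8)
end

# Build a UTF-16LE sample, then convert it back to UTF-8.
utf16 = "Stra\u00DFe".encode(Encoding::UTF_16LE)
utf8  = to_utf8(utf16)

puts utf16.encoding
puts utf8.encoding
```

String#encode returns a new string and leaves the UTF-16 original untouched, so the caller can still write the result back out in the source encoding afterwards.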