UnicodeUtils 0.5.0 - case conversion, normalization and more


#1

UnicodeUtils is a pure Ruby library that implements
Unicode algorithms for Ruby 1.9.

Install with RubyGems:

$ gem install unicode_utils

Example code:

require “unicode_utils”

UnicodeUtils.upcase(“i”, :tr) => “Ä°”

UnicodeUtils.nfkc(“fi”) => “fi”

Online documentation: http://unicode-utils.rubyforge.org
Source: http://github.com/lang/unicode_utils

New since the 0.4.0 release:

  • The name method was renamed to char_name because it
    conflicted with Module#name.

  • Case-folding

  • A grep method to search through the Unicode character
    database and “U” as a shortcut for use in irb:

$ irb -r unicode_utils/u
irb(main):001:0> U.grep /angstrom/
=> [#<U+212B “â„«” ANGSTROM SIGN utf8:e2,84,ab>]


#2

In article
removed_email_address@domain.invalid,
Stefan L. removed_email_address@domain.invalid wrote:

VW5pY29kZVV0aWxzIGlzIGEgcHVyZSBSdWJ5IGxpYnJhcnkgdGhhdCBpbXBsZW1lbnRzClVuaWNv

I do not doubt it is full of very interesting information but I’d love
to
it… Base64 encoding for news articles is not nice, not everybody use a
web
browser to read/post articles, thanks.