Wanted: Script to convert to/from UTF-8/UTF-16/UTF-32

Does someone have a little ruby script that will read a file in
UTF-8/UTF-16/UTF-32 (my choice) and search for all the characters
between
0x7f-0xffffff and convert them to an ASCII digit string that begins with
“&#” and ends with “;” and output the whole thing? If not, could someone
tell me how to write one?

How about a script to do the inverse?

Thanks!
siegfried

you could just convert everything to it’s html entity form

http://htmlentities.rubyforge.org/
gem install -r htmlentities

+David

Siegfried,

There is a wrapper library for the standard iconv that takes care of
this stuff:

http://www.ruby-doc.org/stdlib/libdoc/iconv/rdoc/index.html

Take a look at:

http://www.websideattractions.com/2007/10/24/ruby-script-to-convert-files-character-encoding/

Best regards,


Aníbal Rojas

http://[email protected]