I’ve run into a problem parsing XML with REXML, and it looks like the
problem has to do with UTF-16 encoding and a bug with REXML.
I’m still running OS X 10.4.11, Ruby 1.8.6 (using Locomotive). I even
tried upgrading to the latest version of REXML 3.1.7.3 - no luck. It
still gives errors, only now it’s saying the following.
Iconv::InvalidCharacter: “>”
Is this old news to everyone? If so, is there known solution for
this?
I can’t be the only person who needs to parse UTF-16 xml in Rails.
I have an interim Kludge for now. I’m chopping the BOM piece off of
the XML and then sending it through as XML via REXML. It’s working
like a charm now. It’s not pretty, but it’ll do until I figure out if
this is part of a bigger problem.
Since this part of the string is always going to be same, slice the