I’m trying to scrape links using Mechanize. Sometimes accented
(on French pages) are corrupt once Ruby gets them. To see what I mean,
a = WWW::Mechanize.new
page.links.each do |a_link|
Of course, it’s only the accents that are entered in plain text (i.e.,
without entities) that have this problem. But in an imperfect world, I
always count on accents being entered properly.
Is there anything I can do about this? I’ve tried using Iconv to convert
strings to UTF-8, but that just resulted in a different (but still
character in place of the broken ones.
Thanks for any help,