Forum: Ruby best technique for detecting charset/character encoding of RSS feeds

Daniel C. (Guest)
on 2008-12-12 08:45
(Received via mailing list)
I have a Ruby application that fetches RSS and Atom feeds. I've tried
using the chardet gem (UniversalDetector) to figure out what the
character encoding of each feed is. But for some strange reason this
library thinks a lot of feeds are EUC-KR (Korean) when they plainly aren't.

Can anyone suggest a better way to find the encoding of RSS and Atom
feeds (e.g. via BOM detection, etc.) with Ruby?
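
For reference, the BOM idea mentioned above could be sketched roughly as follows. This is only a minimal sketch, not part of chardet or any other gem: the helper name `sniff_feed_encoding` and the `BOMS` table are hypothetical, it assumes Ruby 2.0 or later (for `String#b`), and it assumes the raw, undecoded bytes of the feed are available. It checks for a byte order mark first, then falls back to the `encoding` attribute of the XML declaration, and finally defaults to UTF-8, which is the XML default when nothing else is declared.

```ruby
# Hypothetical helper: guess a feed's encoding from its raw bytes.
# Known byte order marks. The UTF-32 entries come first because the
# UTF-32LE BOM begins with the same two bytes as the UTF-16LE BOM.
BOMS = [
  ["\xEF\xBB\xBF".b,     "UTF-8"],
  ["\xFF\xFE\x00\x00".b, "UTF-32LE"],
  ["\x00\x00\xFE\xFF".b, "UTF-32BE"],
  ["\xFF\xFE".b,         "UTF-16LE"],
  ["\xFE\xFF".b,         "UTF-16BE"]
].freeze

def sniff_feed_encoding(raw)
  bytes = raw.b # binary copy, so all comparisons are byte-wise

  # 1. A BOM, if present, is the strongest hint.
  BOMS.each do |bom, name|
    return name if bytes.start_with?(bom)
  end

  # 2. Otherwise look for encoding="..." in the XML declaration.
  #    This only works for ASCII-compatible encodings.
  head = bytes[0, 200]
  if head =~ /\A\s*<\?xml[^>]*encoding\s*=\s*["']([A-Za-z0-9._-]+)["']/
    return Regexp.last_match(1).upcase.force_encoding("UTF-8")
  end

  # 3. Nothing declared: assume the XML default.
  "UTF-8"
end
```

The returned name could then be passed to `force_encoding` on the raw string before transcoding it with `encode("UTF-8")`. Note the limits of this sketch: it does not look at a `charset` parameter in the HTTP `Content-Type` header, it cannot recognize UTF-16 or UTF-32 feeds that lack a BOM, and it trusts whatever the feed declares, which may itself be wrong. A statistical detector would still be needed as a last resort for feeds that declare nothing and are not UTF-8.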