Forum: Ruby on Rails Extracting the text content of a MS-word document

Announcement (2017-05-07): is now read-only since I unfortunately do not have the time to support and maintain the forum any more. Please see and for other Rails- und Ruby-related community platforms.
Alexis P. (Guest)
on 2008-11-03 19:31
(Received via mailing list)
I need to be able to extract the text content of a MS-WORD file.
I have used antiword in the past but was wondering of there was
anything new to do that.

The application deals with uploaded MS word documents. Resumes for
I would like to get the text from the file in order to save and modify
and possibly to identify and save the images included in the file.
(the picture of the candidate)

this would have to work with both .doc and .docx versions
and it does not have to be free.

My app will be on a linux server not on a windows server.
Thank you
This topic is locked and can not be replied to.