Forum: Ruby Downloading Array of PDF Files Extracting MetaData

Announcement (2017-05-07): is now read-only since I unfortunately do not have the time to support and maintain the forum any more. Please see and for other Rails- und Ruby-related community platforms.
Ad97b577f331ae29ed90da5751f2e44f?d=identicon&s=25 Dan Diebolt (dandiebolt)
on 2006-04-13 15:37
(Received via mailing list)
Let me simplify the description of a task I have to perform.

  I have an array of urls that all point to pdf files. I need to iterate
through the array and download each pdf file and extract some simple
metadata such as title, author, date, etc that is know to be in each pdf
file in either a known place or locatable with a known pattern (regular
expression). The output of this would be a directory full of the pdf
files and an html index file that lists the metadata along with a
hyperlink to the pdf file.

  Do any ruby modules exist to help with this task of downloading pdf
files and extracting text from them? Can the open-uri module do this?

  Thanks in advance
This topic is locked and can not be replied to.