Help with Hpricot and collect


#1

Hi and thanks for the help, I am new to ruby and I am trying to get some
data from a website using the Hpricot library.I am doing this: links =
Hpricot(index_page).search(“td.det_movie”).collect{ |link| link[“href”]
} to get an array of data but the thing is that right now I need some
text out of tags and the previous version only works with tags.
This is part of the code that I am reading, and I need the part that is
not in tags:

the file is very large and I need all the objects with the
class=“det_movie”.

thanks for any help

6:25 9:20 p.m. (Mon./Fri.) 4:00 p.m. (No previews) (Sat./Sun./Hol.) 1:00 p.m. (No previews) 3:30 p.m.

#2

require ‘hpricot’

doc=Hpricot(‘The Day
the Earth Stood Still
The Day the Earth Stood Still’)

links=(doc/"//a.link").collect do |link|
 [link[:href],link.inner_text]
end

=> [
[“detalle_movie.php?mv_id=618”, “The Day the Earth Stood Still”],
[“detalle_movie.php?mv_id=618”, “The Day the Earth Stood Still”]
]