Grep page source


#1

hi all, any one know how to grep page source by rails?
i.e.
i wanna grep html source from http://www.ruby-forum.com/ by ruby code.

regards


#2

Joe B. wrote:

hi all, any one know how to grep page source by rails?
i.e.
i wanna grep html source from http://www.ruby-forum.com/ by ruby code.

one way would be to grab the page serverside:

http://ruby-doc.org/stdlib/libdoc/net/http/rdoc/index.html

as for the grepping, there are a variety of regular expression options,
as well as dealing with the DOM tree of the fetched page at a higher
level. i guess you could also do similar things clientside, with
greasemonkey or similar utils

if you are serious about doing a lot of webpage ‘remixing’ you could
hceck out mousehole, which is some kind of ruby web proxy


#3

thanks carmen

carmen wrote:

Joe B. wrote:

hi all, any one know how to grep page source by rails?
i.e.
i wanna grep html source from http://www.ruby-forum.com/ by ruby code.

one way would be to grab the page serverside:

http://ruby-doc.org/stdlib/libdoc/net/http/rdoc/index.html

as for the grepping, there are a variety of regular expression options,
as well as dealing with the DOM tree of the fetched page at a higher
level. i guess you could also do similar things clientside, with
greasemonkey or similar utils

if you are serious about doing a lot of webpage ‘remixing’ you could
hceck out mousehole, which is some kind of ruby web proxy