Hi, sorry if this is a stupid question… i’ve only been programming ruby
for about six hours.
I’m trying to white a loop to parse through a webpage and get all the
links to other pages. This loop depends on a regular expression to find
all the <a href tags… but inside the loop there is another regular
expression which looks to see if the link is relative or static. The
problem is the inner regular expression changes the $1 variable so the
loop just fails on the first iteration. I’ve tried making a copy of the
$1 variable but the result just ends up containing nil.
Any help you could offer would be gratefully appreciated
Hears my code so far:
loop do url = $1 puts $1 #A url puts $url #Always nil ? if $1 =~ /^http/ //Inner regular expression new_url = host + path else new_url = path end newPage = WebPage.new(new_url, link_depth + 1) break unless url =~ @@ahref_filter end