At present I’m learning about regular expressions in order to do an html
scraping assignment. Everything worked fine until I met with a problem.
Given below are two different html tags
Now what i want is to capture the url inside ‘href’ if
type=“application/rss+xml”. It really seems to be simple but it is the
position of the ‘type’ that creates the problem. In the first tag the
type is before href and in the second the type is after href. It seems
to me as an interesting problem but i need help to solve that. Please