How to get all image, PDF and other file links from a website?

I am working on an application where I have to

  1. get all the links of a website, and
  2. then get the list of all the files and file extensions in each
     of those web pages/links.

I am done with the first part of it :slight_smile:
Now I have to get all the files/file extensions in each of the
pages.

Can anybody guide me on how to parse the links/web pages and get the
file extensions in each page?
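
For concreteness, here is a minimal sketch of one way to do this in
Python, assuming the third-party requests and beautifulsoup4 libraries
are installed; the function name files_and_extensions and the
example.com URL are only placeholders, not something from the original
post:

```python
import os
from urllib.parse import urljoin, urlparse

import requests
from bs4 import BeautifulSoup


def files_and_extensions(page_url):
    """Return (absolute_url, extension) pairs for files referenced on one page."""
    html = requests.get(page_url, timeout=10).text
    soup = BeautifulSoup(html, "html.parser")

    results = []
    # Anchors (<a href>) cover PDFs and other documents; <img src> covers images.
    for tag, attr in (("a", "href"), ("img", "src")):
        for element in soup.find_all(tag):
            link = element.get(attr)
            if not link:
                continue
            absolute = urljoin(page_url, link)
            # Take the extension from the URL path so query strings are ignored.
            ext = os.path.splitext(urlparse(absolute).path)[1].lower()
            if ext:  # skip links that have no extension (e.g. plain directories)
                results.append((absolute, ext))
    return results


# Placeholder URL, for illustration only.
for url, ext in files_and_extensions("https://example.com/"):
    print(ext, url)
```

The same idea extends to other tags (script, link, source) if you also
want the files they reference.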

Is it me, or has this particular homework question turned up a few times
already?

Hint: This has been asked and answered quite recently (yesterday,
even), so try reading the mailing list.