1
When using wget with the recursive option turned on I am getting an error message when it is trying to download a file. It thinks the link is a downloadable file when in reality it should just be following it to get to the page that actually contains the files(or more links to follow) that I want.
wget -r -l 16 --accept=jpg website.com
The error message is: .... since it should be rejected. This usually occurs when the website link it is trying to fetch ends with a sql statement. The problem however doesn't occur when using the very same wget command on that link. I want to know how exactly it is trying to fetch the pages. I guess I could always take a poke around the source although I don't know how messy the project is. I might also be missing exactly what "recursive" means in the context of wget. I thought it would run through and travel in each link getting the files with the extension I have requested.
I posted this up over at stackOverFlow but they turned me over here:) Hoping you guys can help.
EDIT: Output of error message
2010-04-13 16:54:47 (128 KB/s) - `somewebsite.com/index.php?id=917218' saved [10789]
Removing somewebsite.com/index.php?id=917218 since it should be rejected.
I rather not reveal the source of the website :)
Please post the actual error message you get, or even better the full output of wget. – sleske – 2010-04-13T22:21:34.387
That message happens a lot with various websites that is traversing. – Shadow – 2010-04-13T22:58:40.013