wget --random-wait -r -p -e robots=off -U mozilla http://your.site.org <- yanks * wget --random-wait -r --no-parent -p http://your.site.org/content/dir <- grabs a dir
some useful swtiches per man:
-p ; include all files, including images. -e robots=off ; do not obey server-side robots.txt -U mozilla ; browser identity. --random-wait ; number of seconds to wait, thus avoiding server black list. --limit-rate=15k ; throttle the download rate. -b ; continue application in background. -o ; output log (as opposed to scrolling on screen).
No comments:
Post a Comment