wget:download_a_full_website
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | |||
wget:download_a_full_website [2016/10/18 09:10] – peter | wget:download_a_full_website [2019/12/04 22:35] (current) – removed peter | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== wget - Download a Full Website ====== | ||
- | |||
- | To download a full website and make it available for local viewing. | ||
- | |||
- | <code bash> | ||
- | wget --mirror -p --convert-links -P ./LOCAL-DIR WEBSITE-URL | ||
- | </ | ||
- | |||
- | * < | ||
- | * -p : download all files that are necessary to properly display a given HTML page. | ||
- | * < | ||
- | * -P ./LOCAL-DIR : save all the files and directories to the specified directory. | ||
- | |||
- | |||
- | ===== Download files recursively with wget ===== | ||
- | |||
- | With wget you can download files directly in a shell. If you want to download a whole site, known as downloading recursively, | ||
- | |||
- | <code bash> | ||
- | wget -r http:// | ||
- | </ | ||
- | |||
- | By default wget respects the **robots.txt** file and thus only downloads the non-private files. | ||
- | |||
- | Wget can be adviced to ignore that rules and thus it downloads the private files anyway. Set the e option as shown next. | ||
- | |||
- | <code bash> | ||
- | wget -e robots=off -r http:// | ||
- | </ | ||
wget/download_a_full_website.1476781858.txt.gz · Last modified: 2020/07/15 09:30 (external edit)