Go Premium for a chance to win a PS4. Enter to Win

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 617
  • Last Modified:

creating a mirror with wget

Hi All,

I'm trying to create a mirror of a website by using wget. It works pretty well on most pages. The HTML of the problem pages tries to include css files which were not downloaded by wget. What is the easiest way to solve this? See the code below.
<style type="text/css" media="screen">@import "/path/file.css";</style>

Open in new window

0
alberthendriks
Asked:
alberthendriks
1 Solution
 
thetmanvnCommented:
Pretty details on doing mirror with wget

http://linuxreviews.org/quicktips/wget/

Another Solution: Using HTTrack for Linux

http://www.httrack.com/page/2/en/index.html

Good luck
0
 
qwerty021600Commented:
<STYLE TYPE="text/css" MEDIA="screen">
<!--
  @import url(/path/file.css);
-->
</STYLE>
0
 
omarfaridCommented:
what wget command did you use?
0
 
alberthendriksAuthor Commented:
I use the code below. The idea of the strange base-url is that I could string-replace it later. However it appears nowhere, and it doesn't seem relevant.
wget --base="{{__base-url__}}" \
     --convert-links \
     -i urls.txt \
     --user-agent="Firefox faker for migration" \
     --directory-prefix ~/public_html/migratie \
     --mirror \
     --save-headers \
     --limit-rate=45k \
     -N \
     -o wget.log \
     --page-requisites \
     --cache=off \
     --exclude-directories=foo,bar

Open in new window

0
 
alberthendriksAuthor Commented:
Nobody seems to know a solution. I chose to give the points to someone who doesn't pretend to have one.
0

Featured Post

Free Tool: IP Lookup

Get more info about an IP address or domain name, such as organization, abuse contacts and geolocation.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now