I am trying to restore a website from the Web Archive for a client (the WordPress site was hacked). I am using wget to download the files, which is a bit tricky because it's a WordPress site. It's only three pages, and I just want to download all the HTML, CSS, JS and images for those three pages, but it's proving to be a bit more difficult than that. I was using --accept to specify file types, but a lot of the CSS and JS has a cache-busting version string after the file extension, so wget is skipping those files, e.g. style.css_ver=1.23.
wget --recursive --no-clobber --no-check-certificate --no-directories -P /var/www/site https://web.archive.org/web/20220327200154/http://mysite.com/
Is there a better way I can do this?
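One possible approach, sketched here under assumptions rather than tested against this particular snapshot: instead of filtering by suffix with --accept, filter by URL with --accept-regex, so that a version string after the extension no longer causes a skip, and let --page-requisites pull in what each page needs. The function name `fetch_snapshot_pages`, the variable `ASSET_REGEX`, and the list of extensions are hypothetical and chosen only for illustration; the wget options themselves are standard GNU wget flags.

```shell
#!/bin/sh
# Hypothetical helper, not from the original post.

# POSIX extended regex: an asset extension, optionally followed by a
# cache-busting suffix that starts with "?", "_" or "#"
# (e.g. style.css?ver=1.23 or style.css_ver=1.23).
# The extension list is an assumption; adjust it to what the site uses.
ASSET_REGEX='\.(html?|css|js|png|jpe?g|gif|svg|webp|ico|woff2?)([?_#].*)?$'

# Usage: fetch_snapshot_pages DEST_DIR URL [URL...]
fetch_snapshot_pages() {
  dest="$1"
  shift
  # --page-requisites : fetch the CSS, JS and images each page refers to
  # --convert-links   : rewrite links so the saved pages work locally
  # --adjust-extension: add .html/.css to saved files whose names lack it
  # --accept-regex    : only follow URLs that match ASSET_REGEX
  wget \
    --page-requisites \
    --convert-links \
    --adjust-extension \
    --no-directories \
    --accept-regex "$ASSET_REGEX" \
    --directory-prefix "$dest" \
    "$@"
}

# Example call (commented out so that sourcing this file downloads nothing):
# fetch_snapshot_pages /var/www/site \
#   "https://web.archive.org/web/20220327200154/http://mysite.com/"
```

Since the site is only three pages, passing each page's snapshot URL explicitly avoids --recursive, which on the Web Archive can wander into links to other snapshots. If the regex turns out to be too strict, dropping the --accept-regex line and relying on --page-requisites alone is a reasonable fallback.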
Will try with your regex. Thanks.
I would check whether you have a backup. Even if it is not the most recent one, it could help you get the design back. Ask your web hosting provider; they may have a backup.
I would not recommend using WP or any other CMS for a three-page website.
I would start from scratch, since recovering only the HTML pages will not give you the WP site back: the archive holds the rendered output, not the theme files or the database.
Next time, make sure to take backups, or ask your web hosting provider to set that up for you.
I agree with the part about just starting from scratch. It will be easier to grab the photos by right-clicking and downloading them, if you don't already have them. Then recreate the three pages, even if it is with a different theme. Make it easy on yourself. You could spend ten times as long trying to save the thing as you would just recreating it.