Go Premium for a chance to win a PS4. Enter to Win

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 598
  • Last Modified:

Extract site content

One of my clients currently has a site that is rather large with many pages, links to pdfs and external links.

They are currently using a CMS called MySource Matrix. I am redeveloping the new site (no CMS) on a dedicated server. I do have access to the back end of the current CMS but this thing is impossible to extract anything meaningful from. I also have no root access or any access to the server itself. It's a complete mess.

So, my question is... Is there any way to extract or build a hierarchy of each and every page (in essence a site map) and also extract all linked PDFs (hopefully maintaining some form of link to the parent page)?

Is there any method or software package to perform such a task?

I am desperate. Please, any ideas at all.

Thanks
0
rgoggins
Asked:
rgoggins
  • 3
  • 2
1 Solution
 
paololabeCommented:
I think you could use an utility or library to generate a sitemap.xml and parse it to extract pdf link

 
0
 
rgogginsAuthor Commented:
Thanks paololabe. Could you elaborate a little more on your suggestion?
0
 
rgogginsAuthor Commented:
Anyone?
0
NFR key for Veeam Backup for Microsoft Office 365

Veeam is happy to provide a free NFR license (for 1 year, up to 10 users). This license allows for the non‑production use of Veeam Backup for Microsoft Office 365 in your home lab without any feature limitations.

 
selvolCommented:
Offline Explorer Enterprise.

Will do all that and MORE.
0
 
selvolCommented:
Like I stated. Metaproducts Offline Explorer Is an Exelent Data Extractor.

I have used it many times. Doing almost exactly what you need to to.
This app is not a cheap program as many are.
Best thing is they have a Free 30 trial. I believe it is Unrestricted.

http://dl.filekicker.com/send/file/167627-YJTV/eesetup.exe
0
 
rgogginsAuthor Commented:
Thank you selvol.

OEE is absolutely perfect for what I need.

Thanks again,
Rob
0

Featured Post

Free Tool: Path Explorer

An intuitive utility to help find the CSS path to UI elements on a webpage. These paths are used frequently in a variety of front-end development and QA automation tasks.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

  • 3
  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now