[Okta Webinar] Learn how to a build a cloud-first strategyRegister Now

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 185
  • Last Modified:

extract data from web

Hi,

collecting data from websites manually is very hard and time consuming, I'm looking for free application to extract data from web and put it in database or file. for example: I need to get the university information (faculties, department, members, contact information .. etc )

I found different application, but it is not free and hard to learn and customized. can you please guide me to find easy, free and powerful application that do the example mentioned above in a short time.

thanks
0
nmokhayesh
Asked:
nmokhayesh
  • 4
  • 3
1 Solution
 
Michel PlungjanIT ExpertCommented:
0
 
Ray PaseurCommented:
Do you want to search the contents of web sites, or do you want to copy the web sites?
0
 
nmokhayeshAuthor Commented:
I need to copy selected contents
for instance
list of all professors names and their contacts (email , tel, website, research interests)
list of all departments and contact information
list of schools and programs discription in each one

thank
 
0
Get your Conversational Ransomware Defense e‑book

This e-book gives you an insight into the ransomware threat and reviews the fundamentals of top-notch ransomware preparedness and recovery. To help you protect yourself and your organization. The initial infection may be inevitable, so the best protection is to be fully prepared.

 
Ray PaseurCommented:
I have used httrack and it worked fairly well to make a copy of the web site onto my hard drive.  Selection of the contents was still the major issue.  Although the local web site was faster than using the internet, you would still have to manually or programmatically isolate the information you wanted to keep.

It might be possible to get a copy of Wrensoft Zoom Indexer and use that to spider the site.  Caveat: I have never tried that on a site that I did not control.

One other possibility might be to contact the site owners and ask if they can isolate this information for you.  Educational institutions are often willing to help with requests like this.
0
 
nmokhayeshAuthor Commented:
OK I need to extract selected data from some university web pages to XML or excel sheet file using web scraping software

can you please tell me which free web scraping application can do this job in easy way. I search it but i got a lot of application but I do not know which one is useful/efficient

Thanks
Naif
0
 
Ray PaseurCommented:
There is no "easy way" because there is no clear vision of what you want to extract.  Each university web site is likely to be a bespoke application, so each such scraping and extraction algorithm will require custom programming.

That is why I recommended that you contact the site owners and ask if they can isolate this information for you.
0
 
nmokhayeshAuthor Commented:
still not solved 100%
0
 
Ray PaseurCommented:
No, it will never get solved 100%, full stop.  Here is what you asked for back in March (how many months ago was that?)

...find easy, free and powerful application that do the example mentioned above in a short time.

It would surprise me if you find easy, free and powerful all in the same package.  Those things are like Ohm's law.  Fix any two variables and the third is determined.
0

Featured Post

Free Tool: Path Explorer

An intuitive utility to help find the CSS path to UI elements on a webpage. These paths are used frequently in a variety of front-end development and QA automation tasks.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

  • 4
  • 3
Tackle projects and never again get stuck behind a technical roadblock.
Join Now