  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 555

Is scraping a website allowed?

Is it okay to scrape a website for data?

It seems to me that if the data is publicly available, and even the page's source code is visible through the View Source option, then scraping the website should also be allowed.  But I'd like to hear what others think about this first.

What could the website do to block scraping?  Legally?  Technically?  

Would they need to block so that View Source fails?  Is that even possible?

I'm curious...

Thanks,
newbieweb

4 Solutions
 
mattibutt Commented:
Even if you scrap the website, Google caches it, so it stays available even when it is no longer coming from you. Blocking View Source is not possible with HTML.
If you are using a Java applet or Flash, the content will not be visible in the page source, and there are some other applications like that. There are techniques to stop the browser from saving the file locally, but you would have to program this for every single browser that exists.
Server-side source code is never visible in the browser. For instance, if you use Java servlets, PHP, .NET, or Ruby, nothing is publicly available unless someone hacks into the web server. I am still confused about what you are trying to achieve.
 
ghemstrom Commented:
The webpage and its supporting data are protected by the publisher's copyright, which means that you may read it but not republish it without the publisher's consent. Even linking to their page should not be done without consent unless it is evident that linking is allowed...
 
newbieweb (Author) Commented:
I am not trying to block my own website from being scraped; I am trying to find out whether I am allowed to scrape other websites.  There are websites with data I want to import.

I wonder if that is allowed, and if it's possible just by reading the HTML.
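
On the "just by reading the HTML" part: technically yes, whatever View Source shows can be read by a program too. Below is a minimal sketch using only Python's standard library. The `TableCellExtractor` class, the `extract_rows` helper, and the sample markup are all made up for illustration; a real page will have a different structure, and this says nothing about whether scraping a given site is permitted.

```python
from html.parser import HTMLParser


class TableCellExtractor(HTMLParser):
    """Collects the text of every <td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []          # finished rows, each a list of cell strings
        self._row = None        # row currently being built, or None
        self._in_cell = False   # True while inside a <td>
        self._cell = []         # text fragments of the current cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._in_cell = True
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._in_cell:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._in_cell:
            self._row.append("".join(self._cell).strip())
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Hypothetical markup standing in for a downloaded page.
SAMPLE_HTML = """
<table>
  <tr><td>Widget</td><td>4.50</td></tr>
  <tr><td>Gadget</td><td>12.00</td></tr>
</table>
"""


def extract_rows(html):
    """Return the table rows found in an HTML string."""
    parser = TableCellExtractor()
    parser.feed(html)
    parser.close()
    return parser.rows


rows = extract_rows(SAMPLE_HTML)
```

The example parses a string that is already in memory on purpose, so it only shows the extraction step and does not fetch anything from a live site.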
 
mattibutt Commented:
You need the author's permission prior to using their content if they have forbidden its use; if they have not, you can use it.
 
ghemstrom Commented:
I would say that you must obtain permission if it is not explicitly given on the webpage.
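
One machine-readable signal a site can publish about automated access is a robots.txt file. This thread does not mention it, and it is not a legal permission or a substitute for the publisher's consent or the site's terms, but it is a common first check before fetching anything. Here is a minimal sketch with Python's standard `urllib.robotparser`; the rules in `SAMPLE_ROBOTS`, the `is_allowed` helper, the bot name, and the example.com URLs are all assumptions for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; a real site publishes its own rules.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


public_ok = is_allowed(SAMPLE_ROBOTS, "MyBot", "https://example.com/public/page.html")
private_ok = is_allowed(SAMPLE_ROBOTS, "MyBot", "https://example.com/private/data.html")
```

Against a live site you would normally use `set_url()` and `read()` on the parser to load the real file instead of a hard-coded string.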
 
newbieweb (Author) Commented:
Thanks.