Solved

how to delete the google cache ?

Posted on 2011-03-01
4
355 Views
Last Modified: 2013-11-19
We have developed a site that has a page that is protected by login, the problem is that before stablish the login once there was a test page (html) that used to had the protected content and seems that google indexed it, and now any person can watch it clicking on cache hiperlink

Is there any way to delete the google´s cache?

Thanks in advance.
0
Comment
Question by:dimensionav
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
4 Comments
 
LVL 15

Accepted Solution

by:
WalkaboutTigger earned 500 total points
ID: 35013588
First and foremost, go to https://www.google.com/webmasters/tools/home and setup a Webmaster account for all of the Internet-facing hosts in your domains.

Next, request the removal of the cached entry.

Next, write a robots.txt file (see http://www.robotstxt.org/ for examples and instructions) so that search engines only crawl the URLs you want them to crawl.

It would also appear the security model on your site is deficient if the landing page requires a login but traversing directly to the URL without logging in allows viewing.protected content.  This seems to be the actual root of the problem.  You can check the referring web page in server-side code and if it is not from your domain, have the web server redirect the user to the login page rather than the actual content.  Also, please insure you have custom 404 and 500 errors defined on your web server in all directories so that people get directed at your content instead of error messages.

Which web server are you using?
0
 

Author Comment

by:dimensionav
ID: 35014129
a windows VPS with plesk and asp.net enabled
0
 
LVL 15

Expert Comment

by:WalkaboutTigger
ID: 35014284
So you DEFINITELY want to create custom error messages for 403, 404 and 500 errors, insure your code for each page presentation establishes the referring URL is your server except for the login page and any public information.  Otherwise, it is simply a matter of URL hacking to get to your "protected" data.
0
 
LVL 12

Expert Comment

by:freshcontent
ID: 35018901
If you are looking for the specific directive to add to your robots.txt file, Google talks about it here:

http://googleblog.blogspot.com/2007/02/robots-exclusion-protocol.html 

Here is the specific directive to tell Googe NOT to put it into its cache (from that web page)

<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">

Open in new window

0

Featured Post

The Orion Papers

Are you interested in becoming an AWS Certified Solutions Architect?

Discover a new interactive way of training for the exam.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article will inform Clients about common and important expectations from the freelancers (Experts) who are looking at your Gig.
Although a lot of people devote their energy toward marketing for specific industries, there are some basic principles that can be applied to any sector imaginable. We’ll look at four steps to take and examine how those steps were put into action fo…
This tutorial demonstrates how to identify and create boundary or building outlines in Google Maps. In this example, I outline the boundaries of an enclosed skatepark within a community park.  Login to your Google Account, then  Google for "Google M…
Use Wufoo, an online form creation tool, to make powerful forms. Learn how to choose which pages of your form are visible to your users based on their inputs. The page rules feature provides you with an opportunity to create if:then statements for y…

688 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question