Solved

Can robots see content I don't want them to?

Posted on 2008-06-26
3
207 Views
Last Modified: 2008-06-28
I have a site built in PHP / MySQL.  On any given page, some content is displayed, while the rest remains hidden until the user logs in.  Right now, I have a meta tag with noindex set, but I don't think that's really what I want.  I don't want search engines and robots to be able to "find" the content that should only be displayed after logging in.  If I remove this meta tag, will the robots only see the free content?
0
Comment
Question by:n00b0101
3 Comments
 
LVL 24

Accepted Solution

by:
glcummins earned 500 total points
ID: 21874657
Robots are constrained by the same security model that prevents unauthorized users from viewing your paid content. If you have built your security model correctly, robots will not be able to access it.

Please note, however, that if your content is accessible without authorization, robots may do as they wish, regardless of the contents of your exclusions listed in your meta tags or robots.txt. Those tags are merely suggestions, not law. Improperly-written robots can ignore them entirely, if they wish.

The best possible conclusion is to simply ensure that your security is tight, and only your authorized users will be able to see your paid content.
0
 
LVL 3

Expert Comment

by:darkmooink
ID: 21874668
spiders can only see what you send to them so if you have a php if statement around some code and the if says no then the spider cant see it because you are not sending it to them
so in other words, go to the page, not logged in, right click it and view source and that is what the spider will see.
by spiders i mean robots
0
 
LVL 1

Expert Comment

by:jsteenkist
ID: 21874782
you can make use of robots.txt, but some will ingnore this.

robots.txt should be localed in the root of your website.

A "robots.txt" file is a text file placed on your server which contains a list of robots and "disallows" for those robots. Each disallow will prevent any address that starts with the disallowed string from being accessed.

ere's an example:

   user-agent: FreeFind
   disallow: /mysite/test/
   disallow: /mysite/cgi-bin/post.cgi?action=reply
   disallow: /a

In this example the following addresses would be ignored by the spider:

   http://adomain.com/mysite/test/index.html
   http://adomain.com/mysite/cgi-bin/post.cgi?action=reply&id=1
   http://adomain.com/mysite/cgi-bin/post.cgi?action=replytome
   http://adomain.com/abc.html

and the following ones would be allowed:

   http://adomain.com/mysite/test.html
   http://adomain.com/mysite/cgi-bin/post.cgi?action=edit
   http://adomain.com/mysite/cgi-bin/post.cgi
   http://adomain.com/bbc.html

It is also possible to use an "allow" in addition to disallows. For example:

   user-agent: FreeFind
   disallow: /cgi-bin/
   allow: /cgi-bin/Ultimate.cgi
   allow: /cgi-bin/forumdisplay.cgi
0

Featured Post

Enterprise Mobility and BYOD For Dummies

Like “For Dummies” books, you can read this in whatever order you choose and learn about mobility and BYOD; and how to put a competitive mobile infrastructure in place. Developed for SMBs and large enterprises alike, you will find helpful use cases, planning, and implementation.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
phpmyadmin 3 33
Apply tab index in forms 6 33
Could you point how to give Bootstrap's open/ close menu effect ? 25 32
Scope of $_SESSION 17 27
Building a website can seem like a daunting task to the uninitiated but it really only requires knowledge of two basic languages: HTML and CSS.
Since pre-biblical times, humans have sought ways to keep secrets, and share the secrets selectively.  This article explores the ways PHP can be used to hide and encrypt information.
This tutorial will teach you the core code needed to finalize the addition of a watermark to your image. The viewer will use a small PHP class to learn and create a watermark.
Learn how to create flexible layouts using relative units in CSS.  New relative units added in CSS3 include vw(viewports width), vh(viewports height), vmin(minimum of viewports height and width), and vmax (maximum of viewports height and width).

910 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

21 Experts available now in Live!

Get 1:1 Help Now