Solved

Detecting google in the referer

Posted on 2009-05-17
4
410 Views
Last Modified: 2012-05-07
I have some code that grabs the referering pages referer url and parses it to detect whether it has the string google in it. However it doesnt seem to detect it. Can some look at my code and give a suggestion on how to make it better.


$referer = $_SERVER['HTTP_REFERER'];
 
  //Did they get here from a search?
  if((preg_match('/www\.google.*/i',$referer) && !preg_match('/^http:\/\/www\.google\.com\//i', $referer))
     || preg_match('/search\.atomz.*/i',$referer)
     || preg_match('/search\.msn.*/i',$referer)
     || preg_match('/search\.yahoo.*/i',$referer)
     || preg_match('/msxml\.excite\.com/i', $referer)
     || preg_match('/search\.lycos\.com/i', $referer)
     || preg_match('/www\.alltheweb\.com/i', $referer)
     || preg_match('/search\.aol\.com/i', $referer)
     || preg_match('/search\.iwon\.com/i', $referer)
     || preg_match('/ask\.com/i', $referer)
     || preg_match('/search\.cometsystems\.com/i', $referer)
     || preg_match('/www\.hotbot\.com/i', $referer)
     || preg_match('/www\.overture\.com/i', $referer)
     || preg_match('/www\.metacrawler\.com/i', $referer)
     || preg_match('/search\.netscape\.com/i', $referer)
     || preg_match('/www\.looksmart\.com/i', $referer)
     || preg_match('/go\.google\.com/i', $referer)
     || preg_match('/dpxml\.webcrawler\.com/i', $referer)
     || preg_match('/search\.earthlink\.net/i', $referer)
     || preg_match('/search\.viewpoint\.com/i', $referer)
     || preg_match('/www\.mamma\.com/i', $referer)
     || preg_match('/home\.bellsouth\.net\/s\/s\.dll/i', $referer)
     || preg_match('/www\.ask\.co\.uk/i', $referer)) {
 
Echo "Search Engine Detected";
 
}
 
Here is the value of the referer varibale.
 
http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8

Open in new window

0
Comment
Question by:MayoorPatel
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 3
4 Comments
 
LVL 110

Expert Comment

by:Ray Paseur
ID: 24407843
Make a function with an array of the things you want to match.  I will post an example in  a few moments. ~Ray
0
 
LVL 110

Accepted Solution

by:
Ray Paseur earned 500 total points
ID: 24407966
HTH, ~Ray
<?php // RAY_check_refer.php
error_reporting(E_ALL);
 
// REQUIRED READING: http://en.wikipedia.org/wiki/List_of_search_engines
 
// FOR THIS TEST CASE EXAMPLE ONLY WE SET THE REFERER TO A KNOWN VALUE
// $_SERVER["HTTP_REFERER"] = 'http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8';
 
 
 
 
// TEST THE REFERER - WILL THROW NOTICE IF UNDEFINED
if (is_search_engine())  echo "    SEARCH ENGINE: {$_SERVER["HTTP_REFERER"]}";
if (!is_search_engine()) echo "NOT SEARCH ENGINE: {$_SERVER["HTTP_REFERER"]}";
 
 
 
// FUNCTION TO CHECK FOR SEARCH ENGINE REFERRAL
function is_search_engine()
{
// IF NOT REFERED
    if (empty($_SERVER["HTTP_REFERER"])) return FALSE;
 
// A LIST OF SELECTED SEARCH ENGINES
    $r = array();
    $r[] = '/google\.*/i';
    $r[] = '/search\.atomz.*/i';
    $r[] = '/search\.msn.*/i';
    $r[] = '/search\.yahoo.*/i';
    $r[] = '/msxml\.excite\.com/i';
    $r[] = '/search\.lycos\.com/i';
    $r[] = '/www\.alltheweb\.com/i';
    $r[] = '/search\.aol\.com/i';
    $r[] = '/search\.iwon\.com/i';
    $r[] = '/ask\.com/i';
    $r[] = '/search\.cometsystems\.com/i';
    $r[] = '/www\.hotbot\.com/i';
    $r[] = '/www\.overture\.com/i';
    $r[] = '/www\.metacrawler\.com/i';
    $r[] = '/search\.netscape\.com/i';
    $r[] = '/www\.looksmart\.com/i';
    $r[] = '/go\.google\.com/i';
    $r[] = '/dpxml\.webcrawler\.com/i';
    $r[] = '/search\.earthlink\.net/i';
    $r[] = '/search\.viewpoint\.com/i';
    $r[] = '/www\.mamma\.com/i';
    $r[] = '/home\.bellsouth\.net\/s\/s\.dll/i';
    $r[] = '/www\.ask\.co\.uk/i';
 
// TEST IF THIS IS A SEARCH ENGINE
    foreach ($r as $regex_string)
    {
        if (preg_match($regex_string, $_SERVER["HTTP_REFERER"])) return TRUE;
    }
 
// NOT A SEARCH ENGINE
    return FALSE;
}

Open in new window

0
 
LVL 1

Author Closing Comment

by:MayoorPatel
ID: 31582435
Excellent
0
 
LVL 110

Expert Comment

by:Ray Paseur
ID: 25144991
Thanks for the points!  It's a good question, ~Ray
0

Featured Post

Online Training Solution

Drastically shorten your training time with WalkMe's advanced online training solution that Guides your trainees to action. Forget about retraining and skyrocket knowledge retention rates.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Things That Drive Us Nuts Have you noticed the use of the reCaptcha feature at EE and other web sites?  It wants you to read and retype something that looks like this. Insanity!  It's not EE's fault - that's just the way reCaptcha works.  But it i…
This article discusses four methods for overlaying images in a container on a web page
The viewer will learn how to look for a specific file type in a local or remote server directory using PHP.
This tutorial will teach you the core code needed to finalize the addition of a watermark to your image. The viewer will use a small PHP class to learn and create a watermark.

738 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question