• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 440
  • Last Modified:

Detecting google in the referer

I have some code that grabs the referering pages referer url and parses it to detect whether it has the string google in it. However it doesnt seem to detect it. Can some look at my code and give a suggestion on how to make it better.


$referer = $_SERVER['HTTP_REFERER'];
 
  //Did they get here from a search?
  if((preg_match('/www\.google.*/i',$referer) && !preg_match('/^http:\/\/www\.google\.com\//i', $referer))
     || preg_match('/search\.atomz.*/i',$referer)
     || preg_match('/search\.msn.*/i',$referer)
     || preg_match('/search\.yahoo.*/i',$referer)
     || preg_match('/msxml\.excite\.com/i', $referer)
     || preg_match('/search\.lycos\.com/i', $referer)
     || preg_match('/www\.alltheweb\.com/i', $referer)
     || preg_match('/search\.aol\.com/i', $referer)
     || preg_match('/search\.iwon\.com/i', $referer)
     || preg_match('/ask\.com/i', $referer)
     || preg_match('/search\.cometsystems\.com/i', $referer)
     || preg_match('/www\.hotbot\.com/i', $referer)
     || preg_match('/www\.overture\.com/i', $referer)
     || preg_match('/www\.metacrawler\.com/i', $referer)
     || preg_match('/search\.netscape\.com/i', $referer)
     || preg_match('/www\.looksmart\.com/i', $referer)
     || preg_match('/go\.google\.com/i', $referer)
     || preg_match('/dpxml\.webcrawler\.com/i', $referer)
     || preg_match('/search\.earthlink\.net/i', $referer)
     || preg_match('/search\.viewpoint\.com/i', $referer)
     || preg_match('/www\.mamma\.com/i', $referer)
     || preg_match('/home\.bellsouth\.net\/s\/s\.dll/i', $referer)
     || preg_match('/www\.ask\.co\.uk/i', $referer)) {
 
Echo "Search Engine Detected";
 
}
 
Here is the value of the referer varibale.
 
http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8

Open in new window

0
MayoorPatel
Asked:
MayoorPatel
  • 3
1 Solution
 
Ray PaseurCommented:
Make a function with an array of the things you want to match.  I will post an example in  a few moments. ~Ray
0
 
Ray PaseurCommented:
HTH, ~Ray
<?php // RAY_check_refer.php
error_reporting(E_ALL);
 
// REQUIRED READING: http://en.wikipedia.org/wiki/List_of_search_engines
 
// FOR THIS TEST CASE EXAMPLE ONLY WE SET THE REFERER TO A KNOWN VALUE
// $_SERVER["HTTP_REFERER"] = 'http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8';
 
 
 
 
// TEST THE REFERER - WILL THROW NOTICE IF UNDEFINED
if (is_search_engine())  echo "    SEARCH ENGINE: {$_SERVER["HTTP_REFERER"]}";
if (!is_search_engine()) echo "NOT SEARCH ENGINE: {$_SERVER["HTTP_REFERER"]}";
 
 
 
// FUNCTION TO CHECK FOR SEARCH ENGINE REFERRAL
function is_search_engine()
{
// IF NOT REFERED
    if (empty($_SERVER["HTTP_REFERER"])) return FALSE;
 
// A LIST OF SELECTED SEARCH ENGINES
    $r = array();
    $r[] = '/google\.*/i';
    $r[] = '/search\.atomz.*/i';
    $r[] = '/search\.msn.*/i';
    $r[] = '/search\.yahoo.*/i';
    $r[] = '/msxml\.excite\.com/i';
    $r[] = '/search\.lycos\.com/i';
    $r[] = '/www\.alltheweb\.com/i';
    $r[] = '/search\.aol\.com/i';
    $r[] = '/search\.iwon\.com/i';
    $r[] = '/ask\.com/i';
    $r[] = '/search\.cometsystems\.com/i';
    $r[] = '/www\.hotbot\.com/i';
    $r[] = '/www\.overture\.com/i';
    $r[] = '/www\.metacrawler\.com/i';
    $r[] = '/search\.netscape\.com/i';
    $r[] = '/www\.looksmart\.com/i';
    $r[] = '/go\.google\.com/i';
    $r[] = '/dpxml\.webcrawler\.com/i';
    $r[] = '/search\.earthlink\.net/i';
    $r[] = '/search\.viewpoint\.com/i';
    $r[] = '/www\.mamma\.com/i';
    $r[] = '/home\.bellsouth\.net\/s\/s\.dll/i';
    $r[] = '/www\.ask\.co\.uk/i';
 
// TEST IF THIS IS A SEARCH ENGINE
    foreach ($r as $regex_string)
    {
        if (preg_match($regex_string, $_SERVER["HTTP_REFERER"])) return TRUE;
    }
 
// NOT A SEARCH ENGINE
    return FALSE;
}

Open in new window

0
 
MayoorPatelAuthor Commented:
Excellent
0
 
Ray PaseurCommented:
Thanks for the points!  It's a good question, ~Ray
0

Featured Post

Free Tool: ZipGrep

ZipGrep is a utility that can list and search zip (.war, .ear, .jar, etc) archives for text patterns, without the need to extract the archive's contents.

One of a set of tools we're offering as a way to say thank you for being a part of the community.

  • 3
Tackle projects and never again get stuck behind a technical roadblock.
Join Now