Solved

Detecting google in the referer

Posted on 2009-05-17
4
404 Views
Last Modified: 2012-05-07
I have some code that grabs the referering pages referer url and parses it to detect whether it has the string google in it. However it doesnt seem to detect it. Can some look at my code and give a suggestion on how to make it better.


$referer = $_SERVER['HTTP_REFERER'];
 
  //Did they get here from a search?
  if((preg_match('/www\.google.*/i',$referer) && !preg_match('/^http:\/\/www\.google\.com\//i', $referer))
     || preg_match('/search\.atomz.*/i',$referer)
     || preg_match('/search\.msn.*/i',$referer)
     || preg_match('/search\.yahoo.*/i',$referer)
     || preg_match('/msxml\.excite\.com/i', $referer)
     || preg_match('/search\.lycos\.com/i', $referer)
     || preg_match('/www\.alltheweb\.com/i', $referer)
     || preg_match('/search\.aol\.com/i', $referer)
     || preg_match('/search\.iwon\.com/i', $referer)
     || preg_match('/ask\.com/i', $referer)
     || preg_match('/search\.cometsystems\.com/i', $referer)
     || preg_match('/www\.hotbot\.com/i', $referer)
     || preg_match('/www\.overture\.com/i', $referer)
     || preg_match('/www\.metacrawler\.com/i', $referer)
     || preg_match('/search\.netscape\.com/i', $referer)
     || preg_match('/www\.looksmart\.com/i', $referer)
     || preg_match('/go\.google\.com/i', $referer)
     || preg_match('/dpxml\.webcrawler\.com/i', $referer)
     || preg_match('/search\.earthlink\.net/i', $referer)
     || preg_match('/search\.viewpoint\.com/i', $referer)
     || preg_match('/www\.mamma\.com/i', $referer)
     || preg_match('/home\.bellsouth\.net\/s\/s\.dll/i', $referer)
     || preg_match('/www\.ask\.co\.uk/i', $referer)) {
 
Echo "Search Engine Detected";
 
}
 
Here is the value of the referer varibale.
 
http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8

Open in new window

0
Comment
Question by:MayoorPatel
  • 3
4 Comments
 
LVL 109

Expert Comment

by:Ray Paseur
ID: 24407843
Make a function with an array of the things you want to match.  I will post an example in  a few moments. ~Ray
0
 
LVL 109

Accepted Solution

by:
Ray Paseur earned 500 total points
ID: 24407966
HTH, ~Ray
<?php // RAY_check_refer.php
error_reporting(E_ALL);
 
// REQUIRED READING: http://en.wikipedia.org/wiki/List_of_search_engines
 
// FOR THIS TEST CASE EXAMPLE ONLY WE SET THE REFERER TO A KNOWN VALUE
// $_SERVER["HTTP_REFERER"] = 'http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8';
 
 
 
 
// TEST THE REFERER - WILL THROW NOTICE IF UNDEFINED
if (is_search_engine())  echo "    SEARCH ENGINE: {$_SERVER["HTTP_REFERER"]}";
if (!is_search_engine()) echo "NOT SEARCH ENGINE: {$_SERVER["HTTP_REFERER"]}";
 
 
 
// FUNCTION TO CHECK FOR SEARCH ENGINE REFERRAL
function is_search_engine()
{
// IF NOT REFERED
    if (empty($_SERVER["HTTP_REFERER"])) return FALSE;
 
// A LIST OF SELECTED SEARCH ENGINES
    $r = array();
    $r[] = '/google\.*/i';
    $r[] = '/search\.atomz.*/i';
    $r[] = '/search\.msn.*/i';
    $r[] = '/search\.yahoo.*/i';
    $r[] = '/msxml\.excite\.com/i';
    $r[] = '/search\.lycos\.com/i';
    $r[] = '/www\.alltheweb\.com/i';
    $r[] = '/search\.aol\.com/i';
    $r[] = '/search\.iwon\.com/i';
    $r[] = '/ask\.com/i';
    $r[] = '/search\.cometsystems\.com/i';
    $r[] = '/www\.hotbot\.com/i';
    $r[] = '/www\.overture\.com/i';
    $r[] = '/www\.metacrawler\.com/i';
    $r[] = '/search\.netscape\.com/i';
    $r[] = '/www\.looksmart\.com/i';
    $r[] = '/go\.google\.com/i';
    $r[] = '/dpxml\.webcrawler\.com/i';
    $r[] = '/search\.earthlink\.net/i';
    $r[] = '/search\.viewpoint\.com/i';
    $r[] = '/www\.mamma\.com/i';
    $r[] = '/home\.bellsouth\.net\/s\/s\.dll/i';
    $r[] = '/www\.ask\.co\.uk/i';
 
// TEST IF THIS IS A SEARCH ENGINE
    foreach ($r as $regex_string)
    {
        if (preg_match($regex_string, $_SERVER["HTTP_REFERER"])) return TRUE;
    }
 
// NOT A SEARCH ENGINE
    return FALSE;
}

Open in new window

0
 
LVL 1

Author Closing Comment

by:MayoorPatel
ID: 31582435
Excellent
0
 
LVL 109

Expert Comment

by:Ray Paseur
ID: 25144991
Thanks for the points!  It's a good question, ~Ray
0

Featured Post

Free Tool: SSL Checker

Scans your site and returns information about your SSL implementation and certificate. Helpful for debugging and validating your SSL configuration.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Build an array called $myWeek which will hold the array elements Today, Yesterday and then builds up the rest of the week by the name of the day going back 1 week.   (CODE) (CODE) Then you just need to pass your date to the function. If i…
This article discusses four methods for overlaying images in a container on a web page
The viewer will learn how to look for a specific file type in a local or remote server directory using PHP.
The viewer will learn how to create a basic form using some HTML5 and PHP for later processing. Set up your basic HTML file. Open your form tag and set the method and action attributes.: (CODE) Set up your first few inputs one for the name and …

828 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question