Solved

Detecting google in the referer

Posted on 2009-05-17
Last Modified: 2012-05-07
I have some code that grabs the referring page's URL and checks whether it contains the string "google". However, it doesn't seem to detect it. Can someone look at my code and suggest how to make it better?


$referer = $_SERVER['HTTP_REFERER'];
 
  //Did they get here from a search?
  if((preg_match('/www\.google.*/i',$referer) && !preg_match('/^http:\/\/www\.google\.com\//i', $referer))
     || preg_match('/search\.atomz.*/i',$referer)
     || preg_match('/search\.msn.*/i',$referer)
     || preg_match('/search\.yahoo.*/i',$referer)
     || preg_match('/msxml\.excite\.com/i', $referer)
     || preg_match('/search\.lycos\.com/i', $referer)
     || preg_match('/www\.alltheweb\.com/i', $referer)
     || preg_match('/search\.aol\.com/i', $referer)
     || preg_match('/search\.iwon\.com/i', $referer)
     || preg_match('/ask\.com/i', $referer)
     || preg_match('/search\.cometsystems\.com/i', $referer)
     || preg_match('/www\.hotbot\.com/i', $referer)
     || preg_match('/www\.overture\.com/i', $referer)
     || preg_match('/www\.metacrawler\.com/i', $referer)
     || preg_match('/search\.netscape\.com/i', $referer)
     || preg_match('/www\.looksmart\.com/i', $referer)
     || preg_match('/go\.google\.com/i', $referer)
     || preg_match('/dpxml\.webcrawler\.com/i', $referer)
     || preg_match('/search\.earthlink\.net/i', $referer)
     || preg_match('/search\.viewpoint\.com/i', $referer)
     || preg_match('/www\.mamma\.com/i', $referer)
     || preg_match('/home\.bellsouth\.net\/s\/s\.dll/i', $referer)
     || preg_match('/www\.ask\.co\.uk/i', $referer)) {
 
Echo "Search Engine Detected";
 
}
 
Here is the value of the referer variable.
 
http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8
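
For readers following along, here is a minimal sketch of how the first clause of the condition behaves with the referer posted above. The variable names $mentions_google, $starts_with_google_com and $google_clause are made up for illustration. It suggests that the negated second preg_match() is what rejects any referer starting with http://www.google.com/, and none of the other patterns in the list match this URL either.

```php
<?php
// Sketch only: reproduces the first clause of the condition with the posted referer.
$referer = 'http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8';

// First half: does the referer mention www.google anywhere?
$mentions_google = preg_match('/www\.google.*/i', $referer) === 1;

// Second half: does the referer start with http://www.google.com/ ?
$starts_with_google_com = preg_match('/^http:\/\/www\.google\.com\//i', $referer) === 1;

// The original clause needs the first test to pass AND the second to fail.
$google_clause = $mentions_google && !$starts_with_google_com;

var_dump($mentions_google);        // bool(true)
var_dump($starts_with_google_com); // bool(true)
var_dump($google_clause);          // bool(false)
```

Dropping the negated test, or matching on "google" alone, would let this referer through.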

Question by:MayoorPatel
4 Comments
 
LVL 110

Expert Comment

by:Ray Paseur
ID: 24407843
Make a function with an array of the things you want to match. I will post an example in a few moments. ~Ray
 
LVL 110

Accepted Solution

by:
Ray Paseur earned 500 total points
ID: 24407966
HTH, ~Ray
<?php // RAY_check_refer.php
error_reporting(E_ALL);
 
// REQUIRED READING: http://en.wikipedia.org/wiki/List_of_search_engines
 
// FOR THIS TEST CASE EXAMPLE ONLY WE SET THE REFERER TO A KNOWN VALUE
// $_SERVER["HTTP_REFERER"] = 'http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8';
 
 
 
 
// TEST THE REFERER - USE A PLACEHOLDER SO AN UNDEFINED REFERER DOES NOT THROW A NOTICE
$referer = isset($_SERVER["HTTP_REFERER"]) ? $_SERVER["HTTP_REFERER"] : '(none)';
if (is_search_engine()) echo "    SEARCH ENGINE: $referer";
else                    echo "NOT SEARCH ENGINE: $referer";
 
 
 
// FUNCTION TO CHECK FOR SEARCH ENGINE REFERRAL
function is_search_engine()
{
// IF NOT REFERRED
    if (empty($_SERVER["HTTP_REFERER"])) return FALSE;
 
// A LIST OF SELECTED SEARCH ENGINES
    $r = array();
    $r[] = '/google\./i';
    $r[] = '/search\.atomz.*/i';
    $r[] = '/search\.msn.*/i';
    $r[] = '/search\.yahoo.*/i';
    $r[] = '/msxml\.excite\.com/i';
    $r[] = '/search\.lycos\.com/i';
    $r[] = '/www\.alltheweb\.com/i';
    $r[] = '/search\.aol\.com/i';
    $r[] = '/search\.iwon\.com/i';
    $r[] = '/ask\.com/i';
    $r[] = '/search\.cometsystems\.com/i';
    $r[] = '/www\.hotbot\.com/i';
    $r[] = '/www\.overture\.com/i';
    $r[] = '/www\.metacrawler\.com/i';
    $r[] = '/search\.netscape\.com/i';
    $r[] = '/www\.looksmart\.com/i';
    $r[] = '/go\.google\.com/i';
    $r[] = '/dpxml\.webcrawler\.com/i';
    $r[] = '/search\.earthlink\.net/i';
    $r[] = '/search\.viewpoint\.com/i';
    $r[] = '/www\.mamma\.com/i';
    $r[] = '/home\.bellsouth\.net\/s\/s\.dll/i';
    $r[] = '/www\.ask\.co\.uk/i';
 
// TEST IF THIS IS A SEARCH ENGINE
    foreach ($r as $regex_string)
    {
        if (preg_match($regex_string, $_SERVER["HTTP_REFERER"])) return TRUE;
    }
 
// NOT A SEARCH ENGINE
    return FALSE;
}
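
One possible refinement, offered as a sketch and not as part of the accepted solution: the patterns above are tested against the whole referer, so a page such as http://example.com/?q=google.com or a host such as flask.com (which contains "ask.com") would also count as a search engine. The hypothetical helper below, referer_host_is_search_engine(), first extracts the host with parse_url() and anchors each pattern to the end of the host. The pattern list is an illustrative subset, and the Google pattern assumes domains shaped like google.com, google.de, google.co.uk or google.com.au.

```php
<?php
// HYPOTHETICAL HELPER - NAME AND PATTERN LIST ARE ASSUMPTIONS FOR ILLUSTRATION
function referer_host_is_search_engine($referer)
{
    // parse_url() GIVES NULL OR FALSE WHEN THERE IS NO USABLE HOST
    $host = parse_url($referer, PHP_URL_HOST);
    if (empty($host)) return FALSE;
    $host = strtolower($host);

    // EACH PATTERN MUST MATCH AT A LABEL BOUNDARY AND RUN TO THE END OF THE HOST
    $patterns = array(
        '/(^|\.)google\.(com|co\.[a-z]{2}|com\.[a-z]{2}|[a-z]{2})$/',
        '/(^|\.)search\.yahoo\.com$/',
        '/(^|\.)ask\.com$/',
    );

    foreach ($patterns as $pattern)
    {
        if (preg_match($pattern, $host)) return TRUE;
    }
    return FALSE;
}

// USAGE
var_dump(referer_host_is_search_engine('http://www.google.com/search?q=Danbury+CT+chive')); // bool(true)
var_dump(referer_host_is_search_engine('http://example.com/?q=google.com'));                // bool(false)
var_dump(referer_host_is_search_engine('http://flask.com/'));                               // bool(false)
```

The trade-off is that every engine needs an explicit host pattern, so the list has to be maintained as engines come and go.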

 
LVL 1

Author Closing Comment

by:MayoorPatel
ID: 31582435
Excellent
 
LVL 110

Expert Comment

by:Ray Paseur
ID: 25144991
Thanks for the points!  It's a good question, ~Ray
