Solved

Detecting google in the referer

Posted on 2009-05-17
4
397 Views
Last Modified: 2012-05-07
I have some code that grabs the referering pages referer url and parses it to detect whether it has the string google in it. However it doesnt seem to detect it. Can some look at my code and give a suggestion on how to make it better.


$referer = $_SERVER['HTTP_REFERER'];
 

  //Did they get here from a search?

  if((preg_match('/www\.google.*/i',$referer) && !preg_match('/^http:\/\/www\.google\.com\//i', $referer))

     || preg_match('/search\.atomz.*/i',$referer)

     || preg_match('/search\.msn.*/i',$referer)

     || preg_match('/search\.yahoo.*/i',$referer)

     || preg_match('/msxml\.excite\.com/i', $referer)

     || preg_match('/search\.lycos\.com/i', $referer)

     || preg_match('/www\.alltheweb\.com/i', $referer)

     || preg_match('/search\.aol\.com/i', $referer)

     || preg_match('/search\.iwon\.com/i', $referer)

     || preg_match('/ask\.com/i', $referer)

     || preg_match('/search\.cometsystems\.com/i', $referer)

     || preg_match('/www\.hotbot\.com/i', $referer)

     || preg_match('/www\.overture\.com/i', $referer)

     || preg_match('/www\.metacrawler\.com/i', $referer)

     || preg_match('/search\.netscape\.com/i', $referer)

     || preg_match('/www\.looksmart\.com/i', $referer)

     || preg_match('/go\.google\.com/i', $referer)

     || preg_match('/dpxml\.webcrawler\.com/i', $referer)

     || preg_match('/search\.earthlink\.net/i', $referer)

     || preg_match('/search\.viewpoint\.com/i', $referer)

     || preg_match('/www\.mamma\.com/i', $referer)

     || preg_match('/home\.bellsouth\.net\/s\/s\.dll/i', $referer)

     || preg_match('/www\.ask\.co\.uk/i', $referer)) {
 

Echo "Search Engine Detected";
 

}
 

Here is the value of the referer varibale.
 

http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8

Open in new window

0
Comment
Question by:MayoorPatel
  • 3
4 Comments
 
LVL 108

Expert Comment

by:Ray Paseur
Comment Utility
Make a function with an array of the things you want to match.  I will post an example in  a few moments. ~Ray
0
 
LVL 108

Accepted Solution

by:
Ray Paseur earned 500 total points
Comment Utility
HTH, ~Ray
<?php // RAY_check_refer.php

error_reporting(E_ALL);
 

// REQUIRED READING: http://en.wikipedia.org/wiki/List_of_search_engines
 

// FOR THIS TEST CASE EXAMPLE ONLY WE SET THE REFERER TO A KNOWN VALUE

// $_SERVER["HTTP_REFERER"] = 'http://www.google.com/search?client=safari&rls=en-us&q=Danbury+CT+chive&ie=UTF-8&oe=UTF-8';
 
 
 
 

// TEST THE REFERER - WILL THROW NOTICE IF UNDEFINED

if (is_search_engine())  echo "    SEARCH ENGINE: {$_SERVER["HTTP_REFERER"]}";

if (!is_search_engine()) echo "NOT SEARCH ENGINE: {$_SERVER["HTTP_REFERER"]}";
 
 
 

// FUNCTION TO CHECK FOR SEARCH ENGINE REFERRAL

function is_search_engine()

{

// IF NOT REFERED

    if (empty($_SERVER["HTTP_REFERER"])) return FALSE;
 

// A LIST OF SELECTED SEARCH ENGINES

    $r = array();

    $r[] = '/google\.*/i';

    $r[] = '/search\.atomz.*/i';

    $r[] = '/search\.msn.*/i';

    $r[] = '/search\.yahoo.*/i';

    $r[] = '/msxml\.excite\.com/i';

    $r[] = '/search\.lycos\.com/i';

    $r[] = '/www\.alltheweb\.com/i';

    $r[] = '/search\.aol\.com/i';

    $r[] = '/search\.iwon\.com/i';

    $r[] = '/ask\.com/i';

    $r[] = '/search\.cometsystems\.com/i';

    $r[] = '/www\.hotbot\.com/i';

    $r[] = '/www\.overture\.com/i';

    $r[] = '/www\.metacrawler\.com/i';

    $r[] = '/search\.netscape\.com/i';

    $r[] = '/www\.looksmart\.com/i';

    $r[] = '/go\.google\.com/i';

    $r[] = '/dpxml\.webcrawler\.com/i';

    $r[] = '/search\.earthlink\.net/i';

    $r[] = '/search\.viewpoint\.com/i';

    $r[] = '/www\.mamma\.com/i';

    $r[] = '/home\.bellsouth\.net\/s\/s\.dll/i';

    $r[] = '/www\.ask\.co\.uk/i';
 

// TEST IF THIS IS A SEARCH ENGINE

    foreach ($r as $regex_string)

    {

        if (preg_match($regex_string, $_SERVER["HTTP_REFERER"])) return TRUE;

    }
 

// NOT A SEARCH ENGINE

    return FALSE;

}

Open in new window

0
 
LVL 1

Author Closing Comment

by:MayoorPatel
Comment Utility
Excellent
0
 
LVL 108

Expert Comment

by:Ray Paseur
Comment Utility
Thanks for the points!  It's a good question, ~Ray
0

Featured Post

Free Trending Threat Insights Every Day

Enhance your security with threat intelligence from the web. Get trending threat insights on hackers, exploits, and suspicious IP addresses delivered to your inbox with our free Cyber Daily.

Join & Write a Comment

I imagine that there are some, like me, who require a way of getting currency exchange rates for implementation in web project from time to time, so I thought I would share a solution that I have developed for this purpose. It turns out that Yaho…
These days socially coordinated efforts have turned into a critical requirement for enterprises.
The viewer will learn how to count occurrences of each item in an array.
The viewer will learn how to create and use a small PHP class to apply a watermark to an image. This video shows the viewer the setup for the PHP watermark as well as important coding language. Continue to Part 2 to learn the core code used in creat…

744 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

15 Experts available now in Live!

Get 1:1 Help Now