GoogleBot looking for a 'function.require' file or directory

Asked by codeQuantum

Hi,

I have a custom 404 page that sends me an email (using the PHP mail function) whenever a file is not found on my server. Today I received this:

Sat Jun 21 2008 1:35:20 pm CDT
66.249.65.99 // crawl-66-249-65-99.googlebot.com tried to load http://www.mydomain.com/function.require
User Agent /Browser = Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)

GoogleBot seems to be looking for a file called "function.require" on my website. Can anybody explain what is going on?

I have two such errors; the second one was for "function.main" instead of "function.require".
codeQuantum

ASKER

This morning I got error reports showing that the MSN bot is doing the same thing as GoogleBot:

65.55.212.240 // livebot-65-55-212-240.search.live.com tried to load http://www.louisst-jean.com/function.require
User Agent /Browser = msnbot-media/1.0 (+http://search.msn.com/msnbot.htm)

And:

65.55.212.240 // livebot-65-55-212-240.search.live.com tried to load http://www.louisst-jean.com/function.main
User Agent /Browser = msnbot-media/1.0 (+http://search.msn.com/msnbot.htm)

I am starting to think that some code on my site is misleading the spiders into believing that links such as function.main and function.require exist. I searched my code for "function.main" (using Dreamweaver's site-wide code search), and this text appears nowhere in it.

How do I fix this?
SOLUTION
Michel Plungjan

This solution is only available to members.
I have my doubts about that theory.

1) I don't have any PHP require in my code. I only use the require_once() function. (I verified a minute ago, so I am sure of that.)
2) I am aware that PHP can produce an error message containing function.require. But why would the spiders treat that as a URL?

There are require functions in JavaScript too. By the way, I am using Google Analytics, script.aculo.us, Prototype, and Lightbox on my site.
ASKER CERTIFIED SOLUTION
This solution is only available to members.
Wow. So the spider is visiting a page... where it is getting a MySQL error... and it indexed the function.require link and is now trying to read it...

Mystery solved, thanks.
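
For anyone hitting the same reports: the conclusion above fits how PHP formats errors when html_errors is enabled and errors are displayed in the page. Each message then carries a relative link to the manual entry for the failing function (for example a link whose target is function.require), and a crawler that fetches the broken page can follow that link as if it were a page on the site. The fragment below is only a sketch of settings that would stop such links from being served; the directive names are standard php.ini directives, but the values are assumptions for a production site and are not taken from the hidden solutions in this thread.

```ini
; php.ini fragment (assumed production values, not from the thread)

; Do not print error messages into the pages that spiders fetch.
display_errors = Off

; Keep a record of errors in the server's error log instead.
log_errors = On

; Emit plain-text messages, without manual links such as function.require.
html_errors = Off
```

This only hides the symptom. The underlying error on the page the spider visited still has to be found and fixed, and the error log is the place to look for it.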