Solved

Interpreting entry in raw access log

Posted on 2014-10-30
2
143 Views
Last Modified: 2014-10-30
We have the following entry in our log:

 [09/Oct/2014:15:23:20 -0700] "GET /location1/location2/location3/doc1.pdf HTTP/1.1" 404 - "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
157.55.39.165

This entry is referencing a file that we removed from the server about 4 weeks earlier for security reasons.  In fact, the whole directory was removed.  Trying to get some info on this entry--does it just mean that the Bing Bot was previously aware that the file existed and was just trying to re-verify it or ?

Thanks.
0
Comment
Question by:Jason92s
2 Comments
 
LVL 34

Accepted Solution

by:
Seth Simmons earned 500 total points
ID: 40413761
that's what it looks like; trying to verify the link is still valid but returned a 404 so it probably removed it from it's index
0
 
LVL 84

Expert Comment

by:ozo
ID: 40414654
A web link to that entry may have previously existed somewhere (and may still exist somewhere even though the file and directory are removed)
Some bots may also construct links ex nihilo just to see if they can dig up something.
I might even construct a link to it in this post, if I could guess an address for the server
0

Featured Post

Gigs: Get Your Project Delivered by an Expert

Select from freelancers specializing in everything from database administration to programming, who have proven themselves as experts in their field. Hire the best, collaborate easily, pay securely and get projects done right.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Article by: kevp75
Hey folks, 'bout time for me to come around with a little tip. Thanks to IIS 7.5 Extensions and Microsoft (well... really Windows 8, and IIS 8 I guess...), we can now prime our Application Pools, when IIS starts. Now, though it would be nice t…
If you've heard about htaccess and it sounds like it does what you want, but you're not sure how it works... well, you're in the right place. Read on. Some Basics #1. It's a file and its filename is .htaccess (yes, with a dot in the front). #…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
Connecting to an Amazon Linux EC2 Instance from Windows Using PuTTY.

785 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question