• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 153
  • Last Modified:

Interpreting entry in raw access log

We have the following entry in our log:

 [09/Oct/2014:15:23:20 -0700] "GET /location1/location2/location3/doc1.pdf HTTP/1.1" 404 - "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

This entry is referencing a file that we removed from the server about 4 weeks earlier for security reasons.  In fact, the whole directory was removed.  Trying to get some info on this entry--does it just mean that the Bing Bot was previously aware that the file existed and was just trying to re-verify it or ?

1 Solution
Seth SimmonsSr. Systems AdministratorCommented:
that's what it looks like; trying to verify the link is still valid but returned a 404 so it probably removed it from it's index
A web link to that entry may have previously existed somewhere (and may still exist somewhere even though the file and directory are removed)
Some bots may also construct links ex nihilo just to see if they can dig up something.
I might even construct a link to it in this post, if I could guess an address for the server
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

Featured Post

A proven path to a career in data science

At Springboard, we know how to get you a job in data science. With Springboard’s Data Science Career Track, you’ll master data science  with a curriculum built by industry experts. You’ll work on real projects, and get 1-on-1 mentorship from a data scientist.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now