Go Premium for a chance to win a PS4. Enter to Win

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 151
  • Last Modified:

Interpreting entry in raw access log

We have the following entry in our log:

 [09/Oct/2014:15:23:20 -0700] "GET /location1/location2/location3/doc1.pdf HTTP/1.1" 404 - "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
157.55.39.165

This entry is referencing a file that we removed from the server about 4 weeks earlier for security reasons.  In fact, the whole directory was removed.  Trying to get some info on this entry--does it just mean that the Bing Bot was previously aware that the file existed and was just trying to re-verify it or ?

Thanks.
0
Jason92s
Asked:
Jason92s
1 Solution
 
Seth SimmonsSr. Systems AdministratorCommented:
that's what it looks like; trying to verify the link is still valid but returned a 404 so it probably removed it from it's index
0
 
ozoCommented:
A web link to that entry may have previously existed somewhere (and may still exist somewhere even though the file and directory are removed)
Some bots may also construct links ex nihilo just to see if they can dig up something.
I might even construct a link to it in this post, if I could guess an address for the server
0

Featured Post

Nothing ever in the clear!

This technical paper will help you implement VMware’s VM encryption as well as implement Veeam encryption which together will achieve the nothing ever in the clear goal. If a bad guy steals VMs, backups or traffic they get nothing.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now