Solved

Interpreting entry in raw access log

Posted on 2014-10-30
Last Modified: 2014-10-30
We have the following entry in our log:

 [09/Oct/2014:15:23:20 -0700] "GET /location1/location2/location3/doc1.pdf HTTP/1.1" 404 - "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
157.55.39.165

This entry references a file that we removed from the server about 4 weeks earlier for security reasons. In fact, the whole directory was removed. I'm trying to get some info on this entry. Does it just mean that the Bing Bot was previously aware that the file existed and was trying to re-verify it, or is something else going on?

Thanks.
Question by:Jason92s
2 Comments
 
LVL 35

Accepted Solution

by:
Seth Simmons earned 500 total points
ID: 40413761
That's what it looks like: the bot was trying to verify that the link is still valid, but the request returned a 404, so it has probably removed the page from its index.
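To make the fields in the quoted entry easier to read, here is a minimal Python sketch that splits it into named parts. It assumes the line follows the usual Apache "combined" layout (timestamp, request, status, bytes sent, referer, user agent); the thread does not say which server produced the log, and `LOG_PATTERN` and `parse_entry` are names made up for this illustration. The client IP appears on a separate line in the question, so it is not part of the pattern.

```python
import re

# Assumed layout: [timestamp] "request" status bytes "referer" "user-agent"
LOG_PATTERN = re.compile(
    r'\[(?P<time>[^\]]+)\]\s+'
    r'"(?P<method>\S+)\s+(?P<path>\S+)\s+(?P<protocol>[^"]+)"\s+'
    r'(?P<status>\d{3})\s+(?P<size>\S+)\s+'
    r'"(?P<referer>[^"]*)"\s+"(?P<agent>[^"]*)"'
)


def parse_entry(line):
    """Return the fields of one log entry as a dict, or None if it does not match."""
    match = LOG_PATTERN.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["status"] = int(fields["status"])  # compare status codes as numbers
    return fields


# The entry quoted in the question, split across two string literals for width.
entry = parse_entry(
    '[09/Oct/2014:15:23:20 -0700] '
    '"GET /location1/location2/location3/doc1.pdf HTTP/1.1" '
    '404 - "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"'
)

for name, value in entry.items():
    print(f"{name}: {value}")
```

Read this way, the entry is a GET for the removed PDF that was answered with status 404. The "-" in the size field means no response body was sent, and the "-" referer means the client did not send a Referer header, which is normal for a crawler.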
 
LVL 84

Expert Comment

by:ozo
ID: 40414654
A web link to that file may have previously existed somewhere (and may still exist somewhere, even though the file and directory have been removed).
Some bots may also construct links ex nihilo just to see if they can dig up something.
I might even construct a link to it in this post, if I could guess an address for the server.
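Since the user-agent string in a log entry is supplied by the client, it cannot by itself show which bot made the request. A common way to check is a reverse DNS lookup on the client IP followed by a forward lookup on the resulting hostname. The sketch below is only an illustration: `verify_crawler` and `CRAWLER_DOMAINS` are hypothetical names, and the hostname suffixes are assumptions that should be checked against each search engine's current documentation.

```python
import socket

# Assumed hostname suffixes for each crawler; verify against the operators' docs.
CRAWLER_DOMAINS = {
    "Googlebot": (".googlebot.com", ".google.com"),
    "bingbot": (".search.msn.com",),
}


def verify_crawler(ip, suffixes,
                   reverse=socket.gethostbyaddr,
                   forward=socket.gethostbyname_ex):
    """Return True if ip reverse-resolves to a host under one of the suffixes
    and that host resolves forward to the same ip again."""
    try:
        hostname = reverse(ip)[0]  # (hostname, aliases, addresses)
    except OSError:
        return False  # no reverse DNS record, or the lookup failed
    if not hostname.lower().endswith(tuple(suffixes)):
        return False  # host is not in the domain the crawler claims
    try:
        addresses = forward(hostname)[2]
    except OSError:
        return False
    # The forward lookup must point back at the original address,
    # otherwise the reverse record could have been forged.
    return ip in addresses
```

For the entry in the question, `verify_crawler("157.55.39.165", CRAWLER_DOMAINS["Googlebot"])` would show whether the address really belongs to the crawler named in the user-agent string. The `reverse` and `forward` parameters exist only so the lookups can be replaced when testing without network access.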
