Link to home
Start Free TrialLog in
Avatar of marceloNYC
marceloNYCFlag for United States of America

asked on

Nagios critical error 800 days old

Hello Experts,

I started here in April and I m trying to clean house.

We use Nagios religously.

There are this errors that I want to clean up. This one I am not sure where to begin:

commsrv1
      
Debian GNU/Linux
      
Internal DNS
      
Notifications for this service have been disabled
      CRITICAL       2015-07-31 09:36:41       890d 16h 6m 48s       4/4       DNS CRITICAL - 0.023 seconds response time (No ANSWER SECTION found)

If you could give me an idea where to begin please.

Thank you!
Screenshot-from-2015-07-31-09-50-12.png
Avatar of Dan Craciun
Dan Craciun
Flag of Romania image

Well, almost 3 years ago your DNS server changed, the person in charge of Nagios did not know how to change the check and disabled the notifications so it would not bother him/her.

Check the services inside /etc/nagios3/conf.d/ until you find the culprit.

HTH,
Dan
i would look at the command being used
i'm guessing it is using check_dig
look at the command line; possible it was not setup correctly
Avatar of marceloNYC

ASKER

Aha! I found this fellow in services conf file

# check that ssh services are running
define service {
    hostgroup_name           internal-dns-servers
    service_description      Internal DNS
    check_command            check_dig!coxxx-server.corp.ok7-eleven.com
    use                      generic-service
}

define service {
    hostgroup_name           store-internal-dns-servers
    service_description      Internal DNS
    check_command            check_dig!coxxx-server.corp.ok7-eleven.com
    normal_check_interval    10
    max_check_attempts       10
    use                      generic-service
}
ASKER CERTIFIED SOLUTION
Avatar of Dan Craciun
Dan Craciun
Flag of Romania image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Digging... need a few
look at the command using check_dig and see what parameters are being used
Okay,

For sure the solution is in the two config files .

My problem now is finding out what server should go where.

There 3 to choose from.

The one that is in the command check_dig! does seem to be doing anything now yet it does respond to a ping.

Asking around to get the right idea.
SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
I am stuck figuring out what name DNS server to put there ( I found 4). That is a beside the issue that you helped me with

Thank you!