Solved

Nagios will not send alerts because it is stuck in a hard state

Posted on 2010-08-19
Last Modified: 2012-05-10
Hi Experts,

I'm currently running Nagios 3.2.1, and for some reason a few of my services get stuck in hard state at attempt 1/5 and don't send alerts (notifications). Any ideas?

Host notifications work; service notifications don't work properly.

Thanks,

E.g.:

Host: MAILFILTER01

Service         Status    Last Check           Duration        Attempt  Status Information
HTTP            CRITICAL  20-08-2010 12:38:42  0d 10h 46m 27s  1/5      TCP CRITICAL - Invalid hostname, address or socket: mailfilter1.xxx.com
Inbound_Queue   CRITICAL  20-08-2010 12:34:42  0d 10h 45m 26s  1/5      (No output returned from plugin)
Outbound_Queue  CRITICAL  20-08-2010 12:35:42  0d 10h 44m 26s  1/5      (No output returned from plugin)


define host{
    use                     SERVICES                                ; Inherit default values from a template
    host_name               MAILFILTER01                            ; The name we're giving to this host
    alias                   Barracuda 1 (Mail Server at Epping)     ; A longer name associated with the host
    address                 mailfilter01.xxx.xxx.au                 ; IP address of the host
    contact_groups          CHIT-Tech,CHIT-Managers,admins
}

define service {
    use                     generic-service
    host_name               MAILFILTER01
    service_description     Inbound_Queue
    check_command           Check_Cuda_Queues!in!50!100
    contact_groups          CHIT-Tech,CHIT-Managers,admins
}

define service {
    use                     generic-service
    host_name               MAILFILTER01
    service_description     Outbound_Queue
    check_command           Check_Cuda_Queues!out!50!100
    contact_groups          CHIT-Tech,CHIT-Managers,admins
}

define service {
    use                     generic-service
    host_name               MAILFILTER01
    service_description     HTTP
    check_command           check_tcp!8000 3000.0,80%!5000.0,100%
}



###############################################################################



define service{
        name                            generic-service
        active_checks_enabled           1
        passive_checks_enabled          1
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 0
        notifications_enabled           1
        event_handler_enabled           1
        flap_detection_enabled          1
        failure_prediction_enabled      1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        is_volatile                     0
        check_period                    24x7
        max_check_attempts              5
        normal_check_interval           5
        retry_check_interval            1
        contact_groups                  admins
        notification_options            u,c,r
        notification_interval           60
        notification_period             24x7
        register                        0
        }

Question by:bossagroup

Accepted Solution

by:
jeremycrussell earned 500 total points
ID: 33485358
It looks like you're possibly missing a '!' in the HTTP check_command, between "8000" and "3000":

check_command             check_tcp!8000 3000.0,80%!5000.0,100%
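
To illustrate why the '!' matters: Nagios splits the check_command value on '!' and hands the pieces to the command definition as $ARG1$, $ARG2$, and so on. The Python sketch below only mimics that splitting (the function name is made up for illustration, and escaped '\!' is not handled). Whether your check_tcp command definition actually uses $ARG2$ and $ARG3$ depends on your commands.cfg, which isn't shown in the question.

```python
def split_check_command(check_command):
    """Split a check_command value into the command name and its arguments.

    Simplified model: the command name and each argument are separated by
    '!', and the arguments are exposed as $ARG1$, $ARG2$, ...
    """
    name, *args = check_command.split("!")
    return name, {f"$ARG{i}$": value for i, value in enumerate(args, start=1)}


# As posted: the port and the first threshold end up in the same argument.
posted = split_check_command("check_tcp!8000 3000.0,80%!5000.0,100%")

# With a '!' added between "8000" and "3000".
suggested = split_check_command("check_tcp!8000!3000.0,80%!5000.0,100%")

print(posted)
print(suggested)
```

As posted, $ARG1$ is "8000 3000.0,80%" rather than just the port, so whatever the command definition puts after the port option receives the extra text as well.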

Then, for your Check_Cuda_Queues check commands: can you run those check scripts manually and get results? If so, check permissions to make sure the user Nagios runs as has permission to run those check scripts.
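
A rough sketch of what "run it manually" checks, assuming Python 3.7+ is available on the Nagios box. A plugin reports its state through its exit code (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and its message on the first line of stdout, so a script that prints nothing shows up as "(No output returned from plugin)". The helper name and the stand-in command are made up for illustration; swap in the real path and arguments of your queue check script.

```python
import subprocess
import sys

# Standard Nagios plugin exit codes.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def run_plugin(argv, timeout=30):
    """Run a plugin command line and return (state, first line of stdout)."""
    result = subprocess.run(argv, capture_output=True, text=True, timeout=timeout)
    lines = result.stdout.strip().splitlines()
    first_line = lines[0] if lines else "(no output on stdout)"
    return STATES.get(result.returncode, "UNKNOWN"), first_line


# Stand-in for a real plugin: prints one status line and exits with code 2.
# Replace this list with the actual script path and its arguments.
demo = [
    sys.executable,
    "-c",
    "import sys; print('QUEUE CRITICAL - stand-in output'); sys.exit(2)",
]
print(run_plugin(demo))
```

Running this as the account Nagios uses (for example via sudo -u nagios, if that is the user on your system) is the useful test: a PermissionError or FileNotFoundError from subprocess.run would point at permissions or the script path rather than at the check itself.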