Solved

Nagios will not send alerts because it is stuck in a hard state

Posted on 2010-08-19
Last Modified: 2012-05-10
Hi Experts,

I'm currently running Nagios 3.2.1, and for some reason a few of my services get stuck in hard state 1/5 and don't send alerts (notifications). Any ideas?

Host notifications work; service notifications don't work properly.

Thanks,

EG

Host: MAILFILTER01

Service         Status    Last Check           Duration        Attempt  Status Information
HTTP            CRITICAL  20-08-2010 12:38:42  0d 10h 46m 27s  1/5      TCP CRITICAL - Invalid hostname, address or socket: mailfilter1.xxx.com
Inbound_Queue   CRITICAL  20-08-2010 12:34:42  0d 10h 45m 26s  1/5      (No output returned from plugin)
Outbound_Queue  CRITICAL  20-08-2010 12:35:42  0d 10h 44m 26s  1/5      (No output returned from plugin)


define host{
	use		SERVICES		; Inherit default values from a template
	host_name	MAILFILTER01	 	; The name we're giving to this host
	alias		Barracuda 1 (Mail Server at Epping)  	; A longer name associated with the host
	address		mailfilter01.xxx.xxx.au	; IP address of the host
	contact_groups	CHIT-Tech,CHIT-Managers,admins
	}

define service {
	use			generic-service
	host_name 		MAILFILTER01
	service_description 	Inbound_Queue
	check_command 		Check_Cuda_Queues!in!50!100
	contact_groups		CHIT-Tech,CHIT-Managers,admins
}

define service {
	use			generic-service
	host_name 		MAILFILTER01
	service_description 	Outbound_Queue
	check_command 		Check_Cuda_Queues!out!50!100
	contact_groups		CHIT-Tech,CHIT-Managers,admins
}

define service {
	use 			generic-service
	host_name 		MAILFILTER01
	service_description 	HTTP
	check_command 		check_tcp!8000 3000.0,80%!5000.0,100%
}

###############################################################################

define service{
        name                            generic-service
        active_checks_enabled           1
        passive_checks_enabled          1
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 0
        notifications_enabled           1
        event_handler_enabled           1
        flap_detection_enabled          1
        failure_prediction_enabled      1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        is_volatile                     0
        check_period                    24x7
        max_check_attempts              5
        normal_check_interval           5
        retry_check_interval            1
        contact_groups                  admins
        notification_options            u,c,r
        notification_interval           60
        notification_period             24x7
        register                        0
        }

Question by:bossagroup
1 Comment
 

Accepted Solution

by:
jeremycrussell earned 2000 total points
ID: 33485358
It looks like you're possibly missing a '!' in the HTTP check_command, between "8000" and "3000":

check_command             check_tcp!8000 3000.0,80%!5000.0,100%
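With the separator added, the HTTP service definition might look like the sketch below. Whether the two extra values are actually used depends on how check_tcp is defined in your own command configuration, which is not shown in the question, so treat this as an illustration of the argument syntax rather than a confirmed fix.

```
# Sketch only: a '!' is added between the port (8000) and the next argument.
# Assumes the local check_tcp command definition consumes $ARG1$ through $ARG3$.
define service {
	use			generic-service
	host_name 		MAILFILTER01
	service_description 	HTTP
	check_command 		check_tcp!8000!3000.0,80%!5000.0,100%
}
```

Each '!' separates one argument, so this passes 8000 as $ARG1$, 3000.0,80% as $ARG2$ and 5000.0,100% as $ARG3$. Without the '!', "8000 3000.0,80%" is passed as a single $ARG1$.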

Then, for your Check_Cuda_Queues check commands: can you run those check scripts manually and get results? If so, check permissions to make sure the user Nagios runs as has permission to execute those check scripts.
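To run the queue check by hand you need the exact command line that Nagios builds from the command definition. The definition of Check_Cuda_Queues is not included in the question, so the following is only a hypothetical sketch of how "Check_Cuda_Queues!in!50!100" would map onto macros; the plugin filename and the argument order are assumptions.

```
# Hypothetical definition: the plugin filename and argument order are assumed,
# not taken from the question. Compare against your own command configuration.
define command {
	command_name	Check_Cuda_Queues
	command_line	$USER1$/check_cuda_queues $HOSTADDRESS$ $ARG1$ $ARG2$ $ARG3$
}
```

With a definition shaped like this, the Inbound_Queue service would expand $ARG1$ to in, $ARG2$ to 50 and $ARG3$ to 100, and $USER1$ to the plugin directory set in your resource file. Run the expanded line as the same user Nagios runs as. "(No output returned from plugin)" indicates that Nagios received nothing on the plugin's standard output, so an empty result or a permission error when run manually would point to the same cause.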