• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 864
  • Last Modified:

Nagios will not send alerts because it is stuck in a hard state

Hi Experts,

Currently running Nagios 3.2.1 and for some reason a few of my services get stuck in hard state 1/5 and dont send alerts (notifications) any ideas?

 Host Nofications work - services dont work properly

Thanks,

EG

MAILFILTER01
 
HTTP
CRITICAL 20-08-2010 12:38:42 0d 10h 46m 27s 1/5 TCP CRITICAL - Invalid hostname, address or socket: mailfilter1.xxx.com

Inbound_Queue
CRITICAL 20-08-2010 12:34:42 0d 10h 45m 26s 1/5 (No output returned from plugin)  

Outbound_Queue
CRITICAL 20-08-2010 12:35:42 0d 10h 44m 26s 1/5 (No output returned from plugin)  


define host{
	use		SERVICES		; Inherit default values from a template
	host_name	MAILFILTER01	 	; The name we're giving to this host
	alias		Barracuda 1 (Mail Server at Epping)  	; A longer name associated with the host
	address		mailfilter01.xxx.xxx.au	; IP address of the host
	contact_groups	CHIT-Tech,CHIT-Managers,admins
	}

define service {
	use			generic-service
	host_name 		MAILFILTER01
	service_description 	Inbound_Queue
	check_command 		Check_Cuda_Queues!in!50!100
	contact_groups		CHIT-Tech,CHIT-Managers,admins
}

define service {
	use			generic-service
	host_name 		MAILFILTER01
	service_description 	Outbound_Queue
	check_command 		Check_Cuda_Queues!out!50!100
	contact_groups		CHIT-Tech,CHIT-Managers,admins
}

	define service {
	use 			generic-service
	host_name 		MAILFILTER01
	service_description 	HTTP
	check_command 		check_tcp!8000 3000.0,80%!5000.0,100%
}

###############################################################################

define service{

        name                            generic-service 

        active_checks_enabled           1       		

        passive_checks_enabled          1    		  

        parallelize_check               1       		

        obsess_over_service             1       		

        check_freshness                 0       		

        notifications_enabled           1       		

        event_handler_enabled           1       		

        flap_detection_enabled          1       		

        failure_prediction_enabled      1       

        process_perf_data               1       		

        retain_status_information       1       		

        retain_nonstatus_information    1       		

        is_volatile                     0       		

        check_period                    24x7			

        max_check_attempts              5			

        normal_check_interval           5			

        retry_check_interval            1			

        contact_groups                  admins			

	notification_options		u,c,r			

        notification_interval           60			

        notification_period             24x7			

         register                       0	     		

        }

Open in new window

0
bossagroup
Asked:
bossagroup
1 Solution
 
jeremycrussellCommented:
It looks like you're possible missing a '!' on the http check_command.  between "8000 3000"

check_command             check_tcp!8000 3000.0,80%!5000.0,100%

Then,  your check_cuda_queues check commands.. can you run those check scripts manualy and get results?  If so, check permissions to make sure the user nagios is running as has permission to run those check scripts.
0

Featured Post

Become an Android App Developer

Ready to kick start your career in 2018? Learn how to build an Android app in January’s Course of the Month and open the door to new opportunities.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now