Solved

Nagios will not send alerts because it is stuck in a hard state

Posted on 2010-08-19
Last Modified: 2012-05-10
Hi Experts,

I'm currently running Nagios 3.2.1, and for some reason a few of my services get stuck in hard state at attempt 1/5 and don't send alerts (notifications). Any ideas?

Host notifications work; service notifications don't work properly.

Thanks,

EG

Host: MAILFILTER01

Service         Status    Last Check           Duration        Attempt  Status Information
HTTP            CRITICAL  20-08-2010 12:38:42  0d 10h 46m 27s  1/5      TCP CRITICAL - Invalid hostname, address or socket: mailfilter1.xxx.com
Inbound_Queue   CRITICAL  20-08-2010 12:34:42  0d 10h 45m 26s  1/5      (No output returned from plugin)
Outbound_Queue  CRITICAL  20-08-2010 12:35:42  0d 10h 44m 26s  1/5      (No output returned from plugin)


define host{
	use		SERVICES		; Inherit default values from a template
	host_name	MAILFILTER01	 	; The name we're giving to this host
	alias		Barracuda 1 (Mail Server at Epping)  	; A longer name associated with the host
	address		mailfilter01.xxx.xxx.au	; IP address of the host
	contact_groups	CHIT-Tech,CHIT-Managers,admins
	}

define service {
	use			generic-service
	host_name 		MAILFILTER01
	service_description 	Inbound_Queue
	check_command 		Check_Cuda_Queues!in!50!100
	contact_groups		CHIT-Tech,CHIT-Managers,admins
}

define service {
	use			generic-service
	host_name 		MAILFILTER01
	service_description 	Outbound_Queue
	check_command 		Check_Cuda_Queues!out!50!100
	contact_groups		CHIT-Tech,CHIT-Managers,admins
}

define service {
	use 			generic-service
	host_name 		MAILFILTER01
	service_description 	HTTP
	check_command 		check_tcp!8000 3000.0,80%!5000.0,100%
}

###############################################################################

define service{
        name                            generic-service
        active_checks_enabled           1
        passive_checks_enabled          1
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 0
        notifications_enabled           1
        event_handler_enabled           1
        flap_detection_enabled          1
        failure_prediction_enabled      1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        is_volatile                     0
        check_period                    24x7
        max_check_attempts              5
        normal_check_interval           5
        retry_check_interval            1
        contact_groups                  admins
        notification_options            u,c,r
        notification_interval           60
        notification_period             24x7
        register                        0
        }

Question by:bossagroup

Accepted Solution

by: jeremycrussell
ID: 33485358
It looks like you may be missing a '!' in the HTTP check_command, between "8000" and "3000":

check_command             check_tcp!8000 3000.0,80%!5000.0,100%
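To see why the missing '!' matters: Nagios splits the check_command value on '!' and hands the pieces to the command definition as $ARG1$, $ARG2$, and so on. Here is a rough Python sketch of that splitting, just for illustration. The function name split_check_command is made up for this example, it ignores escaped '!' characters, and the "suggested" line is only the form the missing '!' would give; whether check_tcp then accepts those values depends on your local check_tcp command definition, which isn't shown in the question.

```python
def split_check_command(check_command):
    """Split a check_command value into the command name and its $ARGn$ macros.

    Illustrative helper only (not part of Nagios); escaped '!' is not handled.
    """
    name, *args = check_command.split("!")
    return name, {f"$ARG{i}$": arg for i, arg in enumerate(args, start=1)}


# The line as posted, and the same line with the '!' added after the port.
original = "check_tcp!8000 3000.0,80%!5000.0,100%"
suggested = "check_tcp!8000!3000.0,80%!5000.0,100%"

for label, value in (("original", original), ("suggested", suggested)):
    name, args = split_check_command(value)
    print(label, name)
    for macro, arg in args.items():
        print("   ", macro, "=", repr(arg))
```

With the line as posted, $ARG1$ ends up as "8000 3000.0,80%" rather than just the port number, so whatever option the command definition attaches $ARG1$ to receives the port and the first threshold together.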

Then, for your Check_Cuda_Queues check commands: can you run those check scripts manually and get results? If so, check that the user Nagios runs as has permission to execute those scripts.
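If it helps, here is a small Python sketch for the manual test. It runs a plugin command line and reports the exit state, the first line of output, and anything written to stderr. The helper name run_plugin is my own, and the plugin path in the usage comment is a placeholder, since the actual location and arguments of your queue script aren't shown in the question. Exit codes 0 to 3 are the standard plugin states; an empty first line is the situation that can show up in the web interface as "(No output returned from plugin)".

```python
import subprocess

# Standard Nagios plugin exit codes.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def run_plugin(argv, timeout=30):
    """Run a plugin command line; return (state, first stdout line, stderr)."""
    try:
        proc = subprocess.run(argv, capture_output=True, text=True, timeout=timeout)
    except OSError as exc:
        # Missing file, no execute permission, bad interpreter line, etc.
        return "NOT RUNNABLE", "", str(exc)
    except subprocess.TimeoutExpired:
        return "TIMED OUT", "", ""
    state = STATES.get(proc.returncode, f"out of range ({proc.returncode})")
    lines = proc.stdout.splitlines()
    return state, (lines[0] if lines else ""), proc.stderr.strip()


# Example call (placeholder path and arguments, adjust to your setup):
# print(run_plugin(["/path/to/your/queue_check_script", "in", "50", "100"]))
```

To make the permission test meaningful, run it as the account Nagios uses (often called nagios, but that is an assumption about your install), for example through sudo -u. If it works as root but comes back "NOT RUNNABLE" or with empty output as that user, permissions are the likely cause.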