Solved

redhat cluster

Posted on 2014-02-15
6
670 Views
Last Modified: 2014-03-02
I recently setup redhat cluster.  fencing is working fine.  I created a new service group for apache. when i tried to do manually everything fine. If i try to start the service its failing. Please advise.

[root@node1 ~]# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster config_version="31" name="prod">
        <clusternodes>
                <clusternode name="node1.rnd.ca" nodeid="1">
                        <fence>
                                <method name="vmfence">
                                        <device name="fence_testserver1" ssl="on" uuid="423331fe-8301-8171-5079-9a5007a792d4"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="node2.rnd.ca" nodeid="2">
                        <fence>
                                <method name="vmfence">
                                        <device name="fence_testserver1" ssl="on" uuid="4233f57e-a8fc-8fce-b413-ff39ec992fcc"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="node3.rnd.ca" nodeid="3">
                        <fence>
                                <method name="vmfence">
                                        <device name="fence_testserver1" ssl="on" uuid="4233c09b-347e-b13b-cb9b-6c88d3101e3c"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <rm>
                <resources>
                        <lvm lv_name="halv" name="halvm" self_fence="on" vg_name="webvg"/>
                        <apache config_file="conf/httpd.conf" name="web-server" server_root="/etc/httpd" shutdown_wait="0"/>
                        <fs device="/dev/webvg/halv" force_unmount="on" fsid="49557" mountpoint="/var/www" name="wwwfs" quick_status="on" self_fence="on"/>
                        <ip address="192.168.2.35/255.255.255.0" disable_rdisc="on" sleeptime="5"/>
                </resources>
                <failoverdomains>
                        <failoverdomain name="web" ordered="1">
                                <failoverdomainnode name="node1.rnd.ca" priority="1"/>
                                <failoverdomainnode name="node2.rnd.ca" priority="1"/>
                                <failoverdomainnode name="node3.rnd.ca" priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <service domain="web" name="webSG" recovery="relocate">
                        <lvm ref="halvm"/>
                        <fs ref="wwwfs"/>
                        <apache ref="web-server"/>
                        <ip ref="192.168.2.35/255.255.255.0"/>
                </service>
        </rm>
        <fencedevices>
                <fencedevice agent="fence_vmware_soap" ipaddr="192.168.2.23" login="administrator" name="vmware-fence" passwd="Oracle@to" power_wait="5"/>
                <fencedevice agent="fence_vmware_soap" ipaddr="192.168.2.23" login="administrator" name="fence_testserver1" passwd="Oracle@to"/>
        </fencedevices>
</cluster>




[root@node1 ~]# tailf /var/log/messages
Feb 15 20:37:56 node1 rgmanager[10440]: Service service:webSG is recovering
Feb 15 20:37:56 node1 rgmanager[10440]: #71: Relocating failed service service:webSG
Feb 15 20:38:02 node1 rgmanager[10440]: Service service:webSG is stopped
Feb 15 20:39:54 node1 kernel: EXT4-fs (dm-2): mounted filesystem with ordered data mode. Opts:
Feb 15 20:40:39 node1 rgmanager[10440]: Stopping service service:webSG
Feb 15 20:40:40 node1 rgmanager[13277]: [apache] Stopping Service apache:web-server
Feb 15 20:40:40 node1 rgmanager[13299]: [apache] Checking Existence Of File /var/run/cluster/apache/apache:web-server.pid [apache:web-server] > Failed - File Doesn't
Feb 15 20:40:40 node1 rgmanager[13321]: [apache] Stopping Service apache:web-server > Succeed
Feb 15 20:40:40 node1 rgmanager[13428]: [fs] unmounting /var/www
Feb 15 20:40:41 node1 rgmanager[10440]: Service service:webSG is disabled


^C
[root@node1 ~]# date
Sat Feb 15 20:41:13 EST 2014
[root@node1 ~]#
[root@node1 ~]#
[root@node1 ~]# clustat
Cluster Status for prod @ Sat Feb 15 20:43:57 2014
Member Status: Quorate

 Member Name                                                     ID   Status
 ------ ----                                                     ---- ------
 node1.rnd.ca                                                        1 Online, Local, rgmanager
 node2.rnd.ca                                                        2 Online, rgmanager
 node3.rnd.ca                                                        3 Online, rgmanager

 Service Name                                                     Owner (Last)                                                     State
 ------- ----                                                     ----- ------                                                     -----
 service:webSG                                                    (node1.rnd.ca)                                                   failed
0
Comment
Question by:ittechlab
  • 3
  • 2
6 Comments
 
LVL 28

Expert Comment

by:asavener
ID: 39867415
OK, I have to ask:  The whole purpose of using RedHat over some other distro is to get support.  Have you opened a ticket with them about this?
0
 
LVL 7

Expert Comment

by:multimac
ID: 39870035
Does the directory
/var/run/cluster/apache/
exist?

If not, recreate
mkdir -p /var/run/cluster/apache/
0
 

Author Comment

by:ittechlab
ID: 39870333
in what order i have to create the service group.

here are the resources i am using

HA-LVM
File system
IP - floating IP
Apache
0
Backup Your Microsoft Windows Server®

Backup all your Microsoft Windows Server – on-premises, in remote locations, in private and hybrid clouds. Your entire Windows Server will be backed up in one easy step with patented, block-level disk imaging. We achieve RTOs (recovery time objectives) as low as 15 seconds.

 
LVL 7

Expert Comment

by:multimac
ID: 39870348
The order is ok
0
 

Accepted Solution

by:
ittechlab earned 0 total points
ID: 39870361
I am going to recreate the service group and lets see.
0
 

Author Closing Comment

by:ittechlab
ID: 39898362
my solution worked out.
0

Featured Post

VMware Disaster Recovery and Data Protection

In this expert guide, you’ll learn about the components of a Modern Data Center. You will use cases for the value-added capabilities of Veeam®, including combining backup and replication for VMware disaster recovery and using replication for data center migration.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

It Is not possible to enable LLDP in vSwitch(at least is not supported by VMware), so in this article we will enable this, and also go trough how to enabled CDP and how to get this information in vSwitches and also in vDS.
In this article, I am going to show you how to simulate a multi-site Lab environment on a single Hyper-V host. I use this method successfully in my own lab to simulate three fully routed global AD Sites on a Windows 10 Hyper-V host.
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
This demo shows you how to set up the containerized NetScaler CPX with NetScaler Management and Analytics System in a non-routable Mesos/Marathon environment for use with Micro-Services applications.

911 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

16 Experts available now in Live!

Get 1:1 Help Now