[Okta Webinar] Learn how to a build a cloud-first strategyRegister Now

x
?
Solved

VMWare ESXI 5.0.0 Host Crashing

Posted on 2014-08-20
25
Medium Priority
?
856 Views
Last Modified: 2014-09-21
For the third time in a month one of my hosts has crashed, it happens when restoring an oracle database. Tricky part here is that the host crashes, not the guest operating system, well, after a while none of the VM's are available. Has anyone here experienced something similar and if so, how was the issue solved?

Thanks for the help.

Regards,

Francisco
0
Comment
Question by:felguera
  • 10
  • 6
  • 5
  • +1
23 Comments
 
LVL 21

Assisted Solution

by:Mazdajai
Mazdajai earned 2000 total points
ID: 40274824
5.0 is out-dated. You should consider updating to 5.1 or 5.5 as there may be known bugs related to host crash in 5.0
0
 

Author Comment

by:felguera
ID: 40274859
That was my second question, can I update the host while it's running or I need to bring it down?
0
 
LVL 124
ID: 40274878
Check your host hardware, CPU, Fans, Memory, run a memory test for 48 hours.

Do you get a PSOD ?

How does it crash? hang?

What do you do to restart?

Updating may not be the answer, if you have a hardware issue, updating brings it's own challenges, is your server supported ?

What version of 5.0, it's completely acceptable to be running 5.0.

You will have to shutdown the Host and VMs, to update.
0
What does it mean to be "Always On"?

Is your cloud always on? With an Always On cloud you won't have to worry about downtime for maintenance or software application code updates, ensuring that your bottom line isn't affected.

 
LVL 21

Expert Comment

by:Mazdajai
ID: 40274881
The upgrade requires a reboot.

If you have vcenter, you can migrate the vms to another ESX host. If this is a standalone host. You will need to shutdown all the vms.
0
 

Author Comment

by:felguera
ID: 40274922
No PSOD, no hardware issues, it is strange, only happens when a VM running RHEL 6 that we are using to run Oracle DB 11G r 2, when restoring the database from a backup, the host looses its network connection, have no idea why and it is just making my head spin.

To restart I have to power cycle the box, no memory issues, the error I get is : "Lost access to volume xxxxxx (machine name) due to connectivity issues. Recovery attempt is in progress and outcome will be reported shortly"

Then: "Host is not responding"

Then all the vm show as disconnected and then all goes down...

We do have another ESXI host running 5.1 but even when the machines are from the same family, they are a couple years apart in age, cannot do vmotion...
0
 
LVL 124
ID: 40274975
Live vMotion is probably due to mismatched processors!

Are you using an iSCSI datastore ?

What ESXi 5.0 build are you using?

What is the server?

On the HCL ?
0
 
LVL 62

Expert Comment

by:gheist
ID: 40274999
How "everything goes down"
Is redhat vm responding? Can you access shared storage from other hosts?
0
 

Author Comment

by:felguera
ID: 40275167
That is correct, live vMotion is due to mismatched processors, but cannot change them so I am stuck there.

The server is an IBM 3850 M2 120GB RAM

Datastore is connected by fiber channel through a cisco nexus, the pipe is 8GB

SAN is IBM v3700 and have plenty of space

Yes it is on the HCL, we even upgraded the firmware to it's latest version.
0
 

Author Comment

by:felguera
ID: 40275176
Forgot the build, is esxi 5.0.0,623860
0
 
LVL 62

Expert Comment

by:gheist
ID: 40275296
RHEL 6U5 requires ESXi 5.0 U2 while you have U1, so upgrade vmware (with all reboots involved)
0
 
LVL 124
ID: 40275557
The latest version of ESXi 5.0 is Build 1918656, released on the 14 July 2014, so you are little behind!

Update and then re-test.
0
 
LVL 62

Expert Comment

by:gheist
ID: 40275573
U2 should be sufficient to run RHEL6, though later version will rule out more bugs.
0
 

Author Comment

by:felguera
ID: 40276729
So I should update, cannot do it today though, for this is a running production environment. Will post my results.
0
 
LVL 124
ID: 40276760
Yes, I would update ASAP!
0
 
LVL 62

Expert Comment

by:gheist
ID: 40277179
At least get to 5.0.0 Update 3
If you have vcenter it would be a good time to consider getting VUM to work.
0
 

Author Comment

by:felguera
ID: 40329170
More issues coming up, EE thinks I had abandoned the issue, not at all. We purchased a server and setup esxi 5.5 on it, moved the machine to that server and now we are getting an nvram write failed error. Again, will update when more information is available. Maddening, this is just insane.
0
 
LVL 124
ID: 40329180
I think it would now be a good time to open a new question on this recent issue has it has nothing to do with the original question asked.

Did you upgrade the host in this question?
0
 

Author Comment

by:felguera
ID: 40329184
Yes I did, new host, same virtual machine. You are right I will close this question and move on to a new one.
Thanks!

FE
0
 
LVL 62

Expert Comment

by:gheist
ID: 40329212
I think we wanted to know IF you
1) tried to validate old hardware with memtest
2) upgraded to at least formally supported ESX(i) for RHEL 6U5
0
 

Author Comment

by:felguera
ID: 40329229
Gheist,

Memory tests were successful
There is no point in upgrading the old machine for the processors are not compatible, even when the two machines are from the same family. We purchased a machine that is exactly the same as the one we are going to keep on our environment, thank the internet and ebay! so we would be able to do vmotion and move on. I will open a new question for the new issue I am facing.
Thanks all for your comments.

FE
0
 

Accepted Solution

by:
felguera earned 0 total points
ID: 40330601
I don't know how to close this question, so I am going to post my solution right here:

Replaced machine by one that would be compatible to the newest member of the environment, installed esxi 5.5.0 and then configured storage connections and fabric connections, afterwards moved the vm that was crashing the host to the new box.
Since the issue was continuing I did a couple of changes on the vm, changed SCSI to paravirtual and disabled (in options) fibre channel NPIV. After that was done I tried to restore the database again, this time the process was successful.

Thanks for the help.

FE
0
 

Author Comment

by:felguera
ID: 40330694
I've requested that this question be closed as follows:

Accepted answer: 0 points for felguera's comment #a40330601

for the following reason:

It required me a lot of time to figure this one out, the obvious was stated by other users, updating to 5.5.0 made it possible for the hardware to use paravirtual SCSI which was not possible with the earlier version and disabling the fibre channel npiv.
0
 
LVL 62

Expert Comment

by:gheist
ID: 40330695
You silently rang to accept Hancooka's first of the first comments and give some beef to thers asking you to upgrade ESXi
0

Featured Post

[Webinar] Cloud and Mobile-First Strategy

Maybe you’ve fully adopted the cloud since the beginning. Or maybe you started with on-prem resources but are pursuing a “cloud and mobile first” strategy. Getting to that end state has its challenges. Discover how to build out a 100% cloud and mobile IT strategy in this webinar.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

In this article, I will show you HOW TO: Perform a Physical to Virtual (P2V) Conversion the easy way from a computer backup (image).
In the first part of this tutorial we will cover the prerequisites for installing SQL Server vNext on Linux.
Teach the user how to install and configure the vCenter Orchestrator virtual appliance Open vSphere Web Client: Deploy vCenter Orchestrator virtual appliance OVA file: Verify vCenter Orchestrator virtual appliance boots successfully: Connect to the …
Teach the user how to use vSphere Update Manager to update the VMware Tools and virtual machine hardware version Open vSphere Client: Review manual processes for updating VMware Tools and virtual hardware versions: Create a new baseline group in vSp…
Suggested Courses

834 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question