• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 786
  • Last Modified:

HP ML350 G4 windows 2003 Storage server sp1 Hangs with out any error

Hi

We have ML350 G4 windows 2003 storage server with SP1. it has 4GB of RAM and
RAID 1 & 5 configured.

We are having a problem on this server hanging from very beginning and HP
support has changed motherboard, RAM still hanging(we can not really say when
this server hangs mostly when we needed most). We did test with HP's test
software and doesn't show any error on hardware. As this is our main file
server we thought we restart (scheduled) server every morning this will go
away but no success.

We were wondering if any one could suggest what we can do here. We have
latest drivers from HP installed.

Thank you in advance

0
agurung
Asked:
agurung
  • 10
  • 5
  • 4
5 Solutions
 
Darwinian999Commented:
Call HP every time it hangs. They'll probably change the power supplies next. HP have excellent warranty support, and they generally do everything they can to get things fixed to their customers satisfaction.

One of our customers has an ML370 G2 that was going out of warranty. They didn't want to replace the server, so we advised them to pay HP for extended warranty. It payed for itself in a few months - the server started to intermittently hang after a nearby lightening strike, so HP swapped out everything one-at-a-time, then all-at-once with a completely different set of parts to get it going.
0
 
agurungAuthor Commented:
Hi Darwinian999

Thanks for your reply.. we do not have very good experience with HP support though. We have support agreement of 24/7 but if we log a call around 3:00pm we never get a call back till next day afternoon and forget about them calling back in weekend if that falls on weekend.............. we are having problem with this serve from very first day we received... seems like we will have never ending problem with this server.

any way i went through the KB articles you send me and  we seems to have same situation as describe on KB "http://support.microsoft.com/kb/890018/en-us " we have asked microsoft to send us hotfix for this. we will see if that make any difference.

Thank you
0
What does it mean to be "Always On"?

Is your cloud always on? With an Always On cloud you won't have to worry about downtime for maintenance or software application code updates, ensuring that your bottom line isn't affected.

 
agurungAuthor Commented:
Hi guys

sorry  I actually forgotten to mention that KB links were sent by "TheCleaner" and thanks "Thecleaner" once again.
0
 
Darwinian999Commented:
If you're getting lousy support from HP, escalate it through their management. A rocket from above to a local office can work wonders. ;)

Hopefully the hotfix referenced by TheCleaner will work though, and you won't need to call HP.
0
 
agurungAuthor Commented:
Hi

We went through KB article http://support.microsoft.com/kb/890018/en-us and downloaded the hotfix only to find out this hotfix was already included in SP1. Situation describe on KB article and our is pretty much similer.

We still have same problem and microsoft advice us to dig around to see if there is any other patch...

Any other idea?

Thank you
0
 
TheCleanerCommented:
Well:

Date         Time   Version       Size     File name
   ----------------------------------------------------
   15-Jan-2005  03:57  5.2.3790.251  144,384  Dmio.sys

Can you confirm that you have ^^^ that file with that size and version number?

Just to be sure.


Also, is that the case that you have an NFS share with quotas enabled or is it just "similar" to that??  That hotfix is pretty specific in what it does.

Is this your posts?  http://groups.google.com/group/microsoft.public.windows.server.general/browse_thread/thread/9e0e0d533a1a387f/ff9f7d374f31f274%23ff9f7d374f31f274

Also, is it possible to upgrade to R2 of storage server?  Do you have Software Assurance?
0
 
agurungAuthor Commented:
Well we found dmio.sys in two places (c:\windows\servicepackfiles\i386 & c:\windows\system32\drivers). we found those files to be version "5.2.3790.1830" size "146KB", Created on "12 july 2005, 1:32:18 AM" and modified on "25 March 2005, 9:00:50AM"

This server is NAS box from HP so it came by default NFS enabled but we haven't done any changes with NFS. One of the reason we haven't disabled NFS as we are on the process of allowing linux(Fedora) clients to access this file server.

Yes  http://groups.google.com/group/microsoft.public.windows.server.general/browse_thread/thread/9e0e0d533a1a387f/ff9f7d374f31f274%23ff9f7d374f31f274
is our posts. we are doing this in several news group to see if any one could help us out.

No, we do not have software assurance and windows2003 storage server came preinstalled with HP and they are very fussy with what we install on this nas box. e.g. we couldn't install SP1 on this box which was downloaded from ms web site instead we had to get the CD from HP to install SP1

Hope this has help to to see where i am coming from.

Thank you
0
 
TheCleanerCommented:
If you are just in the process of allowing NFS, can you disable or uninstall the NFS components for now, and see if it stops hanging?

When you say it "hangs" can you give a full description of what happens?  Is it just really slow getting into Windows and then fine?  Or does it lock up completely on boot and if you reboot all is fine?  Or does it boot fine, get into windows, and then sometime during the day lock up and you have to reboot?  The more details the better.
0
 
agurungAuthor Commented:
We were thinking of disabling the NFS for now, which we will try...

"boot fine, get into windows, and then sometime during the day lock up and you have to reboot?" is what we are experiencing.

We have made this server to reboot every morning as we thought this might help us but it hasn't. During the day users comes to us saying they can not access drives etc & we try login to the server, it just gives us desktop without any icons anything. if we press ctrl + ALT +Del and try rebooting it just stays there. Server can not be pinged we mean no access at all. Only option we have this time is just to power off & power on the server (from switch).

Once we do that it works fine for unknown duration & we will have no idea when server going to hang up again.

Hope this will help you

Thank you for your time.

0
 
agurungAuthor Commented:
Hi

We did disable NFS on our server and still its hanging.. we do not know what to do with this server any more.

Thank you
0
 
Darwinian999Commented:
Call HP and let them sort it out. The server isn't operating as it should, so it they have to fix it under warranty.
0
 
TheCleanerCommented:
I agree with Darwinian....it definitely seems strange.  You might as well try reinstalling everything from scratch too, since they'll probably go that route too, blaming software at this point.
0
 
agurungAuthor Commented:
Now HP has came back and changed RAID Card & Update all the firmware as they wanted to be sure. Problem is our server Hanged yesterday...

We did some performance monitoring of server funny thing is when we looked at Performance from Task Manager, CPU usually sits around 3-5% & memory sits around 900mb mark(out of 4GB)

On the other hand when we checked from performance monitor

"File Data Operations/Sec(System)" sits pretty much 80-100%
"Bytes Total/Sec (Network Interface)" gets up to 100% but comes down after some time.
Pages/Sec (Memory) also gets up to 100% also come down after some time
%Disk Time (Logical) & %Disk Time (Physical) they seems ok as they do UP and down

%Processor Time (Processor) rarely gets more than 25% mark

This is our main file server are these performance are usual?

Now we just have change CPU & Hard disks before we get whole set new.. we have around 250 workstations connecting do you think its because of overloaded server? we have 2 * Xeon Processor, 4GB RAM & GB NIC

Thank you
0
 
Darwinian999Commented:
Most of the counters that you've listed show current values, not percentages, so you're better off looking at the values by changing to "Report view" in performance monitor.

With the specs that you've listed, it's unlikely that your server is overloaded. Even if it was overloaded, it shouldn't hang, the users would just experience slow response from the server.
0
 
agurungAuthor Commented:
Hi All

Now a days it slowly stops responds to the services e.g.not able to connect through Remote DeskTop, not able to open even viewer locally and finally stops everything than all we can do is power off the server manually and power on than everything works as normal.

we can connect to event viewer of server from my workstation and browse it. I checked the services to see if any has stoped but non of them has stoped. server is not just excepting the request.

Thank you again guys stayin with us till now.
0
 
Darwinian999Commented:
Sounds to me like there may be a memory leak. Reboot, then open Task Manager, go to the Performance tab and have a look at the Nonpaged Kernel Memory that's in use. If it creeps up much over the following days, then there's a strong possibility that there's a memory leak in something.

The following URL has lots of info on troubleshooting memoury leaks. http://labmice.techtarget.com/troubleshooting/memoryleaks.htm
0
 
agurungAuthor Commented:
Hi darwinian

Thanks for that Now we are looking on to Symantec Antivirus. We looked at performance monitor and found that "Private Bytes" for "rtvscan" has nearly 3GB ram reserved and never comes down hovers around 2500-3GB ram. We have called symatec suport and still waiting for the call back. see what happens.

Thank you
0
 
agurungAuthor Commented:
Hi Guys

Just wanted to thank you all. We have changed couple more parts in the box and also have upgraded Antivirus. it has been couple of days it hasn't hang so let see how it will go.

Thank you
0

Featured Post

Hire Technology Freelancers with Gigs

Work with freelancers specializing in everything from database administration to programming, who have proven themselves as experts in their field. Hire the best, collaborate easily, pay securely, and get projects done right.

  • 10
  • 5
  • 4
Tackle projects and never again get stuck behind a technical roadblock.
Join Now