1) In one window run...
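A minimal sketch of such a timestamp loop, assuming a POSIX-style shell with `local` (bash or dash). The function name `heartbeat` and its arguments are illustrative, not the original poster's command.

```shell
# heartbeat INTERVAL [COUNT]
# Print a timestamp every INTERVAL seconds. COUNT=0 (the default) loops
# until interrupted; a positive COUNT stops after that many lines.
heartbeat() {
  local interval="${1:-1}" count="${2:-0}" i=0
  while [ "$count" -eq 0 ] || [ "$i" -lt "$count" ]; do
    date '+%Y-%m-%d %H:%M:%S'
    i=$((i + 1))
    sleep "$interval"
  done
}
```

Running `heartbeat 1` over an SSH session leaves the last printed timestamp on your local screen once the machine stops responding.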
2) Now in another window run this command.
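One way to record which log files were touched most recently is to poll with `find`. This is only a sketch under assumptions: the function name `snapshot_recent_logs`, the `/var/log` default and the one-minute window are all illustrative, and the original command may well have used an event-based tool instead.

```shell
# snapshot_recent_logs [DIR] [MINUTES]
# Print a timestamp header, then every regular file under DIR that was
# modified less than MINUTES minutes ago. Defaults are assumptions.
snapshot_recent_logs() {
  local dir="${1:-/var/log}" minutes="${2:-1}"
  printf '=== %s ===\n' "$(date '+%Y-%m-%d %H:%M:%S')"
  # Permission errors on unreadable subdirectories are discarded.
  find "$dir" -type f -mmin "-$minutes" 2>/dev/null
}
```

To build up ~/fs.log, something like `while true; do snapshot_recent_logs /var/log 1 >> ~/fs.log; sleep 10; done` would work. Reading some files under /var/log may require root.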
3) Then wait...
At some point the machine will die.
Hopefully the last line printed by the while loop will show the time of machine death, to within the loop's sleep interval.
The ~/fs.log file will then show the last few log files touched, which will help narrow down where to look for the problem.
4) Another likely command to run will be this...
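A hedged sketch of a process table snapshot, using only the POSIX `ps -A -o` options. The function name `process_snapshot` and the top-N summary are assumptions for illustration, not the original command.

```shell
# process_snapshot [TOP]
# Print a timestamp and the total process count, then the TOP most
# numerous command names with how many processes each has.
process_snapshot() {
  local top="${1:-10}"
  printf '=== %s total=%s ===\n' \
    "$(date '+%Y-%m-%d %H:%M:%S')" \
    "$(ps -A -o pid= | grep -c '')"
  # comm= prints the command name with no header line.
  ps -A -o comm= | sort | uniq -c | sort -rn | head -n "$top"
}
```

Appending this to a file once a minute, for example `while true; do process_snapshot >> ~/ps.log; sleep 60; done`, gives a record in which a steadily growing count for one command name stands out.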
This will correlate process table entries with machine death time.
For example, suppose someone has made the mistake of calling out to some 3rd party service (like a CRM to record an opt-in). If the 3rd party API responds more slowly than requests arrive at your machine, or the API just goes down periodically, then your machine will start piling up processes, likely PHP processes, which never end.
Once you have hundreds or thousands of processes that never end, machine death will occur.
5) More data to capture is swap data, which is simple to capture...
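A small sketch for capturing swap usage on Linux by reading the `SwapTotal` and `SwapFree` fields of /proc/meminfo. The function name `swap_usage` and the output format are illustrative assumptions.

```shell
# swap_usage [MEMINFO_FILE]
# Print used and total swap in kB plus the used percentage, computed
# from the SwapTotal and SwapFree lines (default file: /proc/meminfo).
swap_usage() {
  local meminfo="${1:-/proc/meminfo}"
  awk '
    $1 == "SwapTotal:" { total = $2 }
    $1 == "SwapFree:"  { swapfree = $2 }
    END {
      used = total - swapfree
      pct = (total > 0) ? int(used * 100 / total) : 0
      printf "swap_used_kB=%d swap_total_kB=%d swap_used_pct=%d\n", used, total, pct
    }
  ' "$meminfo"
}
```

Logged with a timestamp at a fixed interval, for example `while true; do echo "$(date) $(swap_usage)" >> ~/swap.log; sleep 60; done`, this makes an upward trend easy to spot.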
If you see swap usage trend up and max out your swap space, this means your OOM Killer (Out Of Memory Killer) will trigger. It kills processes, picking victims by a memory-based badness score rather than by how important they are, trying to keep the machine alive.
This tends to fail, as some important process always seems to get killed off.
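After a reboot, the kernel log can confirm whether the OOM Killer fired. A minimal sketch, assuming kernel messages are piped in on stdin; the function name `oom_events` is illustrative and the patterns cover only the common message wording.

```shell
# oom_events
# Filter kernel log lines on stdin, keeping those that mention the
# OOM Killer being invoked or a process being killed for memory.
oom_events() {
  grep -E -i 'out of memory|invoked oom-killer'
}
```

Typical usage would be `dmesg | oom_events`, or `journalctl -k | oom_events` on systemd machines that keep a persistent journal.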