Topics:
DVR
nvrec
Mplayer
Links
Misc
Commands
Humor
New user
uploaded files
|
(linux_command_line)-> White box crash after long uptime |
submited by Russell Wed 09 Nov 05 |
I have white box linux installed on a computer in my office that we use as a file server. It was online w/o issues for 71 days. well w/o issues for about 69 days. the past few days it seem sluggish, then this afternoon, it crashed with a kernal error ( sync failed). I happend to have top running on a remote machine. here is what it looked like just before the crash:
top - 15:22:23 up 71 days, 6:47, 2 users, load average: 5.97, 5.75, 6.39
Tasks: 99 total, 1 running, 98 sleeping, 0 stopped, 0 zombie
Cpu(s): 1.6% us, 1.6% sy, 0.0% ni, 0.0% id, 96.4% wa, 0.3% hi, 0.0% si
Mem: 157704k total, 156756k used, 948k free, 356k buffers
Swap: 327672k total, 205196k used, 122476k free, 6512k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
25594 russell 16 0 8752 296 188 S 1.0 0.2 0:41.71 sshd
26449 root 15 0 93136 4372 448 D 1.0 2.8 1:14.45 perl
25622 russell 17 0 3740 596 416 R 0.7 0.4 2:10.42 top
30 root 15 0 0 0 0 S 0.3 0.0 70:18.49 kswapd0
1 root 16 0 3436 68 48 S 0.0 0.0 0:31.82 init
2 root 34 19 0 0 0 S 0.0 0.0 0:21.98 ksoftirqd/0
3 root 5 -10 0 0 0 S 0.0 0.0 0:01.54 events/0
4 root 5 -10 0 0 0 S 0.0 0.0 0:00.04 khelper
5 root 5 -10 0 0 0 S 0.0 0.0 0:00.90 kblockd/0
6 root 15 0 0 0 0 S 0.0 0.0 0:00.00 khubd
31 root 10 -10 0 0 0 S 0.0 0.0 0:00.00 aio/0
26 root 15 0 0 0 0 S 0.0 0.0 0:00.84 kapmd
105 root 25 0 0 0 0 S 0.0 0.0 0:00.00 kseriod
192 root 6 -10 0 0 0 S 0.0 0.0 0:00.00 kmirrord/0
205 root 15 0 0 0 0 S 0.0 0.0 2:10.39 kjournald
1056 root 6 -10 1824 0 0 S 0.0 0.0 0:00.03 udevd
1228 root 15 0 0 0 0 S 0.0 0.0 161:13.20 md2_raid5
What is odd to me about this is that no process is doing much of anything , but the sysload is almost 6.
A normal top session, after the reboot looks like this:
top - 17:35:10 up 23 min, 1 user, load average: 0.00, 0.00, 0.05
Tasks: 61 total, 1 running, 60 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.7% us, 0.7% sy, 0.0% ni, 98.4% id, 0.0% wa, 0.0% hi, 0.3% si
Mem: 157704k total, 72088k used, 85616k free, 9712k buffers
Swap: 327672k total, 0k used, 327672k free, 31456k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
2603 russell 16 0 7620 2224 1816 S 0.7 1.4 0:07.21 sshd
2631 russell 17 0 3716 932 756 R 0.3 0.6 0:06.38 top
.....
Notice that more than half of the physical memory is free after a reboot.
This gets me thinking that I may have a memory leak. but not just any memory leak, but a leak in kernel, unswappable memory. which would suffocate the machine as it tried to live with less and less avaible real memory. I don't have any hard evidence of this, but I want to keep an eye on this.
|
Add comment or question...:
|