SUMMARY: SPARCserver 1000 crashing repeatedly.

From: Steve Kirkpatrick (Steve.Kirkpatrick@artecon.com)
Date: Mon Nov 29 1993 - 19:14:35 CST


First, let me say thanks to all those who replied!

It seems that I have gotten around both of the problems I asked for help
with.

Problem 1:

| I have a SPARCserver 1000 that has been crashing 1-2 times a day for the
| last week. I am hoping it is something simple like a bad SIMM, but I
| can't see evidence of a SIMM location in the following dmesg output:
|
| ------------ Output from dmesg -----------
| BAD TRAP: cpu_id=3 type=9 <Data fault> addr=40 rw=1 rp=e0372b5c
| MMU sfsr=0x126: ft=<Invalid address error> at=<supv data load> level=1
| MMU sfsr=0x126<FAV>
| sched: Data fault
| kernel read fault at addr=0x40, pte=0x1
| MMU sfsr=0x126: ft=<Invalid address error> at=<supv data load> level=1
| MMU sfsr=0x126<FAV>
| put+0x4, pid=0, pc=0xe0020198, sp=0xe0372ba8, psr=0x404004c2, context=0
| g1-g7: 404000e2, f602f000, ffffff00, 0, 0, 1, e0372ec0
| Begin traceback... sp = e0372ba8
- much more deleted...
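
For anyone reading a trap like this by hand, the sfsr value can be split into
its fields. The following is a minimal Python sketch, assuming the SPARC V8
reference MMU layout for the fault status register (OW in bit 0, FAV in bit 1,
FT in bits 2-4, AT in bits 5-7, L in bits 8-9). The function name and the
label strings are illustrative, not taken from any Sun tool.

```python
# Sketch: decode an SRMMU synchronous fault status register (SFSR) value.
# Bit positions are an assumption based on the SPARC V8 reference MMU layout.

FAULT_TYPES = {
    0: "None",
    1: "Invalid address error",
    2: "Protection error",
    3: "Privilege violation error",
    4: "Translation error",
    5: "Access bus error",
    6: "Internal error",
    7: "Reserved",
}

# Labels are illustrative; only "supv data load" appears in the dmesg output.
ACCESS_TYPES = {
    0: "user data load",
    1: "supv data load",
    2: "user instr fetch",
    3: "supv instr fetch",
    4: "user data store",
    5: "supv data store",
    6: "user instr store",
    7: "supv instr store",
}


def decode_sfsr(value):
    """Return the SFSR fields of `value` as a dictionary."""
    return {
        "ow": bool(value & 0x1),            # overwrite bit
        "fav": bool((value >> 1) & 0x1),    # fault address valid
        "ft": FAULT_TYPES[(value >> 2) & 0x7],
        "at": ACCESS_TYPES[(value >> 5) & 0x7],
        "level": (value >> 8) & 0x3,        # page table level of the fault
    }


if __name__ == "__main__":
    for name, field in decode_sfsr(0x126).items():
        print(f"{name}: {field}")
```

Decoding 0x126 this way gives the same fault type, access type, level and FAV
flag that the kernel printed in the trap message.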

Looks like the problem was software after all. Since I originally sent my plea,
I upgraded ORACLE7 from 7.0.12 to 7.0.15. Wow! What an improvement. These
particular panics went away, but the system locked up tight a couple of times.
Since no errors were logged, I never knew what was going on.

A few days after the ORACLE7 upgrade, I upgraded the OS from Solaris 2.2 to 2.3.
Wow2! Also a great improvement (after I applied the 15 or so patches, that
is :-). The system has been up since I upgraded to Solaris 2.3 a week ago.

Problem 2:
| The only other repeated error I have seen in the "messages" file is like
| the following:
|
| Nov 8 08:09:04 unix: le2:
| Nov 8 08:09:04 unix: Memory error!
| Nov 8 08:52:36 unix: le2:
| Nov 8 08:52:36 unix: Memory error!

The Solaris 2.3 upgrade seems to have taken care of these errors as well. Nary
a one in the last week.
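
To check whether these errors have really stopped, the "messages" file can be
scanned for them. The following is a minimal Python sketch, assuming the
two-line format quoted above (an interface line such as "le2:" followed by a
"Memory error!" line). The function name and the path in the usage block are
hypothetical, and the optional hostname field is an assumption about how
syslog normally writes these lines.

```python
import re
from collections import Counter

# Timestamp, optional hostname (assumed), then "unix:" and the message text.
LINE = re.compile(r"^\w{3}\s+\d+\s+\d\d:\d\d:\d\d\s+(?:\S+\s+)?unix:\s*(.*)$")
DEVICE = re.compile(r"(le\d+):")


def count_memory_errors(lines):
    """Count 'Memory error!' messages per le interface."""
    counts = Counter()
    last_device = None
    for line in lines:
        match = LINE.match(line.strip())
        if not match:
            last_device = None
            continue
        text = match.group(1).strip()
        device = DEVICE.fullmatch(text)
        if device:
            # Remember the interface named on the preceding line.
            last_device = device.group(1)
        elif text == "Memory error!" and last_device:
            counts[last_device] += 1
            last_device = None
        else:
            last_device = None
    return counts


if __name__ == "__main__":
    # Hypothetical path; adjust to wherever the messages file lives.
    with open("/var/adm/messages") as handle:
        for device, total in count_memory_errors(handle).items():
            print(device, total)
```

Running this before and after an upgrade gives a quick per-interface count to
compare.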

Bottom line: If you are currently running Solaris 2.2, upgrade to Solaris 2.3
ASAP.

Thanks to the following respondents:
Stephen.G.Scott@att.com
Meno Abels <abels@bach.helios.de>
stern@sunne.East.Sun.COM (Hal Stern - NE Area Systems Engineer)
ddvl@abcomp.be (Dany De Vleeschhauwer)
mh@bacsun.co.at (Martin Hofbauer Bacher Systems EDV GmbH)
Robert Ogren <rmo@teltechlabs.com>

Steve.

---
Steve Kirkpatrick     |  Steve.Kirkpatrick@artecon.com  |  Artecon, Inc. 
MIS Group Leader      |  (619) 431-4478 (Voice)         |  2460 Impala Drive
                      |  (619) 931-5500 (FAX)           |  Carlsbad, CA 92008


