SUMMARY(3): Sun Fire x2270 AHCI watchdog timeout

From: Bertold Kolics <bertold.kolics_at_unboundid.com>
Date: Wed Nov 24 2010 - 19:18:47 EST
SUMMARY(3):
This is a follow-up to my previous summary.

Oracle has not fixed this issue yet, unfortunately. If you run into this
issue, I encourage you to open a support case. I got reports and confirmed
myself that this is still an issue in the Oracle Solaris 10 09/10 release.

The only other workaround provided by Oracle so far was to disable the CPU
power management features in the BIOS. My test system seems to be working fine
after I disabled the C-State CPU Power Management feature in the BIOS and
disabled the Solaris power daemon (svcadm disable power).

Bertold

SUMMARY(2):
This is a follow-up to my message I sent 12/17/2009. I have been working with
Sun support every since. Sun suggested disabling CPU power management.
Unfortunately, this did not resolve the issue.

The only workaround I have found so far was to downgrade to Solaris 10 5/09. I
have been running several x2270 systems using Solaris 10 5/09 without any
hangs for several weeks.

ORIGINAL ISSUE:
I have Sun Fire x2270 system running Solaris 10 10/09 and using 4 internal
SATA disks. The disks in this system are mirrored 4-way using ZFS. The system
locks up every 2-3 days. When this happens, I can't login from the console (I
can only enter the login name, but I never get to the password prompt).

After power cycling the server,
- fmdump does not show any errors,
- /var/crash/<hostname> is empty,
- ZFS utilities don't indicate any disk errors,
- the service processor's event log has no relevant records,
- and the below lines can be seen in /var/adm/messages (i.e. these are the
last messages in the log before the reboot):

Dec  5 22:55:56 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 2 satapkt 0xfffffe9a07ea2540 timed out
Dec  5 22:56:11 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 0 satapkt 0xfffffe9a07ea21c0 timed out
Dec  5 22:56:11 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 0 satapkt 0xfffffe9a07ea0380 timed out
Dec  5 22:56:11 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 1 satapkt 0xfffffe9a07e71b60 timed out
Dec  5 22:56:11 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 1 satapkt 0xffffffffaec87b68 timed out
Dec  5 22:56:11 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 1 satapkt 0xfffffe996cdb4e08 timed out
Dec  5 22:56:11 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 1 satapkt 0xfffffe996cdbac48 timed out
Dec  5 22:56:11 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 3 satapkt 0xfffffe996ec23b60 timed out
Dec  5 22:56:11 x2270-17 ahci: [ID 517647 kern.warning] WARNING: ahci0:
watchdog port 3 satapkt 0xfffffe9a07e56d28 timed out

The system is on the latest firmware/BIOS/service processor release available
from Sun.
---
Bertold Kolics <bertold.kolics@unboundid.com>
Phone: +1.512.600.7706, Fax: +1.512.600.7799

[demime 1.01b removed an attachment of type application/pkcs7-signature which had a name of smime.p7s]
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers
Received on Wed Nov 24 19:20:02 2010

This archive was generated by hypermail 2.1.8 : Thu Mar 03 2016 - 06:44:17 EST