Summary: Mirror down in SunFire 240R

From: Andreas Höschler <ahoesch_at_smartsoft.de>
Date: Fri Apr 08 2005 - 07:35:10 EDT
Dear managers,

thanks  a lot for your very helpful responses:

Daniel Nuno <daniel.nuno@gmail.com>
Adam Tomkinson <adam.tomkinson@britannia.co.uk>
"Thomas M. Payerle" <payerle@physics.umd.edu>
"Dell, Mary" <mdell@mesirowfinancial.com>
"Tom Grassia" <tgrassia@sfnewmexican.com>
"Michael Schneider/calispera.com" <michael.schneider@calispera.com>
Steve Sandau <ssandau@gwi.net>
"joe_fletcher" <joe_fletcher@btconnect.com>
Sam Nelson <sam@unix.ms>
"Harrington, David B (Contractor) (J6R)" <David.Harrington.ctr@dla.mil>
Gary Chambers <gwc@ll.mit.edu>
"NO UCE" <nouce@mighty.co.za>

I used format to verify that we have a bad disk

AVAILABLE DISK SELECTIONS:
        0. c1t0d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
           /pci@1c,600000/scsi@2/sd@0,0
        1. c1t1d0 <drive not available>
           /pci@1c,600000/scsi@2/sd@1,0

I broke the mirror

> metadetach -f d0 d20
> metadetach -f d1 d21
> metadetach -f d3 d23
>
> metaclear -f d23 1 1 c1t1d0s3
> metaclear -f d21 1 1 c1t1d0s1
> metaclear -f d20 1 1 c1t1d0s0

Since the machine was only half a year old this problem was resolved by
Sun. They sent a technician that replaced the disk and most likely (I
could not be present) did the following to rebuild the mirror.

	dd if=/dev/rdsk/c1t0d0s2 of=/dev/rdsk/c1t1d0s2 count=16

	metadb -a -c 2 c1t1d0s7

	metainit d20 1 1 c1t1d0s0
	metainit d21 1 1 c1t1d0s1
	metainit d23 1 1 c1t1d0s3

	metattach d0 d20
	metattach d1 d21
	metattach d3 d23

What is the difference between

	dd if=/dev/rdsk/c1t0d0s2 of=/dev/rdsk/c1t1d0s2 count=16

and

	prtvtoc /dev/rdsk/c1t0d0s2 | fmthard -s - /dev/rdsk/c1t1d0s2

The second was recommended since dd was said to copy the whole disk.
But as I understand it the above dd command only copies the partition
table (partition s2). Is this correct or wrong?

Thanks a lot!

Regards,

   Andreas


Original question:
====================================================================
From: Andreas Hvschler <ahoesch@smartsoft.de>
Date: Tue Mar 29, 2005  10:19:45 PM Europe/Berlin
To: sunmanagers@sunmanagers.org
Subject: Mirror down in SunFire 240R

Dear managers,

we have a production machine Sun Fire 240 with two mirrored 73 GByte
SCSI-disks. The mirror was fine for a year. Now I get the following:

bash-2.05# metastat
d3: Mirror
     Submirror 0: d13
       State: Okay
     Submirror 1: d23
       State: Needs maintenance
     Pass: 1
     Read option: roundrobin (default)
     Write option: parallel (default)
     Size: 10247232 blocks (4.9 GB)

d13: Submirror of d3
     State: Okay
     Size: 10247232 blocks (4.9 GB)
     Stripe 0:
         Device     Start Block  Dbase        State Reloc Hot Spare
         c1t0d0s3          0     No            Okay   Yes


d23: Submirror of d3
     State: Unavailable
     Size: 10247232 blocks (4.9 GB)
     Stripe 0:
         Device     Start Block  Dbase        State Reloc Hot Spare
         c1t1d0s3          0     No               -   Yes


d1: Mirror
     Submirror 0: d11
       State: Okay
     Submirror 1: d21
       State: Needs maintenance
     Pass: 1
     Read option: roundrobin (default)
     Write option: parallel (default)
     Size: 14337984 blocks (6.8 GB)

d11: Submirror of d1
     State: Okay
     Size: 14337984 blocks (6.8 GB)
     Stripe 0:
         Device     Start Block  Dbase        State Reloc Hot Spare
         c1t0d0s1          0     No            Okay   Yes


d21: Submirror of d1
     State: Unavailable
     Size: 14337984 blocks (6.8 GB)
     Stripe 0:
         Device     Start Block  Dbase        State Reloc Hot Spare
         c1t1d0s1          0     No               -   Yes


d0: Mirror
     Submirror 0: d10
       State: Okay
     Submirror 1: d20
       State: Needs maintenance
     Pass: 1
     Read option: roundrobin (default)
     Write option: parallel (default)
     Size: 114591936 blocks (54 GB)

d10: Submirror of d0
     State: Okay
     Size: 114591936 blocks (54 GB)
     Stripe 0:
         Device     Start Block  Dbase        State Reloc Hot Spare
         c1t0d0s0          0     No            Okay   Yes


d20: Submirror of d0
     State: Unavailable
     Size: 114591936 blocks (54 GB)
     Stripe 0:
         Device     Start Block  Dbase        State Reloc Hot Spare
         c1t1d0s0          0     No               -   Yes


Device Relocation Information:
Device   Reloc  Device ID
c1t1d0   Yes    id1,sd@SSEAGATE_ST373307LSUN72G_3HZ6Y3DJ00007431G1EF
c1t0d0   Yes    id1,sd@SSEAGATE_ST373307LSUN72G_3HZ6Y1YR00007431G08H

The system is up and running but very obviously there is something
wrong with the second disk. I have never encountered such a situation
and am not sure what to do now. I would really appreciate your hints
before I do something wrong with a production machine.

Thanks a lot in advance!

Regards,

   Andreas
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers
Received on Fri Apr 8 07:35:44 2005

This archive was generated by hypermail 2.1.8 : Thu Mar 03 2016 - 06:43:45 EST