User Tools

Site Tools


zfs:troubleshooting:replace_a_disk

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
zfs:troubleshooting:replace_a_disk [2021/10/13 23:41] – [Replace the Old Device] peterzfs:troubleshooting:replace_a_disk [2021/10/13 23:58] (current) – [Replace the Old Device] peter
Line 74: Line 74:
 </WRAP> </WRAP>
  
-<WRAP info>+<WRAP important>
 ==== Potential Issues ==== ==== Potential Issues ====
  
Line 85: Line 85:
   * This is because the label of the drive that died does not exist in the system any more.   * This is because the label of the drive that died does not exist in the system any more.
   * Therefore the bad device cannot be specified by ID.   * Therefore the bad device cannot be specified by ID.
-  * If this case, try specifying it by device name or by GUID:+  * If this case, try specifying it by device name or by GUID. 
 + 
 +---- 
 + 
 +There are various ways to determine a GUID:
  
 <code bash> <code bash>
Line 94: Line 98:
 </code> </code>
  
-  * If zdb does not output anything, try specifying the device.+---- 
 + 
 +Try to get the GUID using zdb
 + 
 +<code bash> 
 +zdb 
 +testpool: 
 +    version: 28 
 +    name: 'testpool' 
 +    state: 0 
 +    txg: 162804 
 +    pool_guid: 14829240649900366534 
 +    hostname: 'BigMamba' 
 +    vdev_children:
 +    vdev_tree: 
 +        type: 'root' 
 +        id: 0 
 +        guid: 14829240649900366534 
 +        children[0]: 
 +            type: 'raidz' 
 +            id: 0 
 +            guid: 5355850150368902284 
 +            nparity: 1 
 +            metaslab_array: 31 
 +            metaslab_shift: 32 
 +            ashift: 9 
 +            asize: 791588896768 
 +            is_log: 0 
 +            create_txg: 4 
 +            children[0]: 
 +                type: 'disk' 
 +                id: 0 
 +                guid: 11426107064765252810 
 +                path: '/dev/disk/by-id/ata-ST3300620A_5QF0MJFP-part2' 
 +                phys_path: '/dev/gptid/73b31683-537f-11e2-bad7-50465d4eb8b0' 
 +                whole_disk: 1 
 +                create_txg: 4 
 +            children[1]: 
 +                type: 'disk' 
 +                id: 1 
 +                guid: 15935140517898495532 
 +                path: '/dev/disk/by-id/ata-ST3300831A_5NF0552X-part2' 
 +                phys_path: '/dev/gptid/746c949a-537f-11e2-bad7-50465d4eb8b0' 
 +                whole_disk: 1 
 +                create_txg: 4 
 +            children[2]: 
 +                type: 'disk' 
 +                id: 2 
 +                guid: 7183706725091321492 
 +                path: '/dev/disk/by-id/ata-ST3200822A_5LJ1CHMS-part2' 
 +                phys_path: '/dev/gptid/7541115a-537f-11e2-bad7-50465d4eb8b0' 
 +                whole_disk: 1 
 +                create_txg: 4 
 +            children[3]: 
 +                type: 'disk' 
 +                id: 3 
 +                guid: 17196042497722925662 
 +                path: '/dev/disk/by-id/ata-ST3200822A_3LJ0189C-part2' 
 +                phys_path: '/dev/gptid/760a94ee-537f-11e2-bad7-50465d4eb8b0' 
 +                whole_disk: 1 
 +                create_txg: 4 
 +    features_for_read: 
 +</code> 
 + 
 +<WRAP info> 
 +**NOTE:**  The GUID can be ascertained as 15935140517898495532. 
 +</WRAP> 
 + 
 +Use the GUID to offline the old device
 + 
 +<code bash> 
 +zpool offline testpool 15935140517898495532 
 +</code> 
 + 
 +And check this has worked: 
 + 
 +<code bash> 
 +zpool status 
 +  pool: testpool 
 + state: DEGRADED 
 +status: One or more devices has been taken offline by the administrator. 
 +        Sufficient replicas exist for the pool to continue functioning in a 
 +        degraded state. 
 +action: Online the device using 'zpool online' or replace the device with 
 +        'zpool replace'
 +  scan: scrub repaired 0 in 2h4m with 0 errors on Sun Jun  9 00:28:24 2013 
 +config: 
 + 
 +        NAME                         STATE     READ WRITE CKSUM 
 +        testpool                     DEGRADED             0 
 +          raidz1-0                   DEGRADED             0 
 +            ata-ST3300620A_5QF0MJFP  ONLINE               0 
 +            ata-ST3300831A_5NF0552X  OFFLINE      0         0 
 +            ata-ST3200822A_5LJ1CHMS  ONLINE               0 
 +            ata-ST3200822A_3LJ0189C  ONLINE               0 
 + 
 +errors: No known data errors 
 +</code> 
 + 
 +and then replace the pool: 
 + 
 +<code bash> 
 +zpool replace testpool 15935140517898495532 /dev/disk/by-id/ata-ST3500320AS_9QM03ATQ 
 +</code> 
 + 
 +And check again this has worked: 
 + 
 +<code bash> 
 +zpool status 
 +  pool: testpool 
 + state: DEGRADED 
 +status: One or more devices is currently being resilvered.  The pool will 
 +        continue to function, possibly in a degraded state. 
 +action: Wait for the resilver to complete. 
 +  scan: resilver in progress since Sun Jun  9 01:44:36 2013 
 +    408M scanned out of 419G at 20,4M/s, 5h50m to go 
 +    101M resilvered, 0,10% done 
 +config: 
 + 
 +        NAME                            STATE     READ WRITE CKSUM 
 +        testpool                        DEGRADED             0 
 +          raidz1-0                      DEGRADED             0 
 +            ata-ST3300620A_5QF0MJFP     ONLINE               0 
 +            replacing-1                 OFFLINE      0         0 
 +              ata-ST3300831A_5NF0552X   OFFLINE      0         0 
 +              ata-ST3500320AS_9QM03ATQ  ONLINE                (resilvering) 
 +            ata-ST3200822A_5LJ1CHMS     ONLINE               0 
 +            ata-ST3200822A_3LJ0189C     ONLINE               0 
 + 
 +errors: No known data errors 
 +</code> 
 </WRAP> </WRAP>
  
zfs/troubleshooting/replace_a_disk.1634168490.txt.gz · Last modified: 2021/10/13 23:41 by peter

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki