我遇到了断电/电涌事件,导致我的 Synology 磁盘站上的 RAID10 处于不可用状态。阵列是 RAID10 中的 8x 6TB 磁盘。我可以通过 SSH 访问 NAS 并进行了一些基本分析。

磁盘 1-3 正常。磁盘 5-8 显示一个或两个 LBA 的不可纠正读取错误(全部位于同一位置)。磁盘 5-8 也不再有 RAID 超级块。Fdisk 可以看到所有磁盘上的所有分区。

我已经能够确定不再具有超级块的成员的 UUID。我的直觉是,我可以恢复大部分数据,但会丢失一些文件或目录中的一些小数据,但我需要重新创建阵列。由于这个阵列太大,我没有足够的空间进行完整克隆。

也请不要再提醒我备份了。我备份了部分数据。大部分数据没有备份。有些我真正想找回的数据没有备份。我完全意识到了自己的错误。我不想被教导是否应该备份以及如何从问题和数据恢复步骤中转移注意力。谢谢。

请参阅以下有用的输出(mdadm、smartctl 等):

smartctl 自检日志(坏磁盘的示例,由于帖子限制而被截断):

编辑:在我的台式电脑上检查 Synology smartctl 测试中报告读取错误的磁盘之一后,测试通过且没有读取错误。然后我将其放回 nas 中并再次检查,nas 再次看到读取错误。我怀疑 SATA 背板卡在端口 5 到 8 上坏了。该设备保修期仅剩 2 个月,但 Synology 将查看是否有库存的替换卡可供购买。

编辑 2:有人知道我是否可以取出 RAID10 磁盘的前半部分(或后半部分)(它们是按数字顺序添加的)并以任何方式尝试基于 RAID1 进行恢复吗?这将减少我的尝试范围,减少存储 ddrescue 转储所需的存储量,并且基本上给我两次恢复的机会。我认为组成一个镜像的 4 个磁盘中有 3 个标记为良好且完全可读。在第 4 个驱动器上丢失一个扇区肯定不是游戏结束的情况?


=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       90%     27829         9437200
# 2  Short offline       Completed: read failure       90%     27826         9437192
# 3  Short offline       Completed without error       00%     17628         -
# 4  Short offline       Completed without error       00%     16884         -
# 5  Short offline       Completed without error       00%     16164         -
# 6  Short offline       Completed without error       00%     15420         -
# 7  Short offline       Completed without error       00%     14700         -
# 8  Short offline       Completed without error       00%     13908         -
# 9  Short offline       Completed without error       00%     13164         -
#10  Short offline       Completed without error       00%     12420         -
#11  Short offline       Completed without error       00%     11700         -
#12  Short offline       Completed without error       00%     10955         -
#13  Short offline       Completed without error       00%     10235         -
#14  Short offline       Completed without error       00%      9491         -
#15  Short offline       Completed without error       00%      8748         -
#16  Short offline       Completed without error       00%      8027         -
#17  Short offline       Completed without error       00%      7283         -
#18  Short offline       Completed without error       00%      6563         -
#19  Short offline       Completed without error       00%      5820         -
#20  Short offline       Completed without error       00%      5148         -
#21  Short offline       Completed without error       00%      4404         -

smartctl 6.5 (build date Sep 26 2022) [x86_64-linux-4.4.302+] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       90%     27828         9437192
# 2  Short offline       Completed: read failure       90%     27826         9437192
# 3  Extended offline    Completed: read failure       90%     27825         9437200
# 4  Short offline       Completed without error       00%     17628         -
# 5  Short offline       Completed without error       00%     16884         -
# 6  Short offline       Completed without error       00%     16164         -
# 7  Short offline       Completed without error       00%     15420         -
# 8  Short offline       Completed without error       00%     14700         -
# 9  Short offline       Completed without error       00%     13908         -
#10  Short offline       Completed without error       00%     13164         -
#11  Short offline       Completed without error       00%     12420         -
#12  Short offline       Completed without error       00%     11700         -
#13  Short offline       Completed without error       00%     10955         -
#14  Short offline       Completed without error       00%     10235         -
#15  Short offline       Completed without error       00%      9491         -
#16  Short offline       Completed without error       00%      8748         -
#17  Short offline       Completed without error       00%      8027         -
#18  Short offline       Completed without error       00%      7283         -
#19  Short offline       Completed without error       00%      6563         -
#20  Short offline       Completed without error       00%      5820         -
#21  Short offline       Completed without error       00%      5148         -

smartctl 6.5 (build date Sep 26 2022) [x86_64-linux-4.4.302+] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       90%     27828         9437192
# 2  Short offline       Completed: read failure       90%     27826         9437192
# 3  Short offline       Completed without error       00%     17628         -
# 4  Short offline       Completed without error       00%     16884         -
# 5  Short offline       Completed without error       00%     16164         -
# 6  Short offline       Completed without error       00%     15420         -
# 7  Short offline       Completed without error       00%     14700         -
# 8  Short offline       Completed without error       00%     13908         -
# 9  Short offline       Completed without error       00%     13164         -
#10  Short offline       Completed without error       00%     12420         -
#11  Short offline       Completed without error       00%     11700         -
#12  Short offline       Completed without error       00%     10955         -
#13  Short offline       Completed without error       00%     10235         -
#14  Short offline       Completed without error       00%      9491         -
#15  Short offline       Completed without error       00%      8747         -
#16  Short offline       Completed without error       00%      8027         -
#17  Short offline       Completed without error       00%      7283         -
#18  Short offline       Completed without error       00%      6563         -
#19  Short offline       Completed without error       00%      5820         -
#20  Short offline       Completed without error       00%      5148         -
#21  Short offline       Completed without error       00%      4404         -

smartctl 6.5 (build date Sep 26 2022) [x86_64-linux-4.4.302+] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       90%     27828         9437192
# 2  Short offline       Completed: read failure       90%     27826         9437192
# 3  Short offline       Completed without error       00%     17628         -
# 4  Short offline       Completed without error       00%     16884         -
# 5  Short offline       Completed without error       00%     16164         -
# 6  Short offline       Completed without error       00%     15420         -
# 7  Short offline       Completed without error       00%     14700         -
# 8  Short offline       Completed without error       00%     13908         -
# 9  Short offline       Completed without error       00%     13164         -
#10  Short offline       Completed without error       00%     12420         -
#11  Short offline       Completed without error       00%     11700         -
#12  Short offline       Completed without error       00%     10955         -
#13  Short offline       Completed without error       00%     10235         -
#14  Short offline       Completed without error       00%      9491         -
#15  Short offline       Completed without error       00%      8747         -
#16  Short offline       Completed without error       00%      8027         -
#17  Short offline       Completed without error       00%      7283         -
#18  Short offline       Completed without error       00%      6563         -
#19  Short offline       Completed without error       00%      5820         -
#20  Short offline       Completed without error       00%      5148         -
#21  Short offline       Completed without error       00%      4404         -

smartctl 6.5 (build date Sep 26 2022) [x86_64-linux-4.4.302+] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       90%     27826         9437192
# 2  Short offline       Completed without error       00%     17628         -
# 3  Short offline       Completed without error       00%     16884         -
# 4  Short offline       Completed without error       00%     16164         -
# 5  Short offline       Completed without error       00%     15420         -
# 6  Short offline       Completed without error       00%     14700         -
# 7  Short offline       Completed without error       00%     13908         -
# 8  Short offline       Completed without error       00%     13164         -
# 9  Short offline       Completed without error       00%     12420         -
#10  Short offline       Completed without error       00%     11700         -
#11  Short offline       Completed without error       00%     10955         -
#12  Short offline       Completed without error       00%     10235         -
#13  Short offline       Completed without error       00%      9491         -
#14  Short offline       Completed without error       00%      8747         -
#15  Short offline       Completed without error       00%      8027         -
#16  Short offline       Completed without error       00%      7283         -
#17  Short offline       Completed without error       00%      6563         -
#18  Short offline       Completed without error       00%      5820         -
#19  Short offline       Completed without error       00%      5148         -
#20  Short offline       Completed without error       00%      4404         -
#21  Short offline       Completed without error       00%      3660         -

mdadm 检查

root@nas:~# for disk in {1..8}; do mdadm --examine /dev/sata${disk}p3; done
/dev/sata1p3:
          Magic : a92b4efc
        Version : 1.2
    Feature Map : 0x0
     Array UUID : e0dbf189:7ea6486d:781cfdbe:7460653e
           Name : syn_nas:2
  Creation Time : Mon Jul  5 12:42:12 2021
     Raid Level : raid10
   Raid Devices : 8

 Avail Dev Size : 11711401120 (5584.43 GiB 5996.24 GB)
     Array Size : 23422802176 (22337.72 GiB 23984.95 GB)
  Used Dev Size : 11711401088 (5584.43 GiB 5996.24 GB)
    Data Offset : 2048 sectors
   Super Offset : 8 sectors
   Unused Space : before=1968 sectors, after=32 sectors
          State : active
    Device UUID : 5b7d825b:a8f72b91:80b9f543:f61f4901

    Update Time : Wed Oct  2 18:52:24 2024
       Checksum : a567977c - correct
         Events : 1064

         Layout : near=2
     Chunk Size : 64K

   Device Role : Active device 0
   Array State : AAAAAAAA ('A' == active, '.' == missing, 'R' == replacing)
/dev/sata2p3:
          Magic : a92b4efc
        Version : 1.2
    Feature Map : 0x0
     Array UUID : e0dbf189:7ea6486d:781cfdbe:7460653e
           Name : syn_nas:2
  Creation Time : Mon Jul  5 12:42:12 2021
     Raid Level : raid10
   Raid Devices : 8

 Avail Dev Size : 11711401120 (5584.43 GiB 5996.24 GB)
     Array Size : 23422802176 (22337.72 GiB 23984.95 GB)
  Used Dev Size : 11711401088 (5584.43 GiB 5996.24 GB)
    Data Offset : 2048 sectors
   Super Offset : 8 sectors
   Unused Space : before=1968 sectors, after=32 sectors
          State : active
    Device UUID : 6598be06:4f6cdcb0:4c2ede80:b8634a8e

    Update Time : Wed Oct  2 18:52:24 2024
       Checksum : 3a3ddfbd - correct
         Events : 1064

         Layout : near=2
     Chunk Size : 64K

   Device Role : Active device 1
   Array State : AAAAAAAA ('A' == active, '.' == missing, 'R' == replacing)
/dev/sata3p3:
          Magic : a92b4efc
        Version : 1.2
    Feature Map : 0x0
     Array UUID : e0dbf189:7ea6486d:781cfdbe:7460653e
           Name : syn_nas:2
  Creation Time : Mon Jul  5 12:42:12 2021
     Raid Level : raid10
   Raid Devices : 8

 Avail Dev Size : 11711401120 (5584.43 GiB 5996.24 GB)
     Array Size : 23422802176 (22337.72 GiB 23984.95 GB)
  Used Dev Size : 11711401088 (5584.43 GiB 5996.24 GB)
    Data Offset : 2048 sectors
   Super Offset : 8 sectors
   Unused Space : before=1968 sectors, after=32 sectors
          State : active
    Device UUID : b249bbb5:5cd44246:86bc5f22:164682a4

    Update Time : Wed Oct  2 18:52:24 2024
       Checksum : 365a69b0 - correct
         Events : 1064

         Layout : near=2
     Chunk Size : 64K

   Device Role : Active device 2
   Array State : AAAAAAAA ('A' == active, '.' == missing, 'R' == replacing)
mdadm: No md superblock detected on /dev/sata4p3.
mdadm: No md superblock detected on /dev/sata5p3.
mdadm: No md superblock detected on /dev/sata6p3.
mdadm: No md superblock detected on /dev/sata7p3.
mdadm: No md superblock detected on /dev/sata8p3.

spacetool_synoblock_enum (synology 工具)

root@nas:~# spacetool --synoblock-enum
****** Syno-Block of /dev/sata1 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: Storage Pool
Space Path: @storage_pool
Reuse ID: [reuse_1]
Descriptions: [raid10]
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_24
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
PV count: 1
VG path: /dev/vg1
VG UUID: [rJ0X3K-x3cp-Zw8G-0f2U-8BU2-D3VN-y1vlMk]
RAID UUID: [e0dbf189:7ea6486d:781cfdbe:7460653e]

****** Syno-Block of /dev/sata2 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: Storage Pool
Space Path: @storage_pool
Reuse ID: [reuse_1]
Descriptions: [raid10]
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_24
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
PV count: 1
VG path: /dev/vg1
VG UUID: [rJ0X3K-x3cp-Zw8G-0f2U-8BU2-D3VN-y1vlMk]
RAID UUID: [e0dbf189:7ea6486d:781cfdbe:7460653e]

****** Syno-Block of /dev/sata3 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: Storage Pool
Space Path: @storage_pool
Reuse ID: [reuse_1]
Descriptions: [raid10]
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_24
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
PV count: 1
VG path: /dev/vg1
VG UUID: [rJ0X3K-x3cp-Zw8G-0f2U-8BU2-D3VN-y1vlMk]
RAID UUID: [e0dbf189:7ea6486d:781cfdbe:7460653e]

****** Syno-Block of /dev/sata4 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: Storage Pool
Space Path: @storage_pool
Reuse ID: [reuse_1]
Descriptions: [raid10]
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_24
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
PV count: 1
VG path: /dev/vg1
VG UUID: [rJ0X3K-x3cp-Zw8G-0f2U-8BU2-D3VN-y1vlMk]
RAID UUID: [e0dbf189:7ea6486d:781cfdbe:7460653e]

****** Syno-Block of /dev/sata5 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: Storage Pool
Space Path: @storage_pool
Reuse ID: [reuse_1]
Descriptions: [raid10]
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_24
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
PV count: 1
VG path: /dev/vg1
VG UUID: [rJ0X3K-x3cp-Zw8G-0f2U-8BU2-D3VN-y1vlMk]
RAID UUID: [e0dbf189:7ea6486d:781cfdbe:7460653e]

****** Syno-Block of /dev/sata6 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: Storage Pool
Space Path: @storage_pool
Reuse ID: [reuse_1]
Descriptions: [raid10]
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_24
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
PV count: 1
VG path: /dev/vg1
VG UUID: [rJ0X3K-x3cp-Zw8G-0f2U-8BU2-D3VN-y1vlMk]
RAID UUID: [e0dbf189:7ea6486d:781cfdbe:7460653e]
****** Syno-Block of /dev/nvme0n1 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: SSD Cache
Space Path: 83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT
Reuse ID: []
Descriptions: []
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_12
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
RAID UUID: [470b94ec:85ecef3e:3244234e:bf658f97]

****** Syno-Block of /dev/nvme1n1 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: SSD Cache
Space Path: 83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT
Reuse ID: []
Descriptions: []
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_12
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
RAID UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
****** Syno-Block of /dev/sata7 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: Storage Pool
Space Path: @storage_pool
Reuse ID: [reuse_1]
Descriptions: [raid10]
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_24
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
PV count: 1
VG path: /dev/vg1
VG UUID: [rJ0X3K-x3cp-Zw8G-0f2U-8BU2-D3VN-y1vlMk]
RAID UUID: [e0dbf189:7ea6486d:781cfdbe:7460653e]

****** Syno-Block of /dev/sata8 ******
Version: 7
Capability: 0x2
Features: 1
Space Type: Storage Pool
Space Path: @storage_pool
Reuse ID: [reuse_1]
Descriptions: [raid10]
Volume Description: []
Cross EBox: FALSE
Limited RAID Group: SYNO_24
Cache Info:
        Cache Mode: Read
        Cache Type: SSD Cache
        SSD UUID: [470b94ec:85ecef3e:3244234e:bf658f97]
        Cache size: 429496729600
        Shared Cache ID: 0
        Allocated Cache Sub ID: 0
        SSD ID: 1
        Target Space UUID: [83OFdO-o3zP-JVXO-N841-vfRv-dM6x-AdacPT]
        Skip sequential IO: 1
        Max Degrade Flush: 0
        APM Flush Done: 0
        Allowed Drop Cache: 0
PV count: 1
VG path: /dev/vg1
VG UUID: [rJ0X3K-x3cp-Zw8G-0f2U-8BU2-D3VN-y1vlMk]
RAID UUID: [e0dbf189:7ea6486d:781cfdbe:7460653e]

mdadm 扫描

root@nas:~# mdadm --assemble --scan
mdadm: /dev/md/syn_nas:2 assembled from 3 drives - not enough to start the array.

7

  • 1
    如果可以,第一步是制作故障磁盘的磁盘映像。使用ddrescue或类似的东西。


    – 

  • 我很想这么做,但我没有足够的存储空间(24TB)。我曾考虑过将映像复制到 backblaze 或类似产品,但我不知道可以使用哪个命令行客户端。


    – 

  • 然后找到一个具有必要存储资源的付费服务来执行此操作。是的,您需要将此转储作为第一阶段。


    – 

  • 好的,太好了,谢谢。那么接下来的步骤是什么呢?


    – 


  • 可以使用与首次创建超级块时完全相同的参数来重建 RAID 超级块。


    – 

0