ZFS会一遍又一遍地重新启动重新同步。 也很慢

zpool中进行了几次磁盘replace之后,这是我的一台Solaris 10服务器

pool: volume state: ONLINE status: One or more devices is currently being resilvered. The pool will continue to function, possibly in a degraded state. action: Wait for the resilver to complete. scan: resilver in progress since Wed Jul 6 11:43:51 2016 207M scanned out of 36.7T at 1.48M/s, (scan is slow, no estimated time) 207M scanned out of 36.7T at 1.48M/s, 7235h37m to go 13.5M resilvered, 0.00% done config: NAME STATE READ WRITE CKSUM volume ONLINE 0 0 0 raidz2-0 ONLINE 0 0 0 c4t0d0 ONLINE 0 0 1 c4t0d1 ONLINE 0 0 0 c4t0d2 ONLINE 0 0 0 c4t0d3 ONLINE 0 0 0 c4t0d22 ONLINE 0 0 0 c4t0d5 ONLINE 0 0 0 c4t0d6 ONLINE 0 0 0 c4t0d23 ONLINE 0 0 0 (resilvering) c4t0d15 ONLINE 0 0 0 c4t0d9 ONLINE 0 0 0 c4t0d10 ONLINE 0 0 0 raidz2-1 ONLINE 0 0 4 c4t0d11 ONLINE 0 0 0 c4t0d8 ONLINE 0 0 0 (resilvering) c4t0d13 ONLINE 0 0 0 c4t0d14 ONLINE 0 0 0 c4t0d20 ONLINE 0 0 0 c4t0d16 ONLINE 0 0 0 c4t0d4 ONLINE 0 0 0 c4t0d18 ONLINE 0 0 2 c4t0d19 ONLINE 0 0 0 c4t0d17 ONLINE 0 0 0 c4t0d21 ONLINE 0 0 0 errors: No known data errors 

扫描状态每10-15分钟返回到0.00% done ,重新启动重新同步。 这是echo "::zfs_dbgmsg" | mdb -k的输出 echo "::zfs_dbgmsg" | mdb -k

iostat -En在所有磁盘上显示出(并且越来越多)的错误。

zpool iostat -v volume显示正常的重新同步活动(写入新磁盘,从旧磁盘读取)

/var/adm/messages充满了这样的消息:

 Jul 6 12:08:25 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1/sd@0,15 (sd20): Jul 6 12:08:25 raid2 SCSI transport failed: reason 'reset': retrying command Jul 6 12:08:28 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1/sd@0,b (sd8): Jul 6 12:08:28 raid2 Error for Command: read(10) Error Level: Retryable Jul 6 12:08:28 raid2 scsi: [ID 107833 kern.notice] Requested Block: 21523458 Error Block: 21523458 Jul 6 12:08:28 raid2 scsi: [ID 107833 kern.notice] Vendor: transtec Serial Number: 63881076-00 Jul 6 12:08:28 raid2 scsi: [ID 107833 kern.notice] Sense Key: Unit Attention Jul 6 12:08:28 raid2 scsi: [ID 107833 kern.notice] ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0 Jul 6 12:09:35 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1 (mpt0): Jul 6 12:09:35 raid2 Disconnected command timeout for Target 0 Jul 6 12:09:39 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1/sd@0,15 (sd20): Jul 6 12:09:39 raid2 incomplete read- retrying Jul 6 12:10:46 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1 (mpt0): Jul 6 12:10:46 raid2 Disconnected command timeout for Target 0 Jul 6 12:10:49 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1/sd@0,e (sd11): Jul 6 12:10:49 raid2 incomplete read- retrying Jul 6 12:11:56 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1 (mpt0): Jul 6 12:11:56 raid2 Disconnected command timeout for Target 0 Jul 6 12:13:03 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1/sd@0,1 (sd35): Jul 6 12:13:03 raid2 Error for Command: write Error Level: Retryable Jul 6 12:13:03 raid2 scsi: [ID 107833 kern.notice] Requested Block: 644 Error Block: 644 Jul 6 12:13:03 raid2 scsi: [ID 107833 kern.notice] Vendor: transtec Serial Number: 023CEC5B-00 Jul 6 12:13:03 raid2 scsi: [ID 107833 kern.notice] Sense Key: Unit Attention Jul 6 12:13:03 raid2 scsi: [ID 107833 kern.notice] ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0 Jul 6 12:13:03 raid2 scsi: [ID 107833 kern.warning] WARNING: /pci@0,0/pci8086,3597@4/pci8086,329@0/pci1000,1060@1/sd@0,5 (sd2): 

有什么我可以做的,使重复结束,使池可以恢复到正常状态?