Replies: 1 comment 1 reply
-
Assuming you'd be open to upgrading to a newer version of ZFS:
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I’m trying to better understand how ZFS handles disk errors and how to mitigate cases where bad disks are causing performance issues.
Dealing with servers in production environments where availability is top priority. Running older OpenZFS v0.8.6 on AlmaLinux 8.x. Couple of zpools present with many drives, with vdevs cfg’d for RAIDZ3. Disks cannot be replaced in production.
One of the vdevs has only 6 disks and can suffer varying levels of performance degradation when disks have errors and ZFS is performing self healing operations.
It seems like the disks have to be in a pretty bad state before ZFS will give up on them. Looking for some input on strategies for reducing the negative impacts of bad disks by either adjusting ZFS knobs and/or adding mechanism to monitor disk health and offline bad disks either via kernel or ZFS.
Any recommendations for monitoring and mitigating performance impact of failing disks?
Beta Was this translation helpful? Give feedback.
All reactions