-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ZFS Stability Issue...and then long unacceptable recovery time #4199
Comments
@rsvancara I'd strongly encourage you up upgrade your system to the 0.6.5.4 release which was made available just a few days ago. The CentOS repositories have already been updated so you just need to update the system. It contains fixes for the majority and most common deadlocks and stability issues which have been reported. We're working on the remaining issues but since those patches are still under review and testing we didn't want them to hold up this release. https://github.com/zfsonlinux/zfs/releases/tag/zfs-0.6.5.4 As for the worst case recovery (more like cleanup) time this is definitely something we want to tackle but haven't had a chance to yet. In principle there's no reason that recovery can't safely happen in the background after the mount completes. It's just going to take a little care to get it right and tested. |
Thanks, I will give 0.6.5.4 a try. Anything is better than what we have On Sun, Jan 10, 2016 at 11:07 AM, Brian Behlendorf <notifications@github.com
Randall Svancara |
@rsvancara it would be very helpful if you could let us know what issues (if any) you're still seeing after the update. That would help us focus our efforts on the most critical remaining issues. |
Oh you dont have to worry about that. I have this volume that is 10TB and On Sun, Jan 10, 2016 at 11:44 AM, Brian Behlendorf <notifications@github.com
Randall Svancara |
I have installed 0.6.5.1. Going to test it now.... On Sun, Jan 10, 2016 at 12:01 PM, Randall Svancara rsvancara@gmail.com
Randall Svancara |
@rsvancara silence is good I trust? |
I am having good luck so far. I have ran some tests using multiple On Tue, Jan 12, 2016 at 3:20 PM, Brian Behlendorf notifications@github.com
Randall Svancara |
Looks like the issue is closed, feel free to reopen it if it's not. |
Extra Information:
We are experiencing serious stability problems with ZFS on Linux. We are using server class hardware with ECC memory. What I am seeing is that the filesystem is used heavily and it crashes.
When the system does crash, it takes almost 8 hours to recover. Obviously this is a production system and waiting this amount of time is completely unacceptable.
zpool status:
pool: data
state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
the pool may no longer be accessible by software that does not support
the features. See zpool-features(5) for details.
scan: scrub repaired 0 in 0h17m with 0 errors on Tue Jan 5 11:31:46 2016
config:
The Kernel Logs are here:
https://gist.github.com/rsvancara/a3bef531f25b197deab7
zfs get all
NAME PROPERTY VALUE SOURCE
data/fastscratch type filesystem -
data/fastscratch creation Mon Jun 29 19:57 2015 -
data/fastscratch used 691G -
data/fastscratch available 10.9T -
data/fastscratch referenced 691G -
data/fastscratch compressratio 2.53x -
data/fastscratch mounted no -
data/fastscratch quota none default
data/fastscratch reservation none default
data/fastscratch recordsize 64K local
data/fastscratch mountpoint /data/fastscratch default
data/fastscratch sharenfs on local
data/fastscratch checksum on default
data/fastscratch compression on local
data/fastscratch atime off local
data/fastscratch devices on default
data/fastscratch exec on default
data/fastscratch setuid on default
data/fastscratch readonly off default
data/fastscratch zoned off default
data/fastscratch snapdir hidden default
data/fastscratch aclinherit restricted default
data/fastscratch canmount on default
data/fastscratch xattr on default
data/fastscratch copies 1 default
data/fastscratch version 5 -
data/fastscratch utf8only off -
data/fastscratch normalization none -
data/fastscratch casesensitivity sensitive -
data/fastscratch vscan off default
data/fastscratch nbmand off default
data/fastscratch sharesmb off default
data/fastscratch refquota none default
data/fastscratch refreservation none default
data/fastscratch primarycache all local
data/fastscratch secondarycache all default
data/fastscratch usedbysnapshots 0 -
data/fastscratch usedbydataset 691G -
data/fastscratch usedbychildren 0 -
data/fastscratch usedbyrefreservation 0 -
data/fastscratch logbias throughput local
data/fastscratch dedup off default
data/fastscratch mlslabel none default
data/fastscratch sync standard default
data/fastscratch refcompressratio 2.53x -
data/fastscratch written 691G -
data/fastscratch logicalused 1.70T -
data/fastscratch logicalreferenced 1.70T -
data/fastscratch filesystem_limit none default
data/fastscratch snapshot_limit none default
data/fastscratch filesystem_count none default
data/fastscratch snapshot_count none default
data/fastscratch snapdev hidden default
data/fastscratch acltype off default
data/fastscratch context none default
data/fastscratch fscontext none default
data/fastscratch defcontext none default
data/fastscratch rootcontext none default
data/fastscratch relatime off default
data/fastscratch redundant_metadata all default
data/fastscratch overlay off default
The text was updated successfully, but these errors were encountered: