-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Receive of unencrypted dataset crashes system 0.7.11 #8030
Comments
Are you saying that one side is 0.7.11 and the other is 0.7.10, or that you've tested it on both of those versions? Are both the sender and receiver running the same kernel/OS/etc? What's in /proc/spl/kstat/zfs/dbgmsg? (Please just pastebin the whole thing, rather than inlining a few lines.) Are any of the kernel threads pegging the CPU? Are you sure there's not another message like VERIFY3(...) or PANIC[...] above those two? Otherwise those two messages don't necessarily mean anything harmful, just slow. |
The recv side has been up a while, and everything is on zfs root, so its 0.7.10. This has been happening in one way or another for ages though, the cv_wait_common from zfs_ioc_recv calls keep coming up in stack traces. |
I'd suggest not running one of the two blacklisted versions of the 0.7.X series on the receiver, but other than that, the information I requested from you is still needed to do anything useful. |
Blacklisted? |
Both 0.7.7 and 0.7.10 have rather critical flaws, are struck out on the homepage, and have big "don't use this" notices at the top of their respective releases. I doubt, in this case, that the problem you're having is related to 0.7.10's nasty bugs, but it's something I'd eliminate before trying to look further. |
Thanks for the heads up. Agree this is not likely to be it though, pretty sure it's much older than both. Updating it to the kernel with .11 built in. |
This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions. |
System information
Distribution Name | Arch
Distribution Version | Current
Linux Kernel | 4.14
Architecture | x64
ZFS Version | 0.7.11 & 0.7.10
SPL Version | 0.7.11 & 0.7.10
Describe the problem you're observing
Sending incremental or full streams causes crashes on receiving system and prevent send completion.
Describe how to reproduce the problem
Znapzend seems to do it pretty consistently via znapzend --runonce--autoCreation --nodestroy, any send/recv for the rpool does this. Scrub of rpool on sending machine reveals no damage, but this wouldn't be the first time we found that scrub doesn't actually understand some classes of on-disk error.
Include any warning/errors/backtraces from the system logs
On the recv size we see this:
On the sending side, it still thinks something is receiving on the other side, but data transfer rate is zero.
The text was updated successfully, but these errors were encountered: