Hang under memory pressure #2308

Closed
andreevmipt opened this issue May 7, 2014 · 16 comments

Comments

@andreevmipt

The system hangs under heavy memory pressure on a machine with 4 GB of physical memory. Sometimes the hang is complete; sometimes it can be recovered with SysRq + E.
3.14.2-aufs #4 SMP PREEMPT
spl: 0.6.2-36_g703371d
zfs: 0.6.2-274_g2c33b91

The following kernel settings are in use:
Hung-task detection timeout reduced to 20 seconds (for debugging)
CONFIG_TRANSPARENT_HUGEPAGE is not set
vm.dirty_ratio = 10

What else can be done to reduce the chance of hitting this condition?

[ 2341.763203] INFO: task systemd:1 blocked for more than 20 seconds.
[ 2341.763207] Tainted: P W O 3.14.2-aufs #4
[ 2341.763208] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2341.763209] systemd D ffff88012fb93280 4072 1 0 0x00000000
[ 2341.763213] ffff880129ac3628 0000000000000002 ffff880129ae0000 ffff880129ac3fd8
[ 2341.763215] 0000000000013280 0000000000013280 ffff880129ae0000 ffff8800c6d1b890
[ 2341.763217] ffff8800c6d1b890 ffff880129ae0000 ffff8800c6d1b898 0000000000000246
[ 2341.763220] Call Trace:
[ 2341.763226] [] schedule_preempt_disabled+0x2e/0x80
[ 2341.763229] [] __mutex_lock_slowpath+0x12f/0x3a0
[ 2341.763231] [] mutex_lock+0x23/0x33
[ 2341.763251] [] vdev_queue_io+0x85/0x180 [zfs]
[ 2341.763262] [] zio_buf_free+0xc07/0x17b0 [zfs]
[ 2341.763273] [] zio_nowait+0xb6/0x180 [zfs]
[ 2341.763284] [] vdev_config_sync+0xaff/0xd80 [zfs]
[ 2341.763295] [] ? vdev_config_sync+0x140/0xd80 [zfs]
[ 2341.763305] [] zio_buf_free+0xc57/0x17b0 [zfs]
[ 2341.763315] [] zio_nowait+0xb6/0x180 [zfs]
[ 2341.763321] [] arc_read+0x31a/0x960 [zfs]
[ 2341.763327] [] dbuf_read+0x254/0xd00 [zfs]
[ 2341.763334] [] dbuf_read+0xc7a/0xd00 [zfs]
[ 2341.763340] [] dbuf_hold_impl+0x76/0xa0 [zfs]
[ 2341.763346] [] dbuf_hold+0x1b/0x30 [zfs]
[ 2341.763353] [] dmu_buf_hold+0x29d/0x6b0 [zfs]
[ 2341.763360] [] dmu_read+0x97/0x2d0 [zfs]
[ 2341.763362] [] ? mutex_unlock+0x9/0x10
[ 2341.763373] [] zfs_getpage+0x12d/0x200 [zfs]
[ 2341.763383] [] ? zpl_putpage+0x220/0x830 [zfs]
[ 2341.763392] [] zpl_putpage+0x247/0x830 [zfs]
[ 2341.763395] [] read_cache_pages+0xba/0x120
[ 2341.763404] [] zpl_putpage+0x99/0x830 [zfs]
[ 2341.763406] [] __do_page_cache_readahead+0x1d4/0x290
[ 2341.763408] [] ra_submit+0x1c/0x20
[ 2341.763410] [] filemap_fault+0x385/0x420
[ 2341.763412] [] __do_fault+0x6e/0x530
[ 2341.763414] [] handle_mm_fault+0x1c3/0xcb0
[ 2341.763416] [] ? _raw_spin_unlock_irqrestore+0x19/0x40
[ 2341.763419] [] ? timerfd_poll+0x50/0x60
[ 2341.763421] [] ? ep_send_events_proc+0x9f/0x1c0
[ 2341.763423] [] __do_page_fault+0x16c/0x580
[ 2341.763426] [] ? acct_account_cputime+0x17/0x20
[ 2341.763428] [] ? account_user_time+0x87/0x90
[ 2341.763430] [] ? _raw_spin_unlock+0x13/0x30
[ 2341.763432] [] ? vtime_account_user+0x4f/0x60
[ 2341.763434] [] do_page_fault+0x1e/0x70
[ 2341.763436] [] page_fault+0x22/0x30
[ 2341.763441] INFO: task kswapd0:62 blocked for more than 20 seconds.
[ 2341.763442] Tainted: P W O 3.14.2-aufs #4
[ 2341.763443] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2341.763444] kswapd0 D ffff88012664be00 3464 62 2 0x00000000
[ 2341.763446] ffff880124fad968 0000000000000002 ffff880129390000 ffff880124fadfd8
[ 2341.763448] 0000000000013280 0000000000013280 ffff880129390000 ffff8800c6a4e428
[ 2341.763450] ffff8800c6a4e268 ffff8800c6a4e430 0000000000000000 ffff8800c6a4e3c0
[ 2341.763452] Call Trace:
[ 2341.763454] [] schedule+0x24/0x70
[ 2341.763459] [] __cv_destroy+0x19d/0x1d0 [spl]
[ 2341.763462] [] ? prepare_to_wait_event+0xf0/0xf0
[ 2341.763466] [] __cv_wait+0x10/0x20 [spl]
[ 2341.763476] [] txg_wait_open+0x83/0xb0 [zfs]
[ 2341.763485] [] dmu_tx_wait+0x305/0x310 [zfs]
[ 2341.763487] [] ? mutex_unlock+0x9/0x10
[ 2341.763501] [] dmu_tx_assign+0x95/0xc90 [zfs]
[ 2341.763520] [] zfs_inactive+0x15b/0x210 [zfs]
[ 2341.763530] [] zpl_vap_init+0x64f/0x7b0 [zfs]
[ 2341.763532] [] evict+0xab/0x1a0
[ 2341.763534] [] dispose_list+0x31/0x40
[ 2341.763535] [] prune_icache_sb+0x42/0x50
[ 2341.763538] [] super_cache_scan+0x100/0x170
[ 2341.763540] [] shrink_slab_node+0x14b/0x2f0
[ 2341.763543] [] ? css_next_descendant_pre+0x1f/0x60
[ 2341.763545] [] shrink_slab+0x86/0x180
[ 2341.763547] [] kswapd_shrink_zone+0x125/0x1c0
[ 2341.763550] [] kswapd+0x4c6/0x890
[ 2341.763552] [] ? mem_cgroup_shrink_node_zone+0x180/0x180
[ 2341.763555] [] kthread+0xdf/0x100
[ 2341.763557] [] ? arch_vtime_task_switch+0x8e/0xa0
[ 2341.763559] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 2341.763561] [] ret_from_fork+0x7c/0xb0
[ 2341.763563] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 2341.763571] INFO: task z_rd_int/0:691 blocked for more than 20 seconds.
[ 2341.763572] Tainted: P W O 3.14.2-aufs #4
[ 2341.763573] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 2341.763574] z_rd_int/0 D ffff88012fa13280 5304 691 2 0x00000000
[ 2341.763576] ffff880035d25cd8 0000000000000002 ffff8800c651c6f0 ffff880035d25fd8
[ 2341.763578] 0000000000013280 0000000000013280 ffff8800c651c6f0 ffff8800c6d1b890
[ 2341.763580] ffff8800c6d1b890 ffff8800c651c6f0 ffff8800c6d1b898 0000000000000246
[ 2341.763582] Call Trace:
[ 2341.763584] [] schedule_preempt_disabled+0x2e/0x80
[ 2341.763586] [] __mutex_lock_slowpath+0x12f/0x3a0
[ 2341.763588] [] mutex_lock+0x23/0x33
[ 2341.763599] [] vdev_queue_io_done+0x46/0x36d0 [zfs]
[ 2341.763601] [] ? mutex_unlock+0x9/0x10
[ 2341.763611] [] zio_buf_free+0x960/0x17b0 [zfs]
[ 2341.763620] [] zio_execute+0xa6/0x140 [zfs]
[ 2341.763624] [] taskq_cancel_id+0x2e8/0x490 [spl]
[ 2341.763626] [] ? wake_up_state+0x10/0x10
[ 2341.763629] [] ? taskq_cancel_id+0x130/0x490 [spl]
[ 2341.763631] [] kthread+0xdf/0x100
[ 2341.763633] [] ? arch_vtime_task_switch+0x8e/0xa0
[ 2341.763635] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 2341.763637] [] ret_from_fork+0x7c/0xb0
[ 2341.763638] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 2341.763640] INFO: task z_rd_int/1:692 blocked for more than 20 seconds.

@behlendorf added this to the 0.7.0 milestone May 7, 2014
@behlendorf added the Bug label May 7, 2014
@andreevmipt
Author

The same problem occurs with the stable version 0.6.2-r5 (from the Gentoo repository).

[ 383.521769] INFO: task systemd:1 blocked for more than 20 seconds.
[ 383.521773] Tainted: P W O 3.14.1-geek #2
[ 383.521774] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.521776] systemd D ffff88012fa13280 3976 1 0 0x00000000
[ 383.521779] ffff880129ac3958 0000000000000002 ffff880129ae0000 ffff880129ac3fd8
[ 383.521782] 0000000000013280 0000000000013280 ffff880129ae0000 ffff88012fa13ba8
[ 383.521783] ffff88009210a508 ffff88009210a560 0000000000000001 0000000000000000
[ 383.521785] Call Trace:
[ 383.521792] [] io_schedule+0x98/0x130
[ 383.521802] [] __cv_destroy+0x153/0x1d0 [spl]
[ 383.521806] [] ? prepare_to_wait_event+0xf0/0xf0
[ 383.521809] [] __cv_wait_io+0x13/0x150 [spl]
[ 383.521821] [] zio_wait+0x11b/0x1e0 [zfs]
[ 383.521829] [] dmu_prefetch+0x48c/0x850 [zfs]
[ 383.521835] [] dmu_read+0x97/0x2d0 [zfs]
[ 383.521846] [] zfs_getpage+0x12d/0x200 [zfs]
[ 383.521856] [] ? zpl_putpage+0x90/0x410 [zfs]
[ 383.521864] [] zpl_putpage+0xb7/0x410 [zfs]
[ 383.521867] [] read_cache_pages+0xba/0x120
[ 383.521876] [] zpl_putpage+0x69/0x410 [zfs]
[ 383.521878] [] __do_page_cache_readahead+0x1d4/0x290
[ 383.521879] [] ra_submit+0x1c/0x20
[ 383.521882] [] filemap_fault+0x385/0x420
[ 383.521884] [] __do_fault+0x6e/0x530
[ 383.521886] [] handle_mm_fault+0x1c3/0xcb0
[ 383.521888] [] ? _raw_spin_unlock_irqrestore+0x19/0x40
[ 383.521891] [] ? timerfd_poll+0x50/0x60
[ 383.521893] [] ? ep_send_events_proc+0x9f/0x1c0
[ 383.521895] [] __do_page_fault+0x16c/0x580
[ 383.521898] [] ? acct_account_cputime+0x17/0x20
[ 383.521900] [] ? account_user_time+0x87/0x90
[ 383.521902] [] ? _raw_spin_unlock+0x13/0x30
[ 383.521904] [] ? vtime_account_user+0x4f/0x60
[ 383.521905] [] do_page_fault+0x1e/0x70
[ 383.521907] [] page_fault+0x22/0x30
[ 383.521912] INFO: task kswapd0:61 blocked for more than 20 seconds.
[ 383.521913] Tainted: P W O 3.14.1-geek #2
[ 383.521914] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.521915] kswapd0 D ffff88012fb93280 4072 61 2 0x00000000
[ 383.521917] ffff88011e22d980 0000000000000002 ffff880129390000 ffff88011e22dfd8
[ 383.521919] 0000000000013280 0000000000013280 ffff880129390000 ffff880124c7dbe0
[ 383.521921] ffff880124c7da28 ffff880124c7dbe8 0000000000000000 ffff880124c7db78
[ 383.521923] Call Trace:
[ 383.521925] [] schedule+0x24/0x70
[ 383.521928] [] __cv_destroy+0x19d/0x1d0 [spl]
[ 383.521930] [] ? prepare_to_wait_event+0xf0/0xf0
[ 383.521933] [] __cv_wait+0x10/0x20 [spl]
[ 383.521943] [] txg_wait_open+0x83/0xb0 [zfs]
[ 383.521951] [] dmu_tx_wait+0x10d/0x110 [zfs]
[ 383.521958] [] dmu_tx_assign+0x87/0x4a0 [zfs]
[ 383.521968] [] zfs_inactive+0x15b/0x210 [zfs]
[ 383.521977] [] zpl_vap_init+0x51f/0x600 [zfs]
[ 383.521980] [] evict+0xab/0x1a0
[ 383.521981] [] dispose_list+0x31/0x40
[ 383.521983] [] prune_icache_sb+0x42/0x50
[ 383.521986] [] super_cache_scan+0x100/0x170
[ 383.521988] [] shrink_slab_node+0x14b/0x2f0
[ 383.521991] [] ? css_next_descendant_pre+0x1f/0x60
[ 383.521993] [] shrink_slab+0x86/0x180
[ 383.521995] [] kswapd_shrink_zone+0x125/0x1c0
[ 383.521997] [] kswapd+0x4af/0x880
[ 383.522000] [] ? mem_cgroup_shrink_node_zone+0x180/0x180
[ 383.522002] [] kthread+0xdf/0x100
[ 383.522004] [] ? arch_vtime_task_switch+0x8e/0xa0
[ 383.522006] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522008] [] ret_from_fork+0x7c/0xb0
[ 383.522010] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522019] INFO: task z_rd_int/0:791 blocked for more than 20 seconds.
[ 383.522020] Tainted: P W O 3.14.1-geek #2
[ 383.522021] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.522022] z_rd_int/0 D ffff88012fa13280 5720 791 2 0x00000000
[ 383.522024] ffff88012562dce8 0000000000000002 ffff8800c81e1c60 ffff88012562dfd8
[ 383.522026] 0000000000013280 0000000000013280 ffff8800c81e1c60 ffff8800c78b4808
[ 383.522028] ffff8800c78b4808 ffff8800c81e1c60 ffff8800c78b4810 0000000000000246
[ 383.522030] Call Trace:
[ 383.522032] [] schedule_preempt_disabled+0x2e/0x80
[ 383.522034] [] __mutex_lock_slowpath+0x12f/0x3a0
[ 383.522036] [] mutex_lock+0x23/0x33
[ 383.522047] [] vdev_queue_io_done+0x4a/0x3660 [zfs]
[ 383.522057] [] zio_buf_free+0x930/0x1770 [zfs]
[ 383.522066] [] zio_execute+0xa6/0x140 [zfs]
[ 383.522069] [] taskq_cancel_id+0x2e8/0x490 [spl]
[ 383.522071] [] ? wake_up_state+0x10/0x10
[ 383.522074] [] ? taskq_cancel_id+0x130/0x490 [spl]
[ 383.522076] [] kthread+0xdf/0x100
[ 383.522077] [] ? arch_vtime_task_switch+0x8e/0xa0
[ 383.522079] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522081] [] ret_from_fork+0x7c/0xb0
[ 383.522083] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522084] INFO: task z_rd_int/2:793 blocked for more than 20 seconds.
[ 383.522085] Tainted: P W O 3.14.1-geek #2
[ 383.522086] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.522087] z_rd_int/2 D ffff88012fb13280 5656 793 2 0x00000000
[ 383.522089] ffff880125751ce8 0000000000000002 ffff8800c81e0e30 ffff880125751fd8
[ 383.522091] 0000000000013280 0000000000013280 ffff8800c81e0e30 ffff8800c78b4808
[ 383.522093] ffff8800c78b4808 ffff8800c81e0e30 ffff8800c78b4810 0000000000000246
[ 383.522094] Call Trace:
[ 383.522097] [] schedule_preempt_disabled+0x2e/0x80
[ 383.522099] [] __mutex_lock_slowpath+0x12f/0x3a0
[ 383.522101] [] mutex_lock+0x23/0x33
[ 383.522111] [] vdev_queue_io_done+0x4a/0x3660 [zfs]
[ 383.522120] [] zio_buf_free+0x930/0x1770 [zfs]
[ 383.522129] [] zio_execute+0xa6/0x140 [zfs]
[ 383.522132] [] taskq_cancel_id+0x2e8/0x490 [spl]
[ 383.522134] [] ? wake_up_state+0x10/0x10
[ 383.522136] [] ? taskq_cancel_id+0x130/0x490 [spl]
[ 383.522138] [] kthread+0xdf/0x100
[ 383.522140] [] ? arch_vtime_task_switch+0x8e/0xa0
[ 383.522142] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522143] [] ret_from_fork+0x7c/0xb0
[ 383.522145] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522147] INFO: task z_rd_int/3:794 blocked for more than 20 seconds.
[ 383.522148] Tainted: P W O 3.14.1-geek #2
[ 383.522148] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.522149] z_rd_int/3 D ffff88012fb93280 5840 794 2 0x00000000
[ 383.522151] ffff880125753ce8 0000000000000002 ffff8800c617f180 ffff880125753fd8
[ 383.522153] 0000000000013280 0000000000013280 ffff8800c617f180 ffff8800c78b4808
[ 383.522155] ffff8800c78b4808 ffff8800c617f180 ffff8800c78b4810 0000000000000246
[ 383.522157] Call Trace:
[ 383.522159] [] schedule_preempt_disabled+0x2e/0x80
[ 383.522161] [] __mutex_lock_slowpath+0x12f/0x3a0
[ 383.522163] [] mutex_lock+0x23/0x33
[ 383.522173] [] vdev_queue_io_done+0x4a/0x3660 [zfs]
[ 383.522182] [] zio_buf_free+0x930/0x1770 [zfs]
[ 383.522191] [] zio_execute+0xa6/0x140 [zfs]
[ 383.522194] [] taskq_cancel_id+0x2e8/0x490 [spl]
[ 383.522196] [] ? wake_up_state+0x10/0x10
[ 383.522198] [] ? taskq_cancel_id+0x130/0x490 [spl]
[ 383.522200] [] kthread+0xdf/0x100
[ 383.522202] [] ? arch_vtime_task_switch+0x8e/0xa0
[ 383.522204] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522205] [] ret_from_fork+0x7c/0xb0
[ 383.522207] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522209] INFO: task z_wr_iss/0:795 blocked for more than 20 seconds.
[ 383.522209] Tainted: P W O 3.14.1-geek #2
[ 383.522210] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.522211] z_wr_iss/0 D ffff88012fa13280 4880 795 2 0x00000000
[ 383.522213] ffff8800c7959be8 0000000000000002 ffff8800c6178e30 ffff8800c7959fd8
[ 383.522215] 0000000000013280 0000000000013280 ffff8800c6178e30 ffff8800c78b4808
[ 383.522217] ffff8800c78b4808 ffff8800c6178e30 ffff8800c78b4810 0000000000000246
[ 383.522218] Call Trace:
[ 383.522221] [] schedule_preempt_disabled+0x2e/0x80
[ 383.522222] [] __mutex_lock_slowpath+0x12f/0x3a0
[ 383.522225] [] mutex_lock+0x23/0x33
[ 383.522235] [] vdev_queue_io+0x6f/0x130 [zfs]
[ 383.522244] [] zio_buf_free+0xbd7/0x1770 [zfs]
[ 383.522252] [] zio_nowait+0xb6/0x180 [zfs]
[ 383.522263] [] vdev_config_sync+0x968/0xdc0 [zfs]
[ 383.522273] [] ? vdev_config_sync+0x140/0xdc0 [zfs]
[ 383.522275] [] ? mutex_unlock+0x9/0x10
[ 383.522284] [] ? spa_config_enter+0xdd/0x110 [zfs]
[ 383.522293] [] zio_buf_free+0xc27/0x1770 [zfs]
[ 383.522302] [] zio_execute+0xa6/0x140 [zfs]
[ 383.522305] [] taskq_cancel_id+0x2e8/0x490 [spl]
[ 383.522307] [] ? wake_up_state+0x10/0x10
[ 383.522309] [] ? taskq_cancel_id+0x130/0x490 [spl]
[ 383.522311] [] kthread+0xdf/0x100
[ 383.522313] [] ? arch_vtime_task_switch+0x8e/0xa0
[ 383.522315] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522316] [] ret_from_fork+0x7c/0xb0
[ 383.522318] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522325] INFO: task txg_sync:865 blocked for more than 20 seconds.
[ 383.522326] Tainted: P W O 3.14.1-geek #2
[ 383.522327] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.522327] txg_sync D ffff88012fa93280 4488 865 2 0x00000000
[ 383.522330] ffff880126f7fbd8 0000000000000002 ffff8800c4f246f0 ffff880126f7ffd8
[ 383.522331] 0000000000013280 0000000000013280 ffff8800c4f246f0 ffff88012fa93ba8
[ 383.522333] ffff8800701429d8 ffff880070142a30 0000000000000001 0000000000000000
[ 383.522335] Call Trace:
[ 383.522337] [] io_schedule+0x98/0x130
[ 383.522340] [] __cv_destroy+0x153/0x1d0 [spl]
[ 383.522342] [] ? prepare_to_wait_event+0xf0/0xf0
[ 383.522345] [] __cv_wait_io+0x13/0x150 [spl]
[ 383.522354] [] zio_wait+0x11b/0x1e0 [zfs]
[ 383.522362] [] dsl_pool_sync+0xdf/0x5b0 [zfs]
[ 383.522372] [] ? spa_errlog_sync+0x1f3/0x230 [zfs]
[ 383.522374] [] ? prepare_to_wait_event+0xf0/0xf0
[ 383.522383] [] spa_sync+0x417/0xa80 [zfs]
[ 383.522385] [] ? try_to_wake_up+0x187/0x2c0
[ 383.522395] [] txg_init+0x420/0x5d0 [zfs]
[ 383.522404] [] ? txg_init+0x1e0/0x5d0 [zfs]
[ 383.522407] [] __thread_exit+0x95/0xb0 [spl]
[ 383.522410] [] ? __thread_exit+0x20/0xb0 [spl]
[ 383.522411] [] kthread+0xdf/0x100
[ 383.522413] [] ? arch_vtime_task_switch+0x8e/0xa0
[ 383.522415] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522417] [] ret_from_fork+0x7c/0xb0
[ 383.522419] [] ? kthread_create_on_node+0x1a0/0x1a0
[ 383.522422] INFO: task vmware-usbarbit:1691 blocked for more than 20 seconds.
[ 383.522423] Tainted: P W O 3.14.1-geek #2
[ 383.522424] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.522425] vmware-usbarbit D ffff88012fb13280 4392 1691 1 0x00000000
[ 383.522427] ffff8800aa29f958 0000000000000002 ffff8800b93138c0 ffff8800aa29ffd8
[ 383.522429] 0000000000013280 0000000000013280 ffff8800b93138c0 ffff88012fb13ba8
[ 383.522430] ffff88010417ed38 ffff88010417ed90 0000000000000001 0000000000000000
[ 383.522432] Call Trace:
[ 383.522434] [] io_schedule+0x98/0x130
[ 383.522437] [] __cv_destroy+0x153/0x1d0 [spl]
[ 383.522439] [] ? prepare_to_wait_event+0xf0/0xf0
[ 383.522442] [] __cv_wait_io+0x13/0x150 [spl]
[ 383.522451] [] zio_wait+0x11b/0x1e0 [zfs]
[ 383.522458] [] dmu_prefetch+0x48c/0x850 [zfs]
[ 383.522465] [] dmu_read+0x97/0x2d0 [zfs]
[ 383.522475] [] zfs_getpage+0x12d/0x200 [zfs]
[ 383.522484] [] ? zpl_putpage+0x90/0x410 [zfs]
[ 383.522492] [] zpl_putpage+0xb7/0x410 [zfs]
[ 383.522494] [] read_cache_pages+0xba/0x120
[ 383.522502] [] zpl_putpage+0x69/0x410 [zfs]
[ 383.522504] [] __do_page_cache_readahead+0x1d4/0x290
[ 383.522506] [] ra_submit+0x1c/0x20
[ 383.522508] [] filemap_fault+0x385/0x420
[ 383.522510] [] __do_fault+0x6e/0x530
[ 383.522511] [] handle_mm_fault+0x1c3/0xcb0
[ 383.522514] [] ? poll_select_copy_remaining+0x130/0x130
[ 383.522516] [] __do_page_fault+0x16c/0x580
[ 383.522518] [] ? acct_account_cputime+0x17/0x20
[ 383.522520] [] ? account_user_time+0x87/0x90
[ 383.522521] [] ? _raw_spin_unlock+0x13/0x30
[ 383.522523] [] ? vtime_account_user+0x4f/0x60
[ 383.522525] [] do_page_fault+0x1e/0x70
[ 383.522526] [] page_fault+0x22/0x30
[ 383.522528] INFO: task acpid:1692 blocked for more than 20 seconds.
[ 383.522529] Tainted: P W O 3.14.1-geek #2
[ 383.522529] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.522530] acpid D ffff88012fb13280 4912 1692 1 0x00000000
[ 383.522532] ffff8800a8dd5958 0000000000000002 ffff8800b9311c60 ffff8800a8dd5fd8
[ 383.522534] 0000000000013280 0000000000013280 ffff8800b9311c60 ffff88012fb13ba8
[ 383.522536] ffff880052e5b848 ffff880052e5b8a0 0000000000000001 0000000000000000
[ 383.522538] Call Trace:
[ 383.522540] [] io_schedule+0x98/0x130
[ 383.522543] [] __cv_destroy+0x153/0x1d0 [spl]
[ 383.522545] [] ? prepare_to_wait_event+0xf0/0xf0
[ 383.522548] [] __cv_wait_io+0x13/0x150 [spl]
[ 383.522557] [] zio_wait+0x11b/0x1e0 [zfs]
[ 383.522563] [] dmu_prefetch+0x48c/0x850 [zfs]
[ 383.522570] [] dmu_read+0x97/0x2d0 [zfs]
[ 383.522580] [] zfs_getpage+0x12d/0x200 [zfs]
[ 383.522588] [] ? zpl_putpage+0x90/0x410 [zfs]
[ 383.522596] [] zpl_putpage+0xb7/0x410 [zfs]
[ 383.522598] [] read_cache_pages+0xba/0x120
[ 383.522606] [] zpl_putpage+0x69/0x410 [zfs]
[ 383.522607] [] __do_page_cache_readahead+0x1d4/0x290
[ 383.522609] [] ra_submit+0x1c/0x20
[ 383.522611] [] filemap_fault+0x385/0x420
[ 383.522613] [] __do_fault+0x6e/0x530
[ 383.522614] [] handle_mm_fault+0x1c3/0xcb0
[ 383.522616] [] __do_page_fault+0x16c/0x580
[ 383.522618] [] ? acct_account_cputime+0x17/0x20
[ 383.522620] [] ? account_user_time+0x87/0x90
[ 383.522621] [] ? _raw_spin_unlock+0x13/0x30
[ 383.522623] [] ? vtime_account_user+0x4f/0x60
[ 383.522625] [] do_page_fault+0x1e/0x70
[ 383.522626] [] page_fault+0x22/0x30
[ 383.522632] INFO: task X:1914 blocked for more than 20 seconds.
[ 383.522633] Tainted: P W O 3.14.1-geek #2
[ 383.522633] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 383.522634] X D ffff8800c8b89740 3664 1914 1912 0x00400000
[ 383.522636] ffff8800bb6c3958 0000000000003002 ffff880122531c60 ffff8800bb6c3fd8
[ 383.522638] 0000000000013280 0000000000013280 ffff880122531c60 ffff88012fb93ba8
[ 383.522640] ffff880025f69698 ffff880025f696f0 0000000000000001 0000000000000000
[ 383.522642] Call Trace:
[ 383.522644] [] io_schedule+0x98/0x130
[ 383.522647] [] __cv_destroy+0x153/0x1d0 [spl]
[ 383.522649] [] ? prepare_to_wait_event+0xf0/0xf0
[ 383.522652] [] __cv_wait_io+0x13/0x150 [spl]
[ 383.522660] [] zio_wait+0x11b/0x1e0 [zfs]
[ 383.522667] [] dmu_prefetch+0x48c/0x850 [zfs]
[ 383.522673] [] dmu_read+0x97/0x2d0 [zfs]
[ 383.522683] [] zfs_getpage+0x12d/0x200 [zfs]
[ 383.522691] [] ? zpl_putpage+0x90/0x410 [zfs]
[ 383.522699] [] zpl_putpage+0xb7/0x410 [zfs]
[ 383.522701] [] read_cache_pages+0xba/0x120
[ 383.522709] [] zpl_putpage+0x69/0x410 [zfs]
[ 383.522711] [] __do_page_cache_readahead+0x1d4/0x290
[ 383.522713] [] ra_submit+0x1c/0x20
[ 383.522715] [] filemap_fault+0x385/0x420
[ 383.522716] [] __do_fault+0x6e/0x530
[ 383.522718] [] handle_mm_fault+0x1c3/0xcb0
[ 383.522720] [] ? recalc_sigpending+0x16/0x50
[ 383.522722] [] __do_page_fault+0x16c/0x580
[ 383.522724] [] ? acct_account_cputime+0x17/0x20
[ 383.522725] [] ? account_user_time+0x87/0x90
[ 383.522727] [] ? _raw_spin_unlock+0x13/0x30
[ 383.522728] [] ? vtime_account_user+0x4f/0x60
[ 383.522730] [] do_page_fault+0x1e/0x70
[ 383.522732] [] page_fault+0x22/0x30
[ 397.756823] SysRq : Terminate All Tasks
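
With this many tasks in the log, a quick tally of which ones the hung-task detector reported can make the pattern easier to see. The following is only a rough sketch, not part of ZFS or the kernel: the function name `summarize_hung_tasks` and the regular expression are my own assumptions, based purely on the "INFO: task NAME:PID blocked for more than N seconds" lines shown above.

```python
import re
from collections import Counter

# Assumed line shape, taken from the logs in this thread:
# [  383.521769] INFO: task systemd:1 blocked for more than 20 seconds.
HUNG_RE = re.compile(
    r"INFO: task (?P<name>.+?):(?P<pid>\d+) blocked for more than (?P<secs>\d+) seconds"
)


def summarize_hung_tasks(log_text):
    """Count hung-task reports per (task name, pid) in dmesg-style text."""
    counts = Counter()
    for line in log_text.splitlines():
        match = HUNG_RE.search(line)
        if match:
            counts[(match.group("name"), int(match.group("pid")))] += 1
    return counts


if __name__ == "__main__":
    # Three report lines copied from the log above.
    sample = (
        "[ 383.521769] INFO: task systemd:1 blocked for more than 20 seconds.\n"
        "[ 383.521912] INFO: task kswapd0:61 blocked for more than 20 seconds.\n"
        "[ 383.522019] INFO: task z_rd_int/0:791 blocked for more than 20 seconds.\n"
    )
    for (name, pid), n in summarize_hung_tasks(sample).items():
        print(f"{name} (pid {pid}): {n} report(s)")
```

Feeding it the saved output of dmesg would list each blocked task once with its report count, which may help when comparing hangs across kernel or ZFS versions.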

@ioquatix

I had a similar issue running the latest stable ZFS (0.6.3 g07dabd2) on Arch with 8 GB of ECC memory. The machine also runs a Minecraft server that uses up to 2 GB.

[2012026.313208] INFO: task txg_sync:561 blocked for more than 120 seconds.
[2012026.313367]       Tainted: P           O  3.15.2-1-ARCH #1
[2012026.313463] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2012026.313597] txg_sync        D 0000000000000000     0   561      2 0x00000000
[2012026.313607]  ffff8800caef3bb0 0000000000000046 ffff88021d97bd20 0000000000014700
[2012026.313616]  ffff8800caef3fd8 0000000000014700 ffff88021d97bd20 ffffffffa028ccaa
[2012026.313623]  ffff8800cb172000 ffff8800cb172080 ffffffffa03e14c0 ffffc9001998d360
[2012026.313630] Call Trace:
[2012026.313665]  [<ffffffffa028ccaa>] ? taskq_dispatch_ent+0x6a/0x1d0 [spl]
[2012026.313716]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2012026.313764]  [<ffffffffa0389eb1>] ? spa_taskq_dispatch_ent+0x61/0x80 [zfs]
[2012026.313802]  [<ffffffffa03e00d2>] ? zio_taskq_dispatch+0x92/0xa0 [zfs]
[2012026.313813]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2012026.313821]  [<ffffffff81509dd4>] io_schedule+0x94/0xf0
[2012026.313834]  [<ffffffffa029415c>] cv_wait_common+0xac/0x1b0 [spl]
[2012026.313872]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2012026.313881]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2012026.313895]  [<ffffffffa02942b8>] __cv_wait_io+0x18/0x20 [spl]
[2012026.313932]  [<ffffffffa03e3343>] zio_wait+0x113/0x1d0 [zfs]
[2012026.313974]  [<ffffffffa0370bf1>] dsl_pool_sync+0xc1/0x480 [zfs]
[2012026.314019]  [<ffffffffa03894ed>] spa_sync+0x42d/0xb10 [zfs]
[2012026.314028]  [<ffffffff810d4868>] ? ktime_get_ts+0x48/0xf0
[2012026.314073]  [<ffffffffa039a202>] txg_sync_thread+0x382/0x5f0 [zfs]
[2012026.314120]  [<ffffffffa0399e80>] ? txg_delay+0xf0/0xf0 [zfs]
[2012026.314134]  [<ffffffffa028c2ca>] thread_generic_wrapper+0x7a/0x90 [spl]
[2012026.314146]  [<ffffffffa028c250>] ? __thread_exit+0xa0/0xa0 [spl]
[2012026.314154]  [<ffffffff8108d3ca>] kthread+0xea/0x100
[2012026.314161]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2012026.314170]  [<ffffffff815159bc>] ret_from_fork+0x7c/0xb0
[2012026.314177]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051039.987032] INFO: task txg_sync:561 blocked for more than 120 seconds.
[2051039.987188]       Tainted: P           O  3.15.2-1-ARCH #1
[2051039.987283] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051039.987417] txg_sync        D 0000000000000000     0   561      2 0x00000000
[2051039.987428]  ffff8800caef3bb0 0000000000000046 ffff88021d97bd20 0000000000014700
[2051039.987436]  ffff8800caef3fd8 0000000000014700 ffff88021d97bd20 0000000000000046
[2051039.987443]  ffff8800caef3b38 ffffffff8109dfbf ffffffffa03e14c0 ffffc90036714c20
[2051039.987449] Call Trace:
[2051039.987467]  [<ffffffff8109dfbf>] ? try_to_wake_up+0x1ff/0x2e0
[2051039.987527]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051039.987537]  [<ffffffff8109e102>] ? default_wake_function+0x12/0x20
[2051039.987543]  [<ffffffff810b14c5>] ? __wake_up_common+0x55/0x90
[2051039.987552]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051039.987559]  [<ffffffff81509dd4>] io_schedule+0x94/0xf0
[2051039.987576]  [<ffffffffa029415c>] cv_wait_common+0xac/0x1b0 [spl]
[2051039.987614]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051039.987620]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051039.987635]  [<ffffffffa02942b8>] __cv_wait_io+0x18/0x20 [spl]
[2051039.987672]  [<ffffffffa03e3343>] zio_wait+0x113/0x1d0 [zfs]
[2051039.987714]  [<ffffffffa0370bf1>] dsl_pool_sync+0xc1/0x480 [zfs]
[2051039.987760]  [<ffffffffa03894ed>] spa_sync+0x42d/0xb10 [zfs]
[2051039.987768]  [<ffffffff810d4868>] ? ktime_get_ts+0x48/0xf0
[2051039.987813]  [<ffffffffa039a202>] txg_sync_thread+0x382/0x5f0 [zfs]
[2051039.987860]  [<ffffffffa0399e80>] ? txg_delay+0xf0/0xf0 [zfs]
[2051039.987874]  [<ffffffffa028c2ca>] thread_generic_wrapper+0x7a/0x90 [spl]
[2051039.987887]  [<ffffffffa028c250>] ? __thread_exit+0xa0/0xa0 [spl]
[2051039.987895]  [<ffffffff8108d3ca>] kthread+0xea/0x100
[2051039.987902]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051039.987910]  [<ffffffff815159bc>] ret_from_fork+0x7c/0xb0
[2051039.987917]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051160.029037] INFO: task txg_sync:561 blocked for more than 120 seconds.
[2051160.029163]       Tainted: P           O  3.15.2-1-ARCH #1
[2051160.029215] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051160.029287] txg_sync        D 0000000000000000     0   561      2 0x00000000
[2051160.029293]  ffff8800caef3bb0 0000000000000046 ffff88021d97bd20 0000000000014700
[2051160.029297]  ffff8800caef3fd8 0000000000014700 ffff88021d97bd20 0000000000000046
[2051160.029301]  ffff8800caef3b38 ffffffff8109dfbf ffffffffa03e14c0 ffffc90036714c20
[2051160.029305] Call Trace:
[2051160.029316]  [<ffffffff8109dfbf>] ? try_to_wake_up+0x1ff/0x2e0
[2051160.029354]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051160.029359]  [<ffffffff8109e102>] ? default_wake_function+0x12/0x20
[2051160.029363]  [<ffffffff810b14c5>] ? __wake_up_common+0x55/0x90
[2051160.029369]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051160.029373]  [<ffffffff81509dd4>] io_schedule+0x94/0xf0
[2051160.029382]  [<ffffffffa029415c>] cv_wait_common+0xac/0x1b0 [spl]
[2051160.029403]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051160.029406]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051160.029414]  [<ffffffffa02942b8>] __cv_wait_io+0x18/0x20 [spl]
[2051160.029434]  [<ffffffffa03e3343>] zio_wait+0x113/0x1d0 [zfs]
[2051160.029457]  [<ffffffffa0370bf1>] dsl_pool_sync+0xc1/0x480 [zfs]
[2051160.029481]  [<ffffffffa03894ed>] spa_sync+0x42d/0xb10 [zfs]
[2051160.029486]  [<ffffffff810d4868>] ? ktime_get_ts+0x48/0xf0
[2051160.029510]  [<ffffffffa039a202>] txg_sync_thread+0x382/0x5f0 [zfs]
[2051160.029535]  [<ffffffffa0399e80>] ? txg_delay+0xf0/0xf0 [zfs]
[2051160.029543]  [<ffffffffa028c2ca>] thread_generic_wrapper+0x7a/0x90 [spl]
[2051160.029550]  [<ffffffffa028c250>] ? __thread_exit+0xa0/0xa0 [spl]
[2051160.029554]  [<ffffffff8108d3ca>] kthread+0xea/0x100
[2051160.029558]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051160.029563]  [<ffffffff815159bc>] ret_from_fork+0x7c/0xb0
[2051160.029567]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051280.071088] INFO: task txg_sync:561 blocked for more than 120 seconds.
[2051280.071201]       Tainted: P           O  3.15.2-1-ARCH #1
[2051280.071252] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051280.071324] txg_sync        D 0000000000000000     0   561      2 0x00000000
[2051280.071330]  ffff8800caef3bb0 0000000000000046 ffff88021d97bd20 0000000000014700
[2051280.071334]  ffff8800caef3fd8 0000000000014700 ffff88021d97bd20 0000000000000046
[2051280.071338]  ffff8800caef3b38 ffffffff8109dfbf ffffffffa03e14c0 ffffc90036714c20
[2051280.071342] Call Trace:
[2051280.071353]  [<ffffffff8109dfbf>] ? try_to_wake_up+0x1ff/0x2e0
[2051280.071390]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051280.071395]  [<ffffffff8109e102>] ? default_wake_function+0x12/0x20
[2051280.071399]  [<ffffffff810b14c5>] ? __wake_up_common+0x55/0x90
[2051280.071405]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051280.071408]  [<ffffffff81509dd4>] io_schedule+0x94/0xf0
[2051280.071418]  [<ffffffffa029415c>] cv_wait_common+0xac/0x1b0 [spl]
[2051280.071438]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051280.071441]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051280.071449]  [<ffffffffa02942b8>] __cv_wait_io+0x18/0x20 [spl]
[2051280.071469]  [<ffffffffa03e3343>] zio_wait+0x113/0x1d0 [zfs]
[2051280.071492]  [<ffffffffa0370bf1>] dsl_pool_sync+0xc1/0x480 [zfs]
[2051280.071516]  [<ffffffffa03894ed>] spa_sync+0x42d/0xb10 [zfs]
[2051280.071521]  [<ffffffff810d4868>] ? ktime_get_ts+0x48/0xf0
[2051280.071545]  [<ffffffffa039a202>] txg_sync_thread+0x382/0x5f0 [zfs]
[2051280.071570]  [<ffffffffa0399e80>] ? txg_delay+0xf0/0xf0 [zfs]
[2051280.071578]  [<ffffffffa028c2ca>] thread_generic_wrapper+0x7a/0x90 [spl]
[2051280.071584]  [<ffffffffa028c250>] ? __thread_exit+0xa0/0xa0 [spl]
[2051280.071589]  [<ffffffff8108d3ca>] kthread+0xea/0x100
[2051280.071593]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051280.071598]  [<ffffffff815159bc>] ret_from_fork+0x7c/0xb0
[2051280.071601]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051400.113196] INFO: task txg_sync:561 blocked for more than 120 seconds.
[2051400.113338]       Tainted: P           O  3.15.2-1-ARCH #1
[2051400.113919] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051400.114065] txg_sync        D 0000000000000000     0   561      2 0x00000000
[2051400.114074]  ffff8800caef3bb0 0000000000000046 ffff88021d97bd20 0000000000014700
[2051400.114081]  ffff8800caef3fd8 0000000000014700 ffff88021d97bd20 0000000000000046
[2051400.114086]  ffff8800caef3b38 ffffffff8109dfbf ffffffffa03e14c0 ffffc90036714c20
[2051400.114092] Call Trace:
[2051400.114107]  [<ffffffff8109dfbf>] ? try_to_wake_up+0x1ff/0x2e0
[2051400.114154]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051400.114163]  [<ffffffff8109e102>] ? default_wake_function+0x12/0x20
[2051400.114169]  [<ffffffff810b14c5>] ? __wake_up_common+0x55/0x90
[2051400.114176]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051400.114182]  [<ffffffff81509dd4>] io_schedule+0x94/0xf0
[2051400.114195]  [<ffffffffa029415c>] cv_wait_common+0xac/0x1b0 [spl]
[2051400.114225]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051400.114230]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051400.114242]  [<ffffffffa02942b8>] __cv_wait_io+0x18/0x20 [spl]
[2051400.114272]  [<ffffffffa03e3343>] zio_wait+0x113/0x1d0 [zfs]
[2051400.114305]  [<ffffffffa0370bf1>] dsl_pool_sync+0xc1/0x480 [zfs]
[2051400.114342]  [<ffffffffa03894ed>] spa_sync+0x42d/0xb10 [zfs]
[2051400.114349]  [<ffffffff810d4868>] ? ktime_get_ts+0x48/0xf0
[2051400.114385]  [<ffffffffa039a202>] txg_sync_thread+0x382/0x5f0 [zfs]
[2051400.114422]  [<ffffffffa0399e80>] ? txg_delay+0xf0/0xf0 [zfs]
[2051400.114434]  [<ffffffffa028c2ca>] thread_generic_wrapper+0x7a/0x90 [spl]
[2051400.114444]  [<ffffffffa028c250>] ? __thread_exit+0xa0/0xa0 [spl]
[2051400.114450]  [<ffffffff8108d3ca>] kthread+0xea/0x100
[2051400.114456]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051400.114463]  [<ffffffff815159bc>] ret_from_fork+0x7c/0xb0
[2051400.114468]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051760.239431] INFO: task txg_sync:561 blocked for more than 120 seconds.
[2051760.239587]       Tainted: P           O  3.15.2-1-ARCH #1
[2051760.239682] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051760.239816] txg_sync        D 0000000000000000     0   561      2 0x00000000
[2051760.239826]  ffff8800caef3bb0 0000000000000046 ffff88021d97bd20 0000000000014700
[2051760.239834]  ffff8800caef3fd8 0000000000014700 ffff88021d97bd20 0000000000000046
[2051760.239841]  ffff8800caef3b38 ffffffff8109dfbf ffffffffa03e14c0 ffffc9001b4ed4e0
[2051760.239848] Call Trace:
[2051760.239866]  [<ffffffff8109dfbf>] ? try_to_wake_up+0x1ff/0x2e0
[2051760.239925]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051760.239935]  [<ffffffff8109e102>] ? default_wake_function+0x12/0x20
[2051760.239942]  [<ffffffff810b14c5>] ? __wake_up_common+0x55/0x90
[2051760.239951]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051760.239958]  [<ffffffff81509dd4>] io_schedule+0x94/0xf0
[2051760.239975]  [<ffffffffa029415c>] cv_wait_common+0xac/0x1b0 [spl]
[2051760.240012]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051760.240019]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051760.240033]  [<ffffffffa02942b8>] __cv_wait_io+0x18/0x20 [spl]
[2051760.240071]  [<ffffffffa03e3343>] zio_wait+0x113/0x1d0 [zfs]
[2051760.240112]  [<ffffffffa0370bf1>] dsl_pool_sync+0xc1/0x480 [zfs]
[2051760.240158]  [<ffffffffa03894ed>] spa_sync+0x42d/0xb10 [zfs]
[2051760.240167]  [<ffffffff810d4868>] ? ktime_get_ts+0x48/0xf0
[2051760.240211]  [<ffffffffa039a202>] txg_sync_thread+0x382/0x5f0 [zfs]
[2051760.240258]  [<ffffffffa0399e80>] ? txg_delay+0xf0/0xf0 [zfs]
[2051760.240273]  [<ffffffffa028c2ca>] thread_generic_wrapper+0x7a/0x90 [spl]
[2051760.240285]  [<ffffffffa028c250>] ? __thread_exit+0xa0/0xa0 [spl]
[2051760.240293]  [<ffffffff8108d3ca>] kthread+0xea/0x100
[2051760.240300]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051760.240308]  [<ffffffff815159bc>] ret_from_fork+0x7c/0xb0
[2051760.240315]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051760.240333] INFO: task rsync:13181 blocked for more than 120 seconds.
[2051760.240473]       Tainted: P           O  3.15.2-1-ARCH #1
[2051760.240645] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051760.240785] rsync           D 0000000000000000     0 13181  12650 0x00000000
[2051760.240792]  ffff88011ae47b68 0000000000000082 ffff88022431a8c0 0000000000014700
[2051760.240812]  ffff88011ae47fd8 0000000000014700 ffff88022431a8c0 0000000000000000
[2051760.240818]  0000000000000002 ffff88011ae47ab8 ffffffff81509e6b ffff8800cd40ea00
[2051760.240825] Call Trace:
[2051760.240833]  [<ffffffff81509e6b>] ? preempt_schedule+0x3b/0x60
[2051760.240842]  [<ffffffff8101e476>] ? ___preempt_schedule+0x56/0xb0
[2051760.240853]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051760.240866]  [<ffffffffa02941d5>] cv_wait_common+0x125/0x1b0 [spl]
[2051760.240872]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051760.240886]  [<ffffffffa0294275>] __cv_wait+0x15/0x20 [spl]
[2051760.240930]  [<ffffffffa0399c4b>] txg_wait_open+0x8b/0x110 [zfs]
[2051760.240967]  [<ffffffffa035c9a1>] dmu_tx_wait+0x2a1/0x2b0 [zfs]
[2051760.241002]  [<ffffffffa035ca3a>] dmu_tx_assign+0x8a/0x450 [zfs]
[2051760.241043]  [<ffffffffa03ce27d>] zfs_dirty_inode+0xad/0x2b0 [zfs]
[2051760.241051]  [<ffffffff8101e476>] ? ___preempt_schedule+0x56/0xb0
[2051760.241059]  [<ffffffff8150ceed>] ? _raw_spin_unlock_irqrestore+0x2d/0x30
[2051760.241066]  [<ffffffff810b16fd>] ? remove_wait_queue+0x4d/0x60
[2051760.241074]  [<ffffffff8135838e>] ? n_tty_write+0x2ce/0x560
[2051760.241111]  [<ffffffffa03ea4be>] zpl_dirty_inode+0xe/0x10 [zfs]
[2051760.241119]  [<ffffffff811df7a8>] __mark_inode_dirty+0x38/0x2d0
[2051760.241155]  [<ffffffffa03e8df7>] zpl_release+0x37/0x70 [zfs]
[2051760.241164]  [<ffffffff811b60ac>] __fput+0x9c/0x200
[2051760.241172]  [<ffffffff811b625e>] ____fput+0xe/0x10
[2051760.241180]  [<ffffffff8108a35f>] task_work_run+0x9f/0xe0
[2051760.241188]  [<ffffffff81014c55>] do_notify_resume+0x95/0xa0
[2051760.241196]  [<ffffffff81515ca0>] int_signal+0x12/0x17
[2051760.241202] INFO: task rsync:13183 blocked for more than 120 seconds.
[2051760.241343]       Tainted: P           O  3.15.2-1-ARCH #1
[2051760.241437] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051760.241568] rsync           D 0000000000000000     0 13183  13182 0x00000000
[2051760.241575]  ffff8801447c3ac0 0000000000000082 ffff8800222e8000 0000000000014700
[2051760.241581]  ffff8801447c3fd8 0000000000014700 ffff8800222e8000 0000000000000000
[2051760.241587]  0000000000000002 ffff8801447c3a10 ffffffff81509e6b ffff8800cd40ea00
[2051760.241593] Call Trace:
[2051760.241601]  [<ffffffff81509e6b>] ? preempt_schedule+0x3b/0x60
[2051760.241608]  [<ffffffff8101e476>] ? ___preempt_schedule+0x56/0xb0
[2051760.241618]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051760.241630]  [<ffffffffa02941d5>] cv_wait_common+0x125/0x1b0 [spl]
[2051760.241637]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051760.241649]  [<ffffffffa0294275>] __cv_wait+0x15/0x20 [spl]
[2051760.241693]  [<ffffffffa0399c4b>] txg_wait_open+0x8b/0x110 [zfs]
[2051760.241729]  [<ffffffffa035c9a1>] dmu_tx_wait+0x2a1/0x2b0 [zfs]
[2051760.241764]  [<ffffffffa035ca3a>] dmu_tx_assign+0x8a/0x450 [zfs]
[2051760.241805]  [<ffffffffa03d45d7>] zfs_write+0x3f7/0xc30 [zfs]
[2051760.241813]  [<ffffffff81400737>] ? kfree_skbmem+0x37/0xa0
[2051760.241821]  [<ffffffff81403c34>] ? consume_skb+0x34/0xa0
[2051760.241860]  [<ffffffffa03e94d1>] zpl_write_common+0x61/0xb0 [zfs]
[2051760.241898]  [<ffffffffa03e9580>] zpl_write+0x60/0x90 [zfs]
[2051760.241906]  [<ffffffff811b44fa>] vfs_write+0xba/0x1e0
[2051760.241913]  [<ffffffff811b441c>] ? vfs_read+0x13c/0x160
[2051760.241920]  [<ffffffff811b50f9>] SyS_write+0x59/0xd0
[2051760.241928]  [<ffffffff81515a69>] system_call_fastpath+0x16/0x1b
[2051760.241938] INFO: task java:13315 blocked for more than 120 seconds.
[2051760.242079]       Tainted: P           O  3.15.2-1-ARCH #1
[2051760.242173] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051760.242305] java            D 0000000000000001     0 13315  13210 0x00000000
[2051760.242311]  ffff880022a5ba20 0000000000000082 ffff8800cdc58a30 0000000000014700
[2051760.242317]  ffff880022a5bfd8 0000000000014700 ffff8800cdc58a30 0000000000000000
[2051760.242323]  ffff880022a5b990 ffffffff81097d0e 000000002fc14700 ffff88021d97a8c0
[2051760.242329] Call Trace:
[2051760.242337]  [<ffffffff81097d0e>] ? ttwu_stat+0x9e/0x110
[2051760.242345]  [<ffffffff8109dfbf>] ? try_to_wake_up+0x1ff/0x2e0
[2051760.242357]  [<ffffffffa02835e6>] ? trace_put_tcd+0x16/0x30 [spl]
[2051760.242369]  [<ffffffffa0283fd8>] ? spl_debug_msg+0x488/0x990 [spl]
[2051760.242376]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051760.242388]  [<ffffffffa02941d5>] cv_wait_common+0x125/0x1b0 [spl]
[2051760.242394]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051760.242407]  [<ffffffffa0294275>] __cv_wait+0x15/0x20 [spl]
[2051760.242450]  [<ffffffffa0399c4b>] txg_wait_open+0x8b/0x110 [zfs]
[2051760.242487]  [<ffffffffa035c9a1>] dmu_tx_wait+0x2a1/0x2b0 [zfs]
[2051760.242528]  [<ffffffffa03b6f9d>] ? zfs_dirent_unlock+0x9d/0x1a0 [zfs]
[2051760.242620]  [<ffffffffa03cf3fb>] zfs_create+0x2eb/0x780 [zfs]
[2051760.242637]  [<ffffffffa028797b>] ? kmem_alloc_debug+0x1cb/0x540 [spl]
[2051760.242675]  [<ffffffffa03ea279>] zpl_create+0xa9/0x1b0 [zfs]
[2051760.242684]  [<ffffffff8125437c>] ? security_inode_permission+0x1c/0x30
[2051760.242693]  [<ffffffff811c06cd>] vfs_create+0xad/0xf0
[2051760.242700]  [<ffffffff811c31f1>] do_last.isra.36+0xd31/0xe50
[2051760.242708]  [<ffffffff811c0daf>] ? path_init+0x17f/0x410
[2051760.242716]  [<ffffffff811c33d7>] path_openat+0xc7/0x6e0
[2051760.242724]  [<ffffffff8116f216>] ? handle_mm_fault+0xee6/0x10c0
[2051760.242732]  [<ffffffff811c4cfd>] do_filp_open+0x4d/0xc0
[2051760.242739]  [<ffffffff811d1fc7>] ? __alloc_fd+0xa7/0x130
[2051760.242747]  [<ffffffff811b389e>] do_sys_open+0x14e/0x250
[2051760.242754]  [<ffffffff811b39be>] SyS_open+0x1e/0x20
[2051760.242762]  [<ffffffff81515a69>] system_call_fastpath+0x16/0x1b
[2051880.284691] INFO: task txg_sync:561 blocked for more than 120 seconds.
[2051880.284847]       Tainted: P           O  3.15.2-1-ARCH #1
[2051880.284942] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[2051880.285076] txg_sync        D 0000000000000000     0   561      2 0x00000000
[2051880.285087]  ffff8800caef3bb0 0000000000000046 ffff88021d97bd20 0000000000014700
[2051880.285095]  ffff8800caef3fd8 0000000000014700 ffff88021d97bd20 0000000000000046
[2051880.285102]  ffff8800caef3b38 ffffffff8109dfbf ffffffffa03e14c0 ffffc9001b4ed4e0
[2051880.285109] Call Trace:
[2051880.285127]  [<ffffffff8109dfbf>] ? try_to_wake_up+0x1ff/0x2e0
[2051880.285185]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051880.285195]  [<ffffffff8109e102>] ? default_wake_function+0x12/0x20
[2051880.285202]  [<ffffffff810b14c5>] ? __wake_up_common+0x55/0x90
[2051880.285211]  [<ffffffff81509ae9>] schedule+0x29/0x70
[2051880.285218]  [<ffffffff81509dd4>] io_schedule+0x94/0xf0
[2051880.285234]  [<ffffffffa029415c>] cv_wait_common+0xac/0x1b0 [spl]
[2051880.285271]  [<ffffffffa03e14c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[2051880.285278]  [<ffffffff810b1bd0>] ? __wake_up_sync+0x20/0x20
[2051880.285292]  [<ffffffffa02942b8>] __cv_wait_io+0x18/0x20 [spl]
[2051880.285330]  [<ffffffffa03e3343>] zio_wait+0x113/0x1d0 [zfs]
[2051880.285371]  [<ffffffffa0370bf1>] dsl_pool_sync+0xc1/0x480 [zfs]
[2051880.285416]  [<ffffffffa03894ed>] spa_sync+0x42d/0xb10 [zfs]
[2051880.285425]  [<ffffffff810d4868>] ? ktime_get_ts+0x48/0xf0
[2051880.285470]  [<ffffffffa039a202>] txg_sync_thread+0x382/0x5f0 [zfs]
[2051880.285517]  [<ffffffffa0399e80>] ? txg_delay+0xf0/0xf0 [zfs]
[2051880.285531]  [<ffffffffa028c2ca>] thread_generic_wrapper+0x7a/0x90 [spl]
[2051880.285544]  [<ffffffffa028c250>] ? __thread_exit+0xa0/0xa0 [spl]
[2051880.285551]  [<ffffffff8108d3ca>] kthread+0xea/0x100
[2051880.285558]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0
[2051880.285567]  [<ffffffff815159bc>] ret_from_fork+0x7c/0xb0
[2051880.285574]  [<ffffffff8108d2e0>] ? kthread_create_on_node+0x1a0/0x1a0

The machine eventually became responsive again and I rebooted it.
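
For anyone triaging logs like the ones above, here is a minimal sketch that counts how often each task appears in the kernel's hung-task reports. It only assumes the `INFO: task NAME:PID blocked for more than N seconds` line format seen in this thread; the function name `summarize_hung_tasks` is made up for illustration.

```python
import re
from collections import Counter

# Matches the kernel's hung-task report line, e.g.
# "[2051760.240333] INFO: task rsync:13181 blocked for more than 120 seconds."
HUNG_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<comm>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def summarize_hung_tasks(log_text):
    """Count hung-task reports per (command, pid) pair in dmesg-style text."""
    counts = Counter()
    for line in log_text.splitlines():
        match = HUNG_RE.search(line)
        if match:
            counts[(match.group("comm"), int(match.group("pid")))] += 1
    return counts
```

Feeding it saved `dmesg` output and iterating over `summarize_hung_tasks(text).most_common()` shows which tasks are reported most often. In the log above that is `txg_sync:561`, with the `rsync` and `java` writers waiting behind it in `txg_wait_open`.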

@behlendorf
Contributor

It's hard to say for certain but it's likely this is a duplicate of #2523.

@ioquatix

ioquatix commented Aug 7, 2014

I'm now hitting this consistently. It seems to require that the system has been up for about 24 hours and has been used to copy a moderate amount of data (e.g. 10+ GB).

[64582.949621] INFO: task txg_sync:565 blocked for more than 120 seconds.
[64582.949782]       Tainted: P           O  3.15.8-1-ARCH #1
[64582.949876] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[64582.950007] txg_sync        D 0000000000000001     0   565      2 0x00000000
[64582.950062]  ffff8800ab607bb0 0000000000000046 ffff88021db95180 0000000000014700
[64582.950070]  ffff8800ab607fd8 0000000000014700 ffff88021db95180 0000000000000046
[64582.950077]  ffff8800ab607b38 ffffffff8109e03f ffffffffa03c94c0 ffffc90012907920
[64582.950084] Call Trace:
[64582.950102]  [<ffffffff8109e03f>] ? try_to_wake_up+0x1ff/0x2e0
[64582.950161]  [<ffffffffa03c94c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[64582.950171]  [<ffffffff8109e182>] ? default_wake_function+0x12/0x20
[64582.950178]  [<ffffffff810b1545>] ? __wake_up_common+0x55/0x90
[64582.950188]  [<ffffffff8150a5a9>] schedule+0x29/0x70
[64582.950195]  [<ffffffff8150a894>] io_schedule+0x94/0xf0
[64582.950212]  [<ffffffffa027c15c>] cv_wait_common+0xac/0x1b0 [spl]
[64582.950247]  [<ffffffffa03c94c0>] ? zio_taskq_member.isra.4+0x80/0x80 [zfs]
[64582.950254]  [<ffffffff810b1c50>] ? __wake_up_sync+0x20/0x20
[64582.950268]  [<ffffffffa027c2b8>] __cv_wait_io+0x18/0x20 [spl]
[64582.950303]  [<ffffffffa03cb343>] zio_wait+0x113/0x1d0 [zfs]
[64582.950344]  [<ffffffffa0358bf1>] dsl_pool_sync+0xc1/0x480 [zfs]
[64582.950414]  [<ffffffffa03714ed>] spa_sync+0x42d/0xb10 [zfs]
[64582.950423]  [<ffffffff810d49f8>] ? ktime_get_ts+0x48/0xf0
[64582.950467]  [<ffffffffa0382202>] txg_sync_thread+0x382/0x5f0 [zfs]
[64582.950513]  [<ffffffffa0381e80>] ? txg_delay+0xf0/0xf0 [zfs]
[64582.950527]  [<ffffffffa02742ca>] thread_generic_wrapper+0x7a/0x90 [spl]
[64582.950539]  [<ffffffffa0274250>] ? __thread_exit+0xa0/0xa0 [spl]
[64582.950547]  [<ffffffff8108d44a>] kthread+0xea/0x100
[64582.950555]  [<ffffffff8108d360>] ? kthread_create_on_node+0x1b0/0x1b0
[64582.950564]  [<ffffffff815164fc>] ret_from_fork+0x7c/0xb0
[64582.950570]  [<ffffffff8108d360>] ? kthread_create_on_node+0x1b0/0x1b0

Rebooting fixes the issue for a while.

zfs-git 0.6.3_r0_g07dabd2_3.15.8_1-1
zfs-utils-git 0.6.3_r0_g07dabd2_3.15.8_1-1

"Plenty" of free memory:

% free -h
             total       used       free     shared    buffers     cached
Mem:          7.8G       7.0G       815M       1.0M        22M       1.0G
-/+ buffers/cache:       6.0G       1.8G
Swap:         4.0G         0B       4.0G

@ioquatix

ioquatix commented Aug 7, 2014

Some more details:

% cat /proc/spl/kstat/zfs/arcstats 
5 1 0x01 85 4080 7662692400 65263049125064
name                            type data
hits                            4    20891234
misses                          4    1208797
demand_data_hits                4    13670171
demand_data_misses              4    45926
demand_metadata_hits            4    7026611
demand_metadata_misses          4    990911
prefetch_data_hits              4    12297
prefetch_data_misses            4    52519
prefetch_metadata_hits          4    182155
prefetch_metadata_misses        4    119441
mru_hits                        4    1874691
mru_ghost_hits                  4    246906
mfu_hits                        4    18822145
mfu_ghost_hits                  4    685583
deleted                         4    613332
recycle_miss                    4    698978
mutex_miss                      4    778
evict_skip                      4    49180722
evict_l2_cached                 4    0
evict_l2_eligible               4    77391587328
evict_l2_ineligible             4    5868750848
hash_elements                   4    237737
hash_elements_max               4    273096
hash_collisions                 4    2011638
hash_chains                     4    70752
hash_chain_max                  4    11
p                               4    0
c                               4    2147483648
c_min                           4    4194304
c_max                           4    2147483648
size                            4    2069418736
hdr_size                        4    92223840
data_size                       4    92450304
meta_size                       4    716447232
other_size                      4    1168297360
anon_size                       4    180736
anon_evict_data                 4    0
anon_evict_metadata             4    0
mru_size                        4    617788928
mru_evict_data                  4    76446208
mru_evict_metadata              4    0
mru_ghost_size                  4    1529663488
mru_ghost_evict_data            4    713060352
mru_ghost_evict_metadata        4    816603136
mfu_size                        4    190927872
mfu_evict_data                  4    16000000
mfu_evict_metadata              4    32768
mfu_ghost_size                  4    513547264
mfu_ghost_evict_data            4    299622400
mfu_ghost_evict_metadata        4    213924864
l2_hits                         4    0
l2_misses                       4    0
l2_feeds                        4    0
l2_rw_clash                     4    0
l2_read_bytes                   4    0
l2_write_bytes                  4    0
l2_writes_sent                  4    0
l2_writes_done                  4    0
l2_writes_error                 4    0
l2_writes_hdr_miss              4    0
l2_evict_lock_retry             4    0
l2_evict_reading                4    0
l2_free_on_write                4    0
l2_abort_lowmem                 4    0
l2_cksum_bad                    4    0
l2_io_error                     4    0
l2_size                         4    0
l2_asize                        4    0
l2_hdr_size                     4    0
l2_compress_successes           4    0
l2_compress_zeros               4    0
l2_compress_failures            4    0
memory_throttle_count           4    0
duplicate_buffers               4    0
duplicate_buffers_size          4    0
duplicate_reads                 4    2
memory_direct_count             4    0
memory_indirect_count           4    4359
arc_no_grow                     4    0
arc_tempreserve                 4    0
arc_loaned_bytes                4    0
arc_prune                       4    612476
arc_meta_used                   4    1976968432
arc_meta_limit                  4    1610612736
arc_meta_max                    4    2020546568
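
One thing that stands out in the dump above is that `arc_meta_used` (1976968432) is higher than `arc_meta_limit` (1610612736). Whether that is related to the hang is not established here. The sketch below parses the kstat text and reports the overage; `parse_arcstats` and `meta_overage` are illustrative helper names, not part of any ZFS tooling.

```python
def parse_arcstats(text):
    """Parse kstat text of the form 'name  type  data' into {name: int}."""
    stats = {}
    for line in text.splitlines():
        fields = line.split()
        # Data rows have exactly three fields with an integer value; this
        # skips the numeric header line and the 'name type data' row.
        if len(fields) == 3 and fields[2].isdigit():
            stats[fields[0]] = int(fields[2])
    return stats


def meta_overage(stats):
    """Bytes by which arc_meta_used exceeds arc_meta_limit (0 if within)."""
    return max(0, stats["arc_meta_used"] - stats["arc_meta_limit"])
```

On a live system the input would come from reading `/proc/spl/kstat/zfs/arcstats`. For the values posted above, the overage is 366355696 bytes, roughly 349 MiB.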

@behlendorf
Contributor

Can you dump the rest of the threads using sysrq-t when this happens again. One of the z_* threads should be handling some part of that IO and it would be useful to see what it's blocked on.

@TheLQ

TheLQ commented Aug 24, 2014

I'm fairly certain I'm running into this issue. My ZFS machine has only 2GB of RAM but loads the module with zfs_arc_max=1610612736. Under heavy IO, every 10-30 minutes the entire pool locks up for a minute or two, but afterwards it goes back to working.
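
To put that setting in perspective, here is a quick back-of-the-envelope check of how much of the machine's memory the ARC cap covers. It assumes "2GB" means 2 GiB of RAM; `arc_share` is just an illustrative helper.

```python
GIB = 1024 ** 3


def arc_share(zfs_arc_max_bytes, ram_bytes):
    """Return the ARC cap as a fraction of total RAM."""
    return zfs_arc_max_bytes / ram_bytes


arc_max = 1610612736   # zfs_arc_max value quoted above
ram = 2 * GIB          # assumption: "2GB" means 2 GiB
print(f"{arc_max / GIB:.2f} GiB cap, {arc_share(arc_max, ram):.0%} of RAM")
# prints "1.50 GiB cap, 75% of RAM"
```

Under that assumption the ARC is allowed to grow to three quarters of physical memory, which leaves about 512 MiB for everything else.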

Ubuntu 14.04
0.6.3-2~trusty

Right before it unlocks dmesg gets filled with this: https://gist.github.com/LordQuackstar/a05c1a0b4a166b3eb2fe

sysrq-t output in dmesg: https://gist.github.com/LordQuackstar/2e3517c5a3df0d755e40

@ioquatix

I had this happen again, but the machine eventually recovered. I just updated to the latest Arch release (0.6.3) and will report back again. @behlendorf I'll try to catch it and give you the details you asked for.

@ioquatix

@ryao
Contributor

ryao commented Sep 13, 2014

@ioquatix Would you provide a description of your hardware and pool configuration?

@ioquatix

H/W path       Device  Class       Description
==============================================
                       system      ProLiant MicroServer (658553-371)
/0                     bus         Motherboard
/0/0                   memory      64KiB BIOS
/0/3                   processor   (To Be Filled By O.E.M.)
/0/3/4                 memory      256KiB L1 cache
/0/3/5                 memory      2MiB L2 cache
/0/14                  memory      8GiB System Memory
/0/14/0                memory      4GiB DIMM Synchronous 1333 MHz (0.8 ns)
/0/14/1                memory      4GiB DIMM Synchronous 1333 MHz (0.8 ns)
/0/100                 bridge      RS880 Host Bridge
/0/100/2               bridge      RS780 PCI to PCI bridge (ext gfx port 0)
/0/100/2/0             display     Caicos [Radeon HD 6450/7450/8450 / R5 230 OEM]
/0/100/2/0.1           multimedia  Caicos HDMI Audio [Radeon HD 6400 Series]
/0/100/4               bridge      RS780/RS880 PCI to PCI bridge (PCIE port 0)
/0/100/4/0             storage     SiI 3132 Serial ATA Raid II Controller
/0/100/6               bridge      RS780 PCI to PCI bridge (PCIE port 2)
/0/100/6/0     enp3s0  network     NetXtreme BCM5723 Gigabit Ethernet PCIe
/0/100/11              storage     SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode]
/0/100/12              bus         SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
/0/100/12/1    usb4    bus         OHCI PCI host controller
/0/100/12/1/5          input       Generic USB K/B
/0/100/12.2            bus         SB7x0/SB8x0/SB9x0 USB EHCI Controller
/0/100/12.2/1  usb1    bus         EHCI Host Controller
/0/100/13              bus         SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
/0/100/13/1    usb5    bus         OHCI PCI host controller
/0/100/13.2            bus         SB7x0/SB8x0/SB9x0 USB EHCI Controller
/0/100/13.2/1  usb2    bus         EHCI Host Controller
/0/100/14              bus         SBx00 SMBus Controller
/0/100/14.1            storage     SB7x0/SB8x0/SB9x0 IDE Controller
/0/100/14.3            bridge      SB7x0/SB8x0/SB9x0 LPC host controller
/0/100/14.4            bridge      SBx00 PCI to PCI Bridge
/0/100/16              bus         SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
/0/100/16/1    usb6    bus         OHCI PCI host controller
/0/100/16.2            bus         SB7x0/SB8x0/SB9x0 USB EHCI Controller
/0/100/16.2/1  usb3    bus         EHCI Host Controller
/0/101                 bridge      Family 10h Processor HyperTransport Configuration
/0/102                 bridge      Family 10h Processor Address Map
/0/103                 bridge      Family 10h Processor DRAM Controller
/0/104                 bridge      Family 10h Processor Miscellaneous Control
/0/105                 bridge      Family 10h Processor Link Control

@ioquatix

  pool: backup
 state: ONLINE
  scan: none requested
config:

    NAME                                        STATE     READ WRITE CKSUM
    backup                                      ONLINE       0     0     0
      ata-eSATA-2_ExternalRAID_WD-WCAZA6489285  ONLINE       0     0     0

errors: No known data errors

  pool: tank
 state: ONLINE
  scan: scrub repaired 0 in 30h18m with 0 errors on Sun Jun 15 05:44:19 2014
config:

    NAME                                          STATE     READ WRITE CKSUM
    tank                                          ONLINE       0     0     0
      raidz1-0                                    ONLINE       0     0     0
        ata-WDC_WD30EZRX-00MMMB0_WD-WCAWZ2263503  ONLINE       0     0     0
        ata-WDC_WD30EZRX-00MMMB0_WD-WMAWZ0353148  ONLINE       0     0     0
        ata-WDC_WD30EZRX-00MMMB0_WD-WCAWZ2144377  ONLINE       0     0     0
        ata-WDC_WD30EZRX-00MMMB0_WD-WMAWZ0365295  ONLINE       0     0     0

errors: No known data errors

@ryao
Contributor

ryao commented Sep 15, 2014

@ioquatix I suspect that the kmem rework in openzfs/spl#369 and #2411 would have an effect here. I need to refresh that pull request as per @behlendorf's request. I will try to do that later today. Those requests resolve the atime issue that I described in my blog post as well as some other problems. It is conceivable that you are hitting one of them.

@ioquatix

@ryao Once the update is available let me know and I will try it.

@behlendorf
Contributor

Closing as stale.

@ioquatix

ioquatix commented Jul 8, 2017

I'm almost completely convinced that the problems I was experiencing were due to faulty hardware.


5 participants