Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

umount fails because of stale loopback during interrupted run of ltp. #28

Open
stevenjswanson opened this issue Jul 3, 2017 · 2 comments
Labels

Comments

@stevenjswanson
Copy link
Member

To reproduce, mount NOVA fs on /mnt/ramdisk

Then run

 /opt/ltp/runltp -f nova -d /mnt/ramdisk

and hit ctrl-C after a while. Try to unmount:

umount /mnt/ramdisk

if fails, try

losetup -d /dev/loop*
umount /mnt/ramdisk

If fails, that's the bug.

This shows up on console:

[ 32.256504] Oops: 0010 [#1] SMP
[ 32.257489] Modules linked in: nova(OE) libcrc32c rfcomm bnep coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel vmw_balloon aes_x86_64 crypto_simd dax_pmem cryptd glue_helper intel_rapl_perf snd_ens1371 gameport snd_ac97_codec ac97_bus snd_pcm snd_seq_midi snd_seq_midi_event joydev input_leds uvcvideo dax serio_raw snd_rawmidi nd_pmem nd_btt videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core videodev snd_seq snd_timer snd_seq_device btusb btrtl media btbcm btintel bluetooth snd soundcore nfit vmw_vsock_vmci_transport vsock shpchp i2c_piix4 vmw_vmci mac_hid parport_pc ppdev lp parport ip_tables x_tables autofs4 hid_generic usbhid hid psmouse vmwgfx drm_kms_helper ahci syscopyarea libahci sysfillrect e1000 sysimgblt fb_sys_fops ttm mptspi drm scsi_transport_spi
[ 32.279592] mptscsih pata_acpi mptbase fjes
[ 32.280914] CPU: 0 PID: 2746 Comm: loop0 Tainted: G OE 4.10.0-nova #8
[ 32.282976] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 07/02/2015
[ 32.285974] task: ffff995cc760da00 task.stack: ffffbca844b40000
[ 32.287649] RIP: 0010: (null)
[ 32.288838] RSP: 0018:ffffbca844b43d20 EFLAGS: 00010246
[ 32.290605] RAX: ffffffffc067ca80 RBX: ffff995cf3043fc0 RCX: 0000000000000001
[ 32.292616] RDX: ffff995cce468888 RSI: ffffbca844b43d68 RDI: ffff995cf3043fc0
[ 32.294925] RBP: ffffbca844b43dc0 R08: 0000000000001000 R09: ffff995cf3043f78
[ 32.297431] R10: 0000000000001000 R11: ffff995cce468888 R12: 0000000000000000
[ 32.300188] R13: 0000000000000000 R14: ffff995cce468888 R15: 0000000000000001
[ 32.302801] FS: 0000000000000000(0000) GS:ffff995cf9600000(0000) knlGS:0000000000000000
[ 32.305722] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 32.307547] CR2: 0000000000000000 CR3: 00000000add03000 CR4: 00000000003406f0
[ 32.309893] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 32.311959] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 32.314002] Call Trace:
[ 32.314736] ? lo_rw_aio+0x13c/0x290
[ 32.315726] ? update_curr+0xf3/0x170
[ 32.316810] loop_queue_work+0x938/0xac0
[ 32.317946] ? pick_next_task_fair+0x10d/0x4b0
[ 32.319171] ? __switch_to+0x23c/0x520
[ 32.320217] ? __schedule+0x230/0x6b0
[ 32.321295] kthread_worker_fn+0xf6/0x1c0
[ 32.322487] kthread+0x101/0x140
[ 32.323504] ? kthread_create_worker_on_cpu+0x70/0x70
[ 32.325156] ? kthread_create_on_node+0x60/0x60
[ 32.326747] ret_from_fork+0x2c/0x40
[ 32.327882] Code: Bad RIP value.
[ 32.328856] RIP: (null) RSP: ffffbca844b43d20
[ 32.330487] CR2: 0000000000000000
[ 32.331366] ---[ end trace c3c35d5ea58174ef ]---

@stevenjswanson
Copy link
Member Author

This is on a small VM, with just 2GB for NOVA.

@stevenjswanson
Copy link
Member Author

Seems likely this is related somehow to #40.

juno-kim pushed a commit that referenced this issue Oct 15, 2018
Crash dump shows following instructions

crash> bt
PID: 0      TASK: ffffffffbe412480  CPU: 0   COMMAND: "swapper/0"
 #0 [ffff891ee0003868] machine_kexec at ffffffffbd063ef1
 #1 [ffff891ee00038c8] __crash_kexec at ffffffffbd12b6f2
 #2 [ffff891ee0003998] crash_kexec at ffffffffbd12c84c
 #3 [ffff891ee00039b8] oops_end at ffffffffbd030f0a
 #4 [ffff891ee00039e0] no_context at ffffffffbd074643
 #5 [ffff891ee0003a40] __bad_area_nosemaphore at ffffffffbd07496e
 #6 [ffff891ee0003a90] bad_area_nosemaphore at ffffffffbd074a64
 #7 [ffff891ee0003aa0] __do_page_fault at ffffffffbd074b0a
 #8 [ffff891ee0003b18] do_page_fault at ffffffffbd074fc8
 #9 [ffff891ee0003b50] page_fault at ffffffffbda01925
    [exception RIP: qlt_schedule_sess_for_deletion+15]
    RIP: ffffffffc02e526f  RSP: ffff891ee0003c08  RFLAGS: 00010046
    RAX: 0000000000000000  RBX: 0000000000000000  RCX: ffffffffc0307847
    RDX: 00000000000020e6  RSI: ffff891edbc377c8  RDI: 0000000000000000
    RBP: ffff891ee0003c18   R8: ffffffffc02f0b20   R9: 0000000000000250
    R10: 0000000000000258  R11: 000000000000b780  R12: ffff891ed9b43000
    R13: 00000000000000f0  R14: 0000000000000006  R15: ffff891edbc377c8
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
 #10 [ffff891ee0003c20] qla2x00_fcport_event_handler at ffffffffc02853d3 [qla2xxx]
 #11 [ffff891ee0003cf0] __dta_qla24xx_async_gnl_sp_done_333 at ffffffffc0285a1d [qla2xxx]
 #12 [ffff891ee0003de8] qla24xx_process_response_queue at ffffffffc02a2eb5 [qla2xxx]
 #13 [ffff891ee0003e88] qla24xx_msix_rsp_q at ffffffffc02a5403 [qla2xxx]
 #14 [ffff891ee0003ec0] __handle_irq_event_percpu at ffffffffbd0f4c59
 #15 [ffff891ee0003f10] handle_irq_event_percpu at ffffffffbd0f4e02
 #16 [ffff891ee0003f40] handle_irq_event at ffffffffbd0f4e90
 #17 [ffff891ee0003f68] handle_edge_irq at ffffffffbd0f8984
 #18 [ffff891ee0003f88] handle_irq at ffffffffbd0305d5
 #19 [ffff891ee0003fb8] do_IRQ at ffffffffbda02a18
 --- <IRQ stack> ---
 #20 [ffffffffbe403d30] ret_from_intr at ffffffffbda0094e
    [exception RIP: unknown or invalid address]
    RIP: 000000000000001f  RSP: 0000000000000000  RFLAGS: fff3b8c2091ebb3f
    RAX: ffffbba5a0000200  RBX: 0000be8cdfa8f9fa  RCX: 0000000000000018
    RDX: 0000000000000101  RSI: 000000000000015d  RDI: 0000000000000193
    RBP: 0000000000000083   R8: ffffffffbe403e38   R9: 0000000000000002
    R10: 0000000000000000  R11: ffffffffbe56b820  R12: ffff891ee001cf00
    R13: ffffffffbd11c0a4  R14: ffffffffbe403d60  R15: 0000000000000001
    ORIG_RAX: ffff891ee0022ac0  CS: 0000  SS: ffffffffffffffb9
 bt: WARNING: possibly bogus exception frame
 #21 [ffffffffbe403dd8] cpuidle_enter_state at ffffffffbd67c6fd
 #22 [ffffffffbe403e40] cpuidle_enter at ffffffffbd67c907
 #23 [ffffffffbe403e50] call_cpuidle at ffffffffbd0d98f3
 #24 [ffffffffbe403e60] do_idle at ffffffffbd0d9b42
 #25 [ffffffffbe403e98] cpu_startup_entry at ffffffffbd0d9da3
 #26 [ffffffffbe403ec0] rest_init at ffffffffbd81d4aa
 #27 [ffffffffbe403ed0] start_kernel at ffffffffbe67d2ca
 #28 [ffffffffbe403f28] x86_64_start_reservations at ffffffffbe67c675
 #29 [ffffffffbe403f38] x86_64_start_kernel at ffffffffbe67c6eb
 #30 [ffffffffbe403f50] secondary_startup_64 at ffffffffbd0000d5

Fixes: 040036b ("scsi: qla2xxx: Delay loop id allocation at login")
Cc: <stable@vger.kernel.org> # v4.17+
Signed-off-by: Chuck Anderson <chuck.anderson@oracle.com>
Signed-off-by: Himanshu Madhani <himanshu.madhani@cavium.com>
Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
yh-raphael pushed a commit to yh-raphael/linux-nova that referenced this issue Aug 25, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant