Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"ERROR - runtime: error when scanning drive [Input/output error]" instead deetcting bad block #29

Open
marcin-github opened this issue Nov 12, 2020 · 7 comments

Comments

@marcin-github
Copy link

marcin-github commented Nov 12, 2020

Hello!
I'd like to ask you how to deal with situation like below.
I run bbf against HDD, hdd is connected to motherboard using controller H310 flashed to IT firmware (

01:00.0 RAID bus controller: Broadcom / LSI SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] (rev 03)
        Subsystem: Dell SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon]
        Kernel driver in use: mpt3sas
        Kernel modules: mpt3sas

)
Hdd has non zero values in Offline_Uncorrectable, Current_Pending_Sector and Reallocated_Sector_Ct fields.
In dmesg I have:

[26058.722312] mpt2sas_cm0: log_info(0x31080000): originator(PL), code(0x08), sub_code(0x0000)
[26058.722408] sd 9:0:1:0: [sde] tag#2800 UNKNOWN(0x2003) Result: hostbyte=0x0b driverbyte=0x00 cmd_age=2s
[26058.722487] sd 9:0:1:0: [sde] tag#2800 CDB: opcode=0x28 28 00 bb c8 cf 28 00 01 00 00
[26058.722563] blk_update_request: I/O error, dev sde, sector 3150499624 op 0x0:(READ) flags 0x80700 phys_seg 17 prio class 0
[26058.722660] sd 9:0:1:0: [sde] tag#2799 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 cmd_age=2s
[26058.722738] sd 9:0:1:0: [sde] tag#2799 Sense Key : 0x3 [current]
[26058.722795] sd 9:0:1:0: [sde] tag#2799 ASC=0x11 ASCQ=0x0
[26058.722852] sd 9:0:1:0: [sde] tag#2799 CDB: opcode=0x28 28 00 bb c8 ce 28 00 01 00 00
[26058.722927] blk_update_request: critical medium error, dev sde, sector 3150499517 op 0x0:(READ) flags 0x80700 phys_seg 7 prio class 0
[26061.096262] mpt2sas_cm0: log_info(0x31080000): originator(PL), code(0x08), sub_code(0x0000)
[26061.096345] mpt2sas_cm0: log_info(0x31080000): originator(PL), code(0x08), sub_code(0x0000)
[26061.096420] mpt2sas_cm0: log_info(0x31080000): originator(PL), code(0x08), sub_code(0x0000)
[26061.096499] mpt2sas_cm0: log_info(0x31080000): originator(PL), code(0x08), sub_code(0x0000)
[26061.096571] mpt2sas_cm0: log_info(0x31080000): originator(PL), code(0x08), sub_code(0x0000)
[26061.096644] mpt2sas_cm0: log_info(0x31080000): originator(PL), code(0x08), sub_code(0x0000)
[26061.096730] sd 9:0:1:0: [sde] tag#2757 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 cmd_age=2s
[26061.096806] sd 9:0:1:0: [sde] tag#2757 Sense Key : 0x3 [current]
[26061.096861] sd 9:0:1:0: [sde] tag#2757 ASC=0x11 ASCQ=0x0
[26061.096916] sd 9:0:1:0: [sde] tag#2757 CDB: opcode=0x28 28 00 bb c8 ce bd 00 00 01 00
[26061.096988] blk_update_request: critical medium error, dev sde, sector 3150499517 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[26061.097070] Buffer I/O error on dev sde2, logical block 1003013821, async page read

kernel 5.9.0

Is t something you can improve in bbf, maybe I should native SATA controller to connect hdd to motherboard? Maybe something else?

@trapexit
Copy link
Owner

Curious. bbf is issuing bog standard ATA commands as far as I know. I'll have to investigate more.

@r2binx
Copy link

r2binx commented Nov 30, 2020

I'm getting the same error message but my dmesg looks a bit different. Disk is also having Offline_Uncorrectable and Current_Pending_Sector values >0. It's plugged in straight to a mainboard sata port (intel z370 chipset).

[126894.900145] ata1.00: exception Emask 0x40 SAct 0x6 SErr 0x80800 action 0x6
[126894.900146] ata1.00: irq_stat 0x40000008
[126894.900147] ata1: SError: { HostInt 10B8B }
[126894.900149] ata1.00: failed command: READ FPDMA QUEUED
[126894.900151] ata1.00: cmd 60/00:08:e0:a7:61/01:00:0c:00:00/40 tag 1 ncq dma 131072 in
                         res 41/40:00:b8:a8:61/00:00:0c:00:00/00 Emask 0x449 (internal error) <F>
[126894.900151] ata1.00: status: { DRDY ERR }
[126894.900152] ata1.00: error: { UNC }
[126894.900153] ata1.00: failed command: READ FPDMA QUEUED
[126894.900155] ata1.00: cmd 60/00:10:e0:a8:61/01:00:0c:00:00/40 tag 2 ncq dma 131072 in
                         res 40/00:08:e0:a7:61/00:00:0c:00:00/40 Emask 0x40 (internal error)
[126894.900155] ata1.00: status: { DRDY }
[126894.900157] ata1: hard resetting link
[126895.214707] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[126895.215463] ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT0._GTF.DSSP], AE_NOT_FOUND (20200717/psargs-330)
[126895.215483] ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT0._GTF due to previous error (AE_NOT_FOUND) (20200717/psparse-529)
[126895.216580] ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT0._GTF.DSSP], AE_NOT_FOUND (20200717/psargs-330)
[126895.216600] ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT0._GTF due to previous error (AE_NOT_FOUND) (20200717/psparse-529)
[126895.216848] ata1.00: configured for UDMA/133
[126895.216874] sd 0:0:0:0: [sda] tag#1 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE cmd_age=4s
[126895.216879] sd 0:0:0:0: [sda] tag#1 Sense Key : Medium Error [current]
[126895.216884] sd 0:0:0:0: [sda] tag#1 Add. Sense: Unrecovered read error - auto reallocate failed
[126895.216889] sd 0:0:0:0: [sda] tag#1 CDB: Read(16) 88 00 00 00 00 00 0c 61 a7 e0 00 00 01 00 00 00
[126895.216895] blk_update_request: I/O error, dev sda, sector 207726776 op 0x0:(READ) flags 0x80700 phys_seg 3 prio class 0
[126895.216952] sd 0:0:0:0: [sda] tag#2 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE cmd_age=4s
[126895.216956] sd 0:0:0:0: [sda] tag#2 Sense Key : Illegal Request [current]
[126895.216960] sd 0:0:0:0: [sda] tag#2 Add. Sense: Unaligned write command
[126895.216965] sd 0:0:0:0: [sda] tag#2 CDB: Read(16) 88 00 00 00 00 00 0c 61 a8 e0 00 00 01 00 00 00
[126895.216969] blk_update_request: I/O error, dev sda, sector 207726816 op 0x0:(READ) flags 0x80700 phys_seg 15 prio class 0
[126895.216982] ata1: EH complete
[126899.772402] ata1.00: exception Emask 0x0 SAct 0x1fe000 SErr 0x0 action 0x0
[126899.772406] ata1.00: irq_stat 0x40000008
[126899.772411] ata1.00: failed command: READ FPDMA QUEUED
[126899.772421] ata1.00: cmd 60/01:68:b8:a8:61/00:00:0c:00:00/40 tag 13 ncq dma 512 in
                         res 41/40:00:b8:a8:61/00:00:0c:00:00/00 Emask 0x409 (media error) <F>
[126899.772424] ata1.00: status: { DRDY ERR }
[126899.772426] ata1.00: error: { UNC }
[126899.774080] ata1.00: configured for UDMA/133
[126899.774106] sd 0:0:0:0: [sda] tag#13 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE cmd_age=4s
[126899.774113] sd 0:0:0:0: [sda] tag#13 Sense Key : Medium Error [current]
[126899.774118] sd 0:0:0:0: [sda] tag#13 Add. Sense: Unrecovered read error - auto reallocate failed
[126899.774123] sd 0:0:0:0: [sda] tag#13 CDB: Read(16) 88 00 00 00 00 00 0c 61 a8 b8 00 00 00 01 00 00
[126899.774129] blk_update_request: I/O error, dev sda, sector 207726776 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[126899.774137] Buffer I/O error on dev sda, logical block 207726776, async page read
[126899.774214] ata1: EH complete
[126904.081608] ata1.00: exception Emask 0x0 SAct 0x7f SErr 0x0 action 0x0
[126904.081613] ata1.00: irq_stat 0x40000008
[126904.081617] ata1.00: failed command: READ FPDMA QUEUED
[126904.081627] ata1.00: cmd 60/01:00:b9:a8:61/00:00:0c:00:00/40 tag 0 ncq dma 512 in
                         res 41/40:00:b9:a8:61/00:00:0c:00:00/00 Emask 0x409 (media error) <F>
[126904.081630] ata1.00: status: { DRDY ERR }
[126904.081632] ata1.00: error: { UNC }
[126904.083341] ata1.00: configured for UDMA/133
[126904.083366] sd 0:0:0:0: [sda] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE cmd_age=8s
[126904.083372] sd 0:0:0:0: [sda] tag#0 Sense Key : Medium Error [current]
[126904.083377] sd 0:0:0:0: [sda] tag#0 Add. Sense: Unrecovered read error - auto reallocate failed
[126904.083382] sd 0:0:0:0: [sda] tag#0 CDB: Read(16) 88 00 00 00 00 00 0c 61 a8 b9 00 00 00 01 00 00
[126904.083388] blk_update_request: I/O error, dev sda, sector 207726777 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[126904.083395] Buffer I/O error on dev sda, logical block 207726777, async page read
[126904.083464] ata1: EH complete

@trapexit
Copy link
Owner

That's a very different error. As it says:

Sense: Unrecovered read error - auto reallocate failed
blk_update_request: I/O error, dev sda, sector 207726777 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0

How many logical blocks is the device? It's saying that 207726777 is bad. The kernel always lists errors it gets when the device reports it.

@r2binx
Copy link

r2binx commented Nov 30, 2020

I'm sorry with the same error message I meant that bbf aborts the scan with "ERROR - runtime: error when scanning drive [Input/output error]" as well when it reaches the faulty sector.
It's 11721045168 blocks at a size of 512B.

@trapexit
Copy link
Owner

What settings are you using?

@r2binx
Copy link

r2binx commented Nov 30, 2020

I wanted to scan for all bad blocks (SMART reports multiple) with bbf scan /dev/sda

@trapexit
Copy link
Owner

OK. There is a big difference between -t ata and -t os. It defaults to os.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants