ERROR: Read name line should start with '@' #491

qquuzhao · 2023-05-26T09:28:18Z

ERROR: Read name line should start with '@'，I used fastp 0.23.3 to deal my data. But when I use fastp.0.23.0,it's not occured this ERROR。I think this version is missing to handling of these abnormal reads.Looking forward to your response.

sfchen · 2023-05-26T10:57:32Z

Can you upload the data that failed?

erinyoung · 2023-05-26T18:06:31Z

I am so glad that someone else ran into this problem!

I have some examples because I'm running into the same issue with some paired-end SARS-CoV-2 files that I use for a lot of my testing.

They are located at:
https://raw.githubusercontent.com/StaPH-B/docker-builds/master/tests/SARS-CoV-2/SRR13957123_1.fastq.gz
https://raw.githubusercontent.com/StaPH-B/docker-builds/master/tests/SARS-CoV-2/SRR13957123_2.fastq.gz

nh13 · 2023-05-26T22:08:12Z

s10.R1.fq.gz
s10.R2.fq.gz

fastp \
        --in1 s10.R1.fq.gz \
        --in2 s10.R2.fq.gz \
        --out1 s10_1.fastp.fastq.gz \
        --out2 s10_2.fastp.fastq.gz \
        --json s10.fastp.json \
        --html s10.fastp.html \
         \
         \
         \
        --thread 2 \
        --detect_adapter_for_pe

fastp --version
fastp 0.23.3

The smoking gun is this:

$ file s10.R1.fq.gz
s10.R1.fq.gz: Blocked GNU Zip Format (BGZF; gzip compatible), block length 8883

When I decompress and recompress with gzip, it runs just fine (similarly with decompressed FASTQ).

I also decompressed and recompressed with bgzip, and it fails.

zerobio · 2023-05-30T10:00:32Z

I also encountered this problem. I guess whether the differences as shown out result in the error ?

sfchen · 2023-05-30T11:35:12Z

@zerobio rooted it as you pointed.

I will fix and update it soon.

#491

sfchen · 2023-05-30T11:53:42Z

Please try v0.23.4

shenwei356 · 2023-05-30T14:16:55Z

Binary at http://opengene.org/fastp/fastp is not updated yet.

sfchen · 2023-05-30T14:18:44Z

Yes, that will be updated tomorrow.

sfchen · 2023-05-31T01:27:31Z

Pre-built binary was just updated. But the conda version is still waiting for auto bump-up.

matthdsm · 2023-05-31T08:47:56Z

Can confirm this issue is fixed in the latest version!

qquuzhao · 2023-06-01T09:45:47Z

Thank you very much！Yes，the issue isfixed.

nh13 · 2023-06-01T19:02:40Z

Confirms it works on bgzip'ed FASTQs posted above, thank-you!

- \r\n handling may cause reading a byte past end of buffer, parser fails - checking end-of-file condition can only be reliably done after a call to getLine() returns NULL. One particular case is that some gzip files contain empty gzip blocks at the end of the file, which can´t be predicted by the current eof() code Tested with files provided in issue OpenGene#491. This reverts commit 0ee1b3b, "fix a regression bug of FASTQ reader"

erinyoung mentioned this issue May 26, 2023

Fastp version 0.23.4 StaPH-B/docker-builds#671

Merged

9 tasks

sfchen added a commit that referenced this issue May 30, 2023

fix a regression bug of FASTQ reader

0ee1b3b

#491

qquuzhao closed this as completed Jun 1, 2023

wdu mentioned this issue Jun 15, 2023

Fix for 2 bugs in the fastq reader: #497

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ERROR: Read name line should start with '@' #491

ERROR: Read name line should start with '@' #491

qquuzhao commented May 26, 2023

sfchen commented May 26, 2023

erinyoung commented May 26, 2023

nh13 commented May 26, 2023

zerobio commented May 30, 2023

sfchen commented May 30, 2023

sfchen commented May 30, 2023

shenwei356 commented May 30, 2023

sfchen commented May 30, 2023

sfchen commented May 31, 2023

matthdsm commented May 31, 2023

qquuzhao commented Jun 1, 2023

nh13 commented Jun 1, 2023

ERROR: Read name line should start with '@' #491

ERROR: Read name line should start with '@' #491

Comments

qquuzhao commented May 26, 2023

sfchen commented May 26, 2023

erinyoung commented May 26, 2023

nh13 commented May 26, 2023

zerobio commented May 30, 2023

sfchen commented May 30, 2023

sfchen commented May 30, 2023

shenwei356 commented May 30, 2023

sfchen commented May 30, 2023

sfchen commented May 31, 2023

matthdsm commented May 31, 2023

qquuzhao commented Jun 1, 2023

nh13 commented Jun 1, 2023