Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reads are good but throws error: "Sequence/quality scores do not match" #12

Closed
pjaborges opened this issue Oct 30, 2019 · 4 comments
Closed

Comments

@pjaborges
Copy link

I try GNmerge in Linux but is not running with my simulated datasets.

The header patterns are the following:

@gi|110798562|ref|NC_008261.1|-100.101.325660/1
AAGTTCATCATAGTTATTTTGAATAAAATTTAATCTATCAAGTATCATCTATTATCACTCCGTATACAGATTTTCATATTTTACAATTATAGCACACTAC
+
>G9GFGCFGGGG#G#8#G)E##G6GGGBGGGGCGGGGGEFGG8GG:CGGF9,G9EGGGFGGGGGG6GGGGFGGGGCGGGGFGGGGGGGGGGGGGGFGGCG
@gi|110798562|ref|NC_008261.1|-100.101.325660/2
TAGTAGTGGGCTCTCTTTGTAAAATATAAACATCCGTATACGGAGTGATAATAGATTATACTTGATAGATTAAATTTTATTGAAAATAAATATGATGAAC
+
C2C*G*5)(*@4(G##:GGGF4G3,*D*#G#G(G#G*E05GGGGGG+.E+*5DGFG*4G8G1G+G+*GG87CGGCFGEG0FGCGFGGG+GGGGGGGGGGF

My version seems to expect a " " as delimiter to create a single key. Thus, I was getting the error : ..... ": not matched in input files"
I add a " " before the "/" and it solve the issue. I, notice after that a new parameter (-t) was added to handle these situations.

After, another error prompted: "Sequence/quality scores do not match". This is thrown because of "ERRQUAL". The reads do not have any issue and I have been able to run the datasets with many other tools (BBMerge, USEARCH, FLASH, PEAR, etc...)

I am sharing a small dataset, in case you want to investigate what could be the problem?
reads_NC_008261.1.100.101.10_R1.fq.gz
reads_NC_008261.1.100.101.10_R2.fq.gz

Thanks

@jsh58
Copy link
Owner

jsh58 commented Oct 31, 2019

Thanks for the question, and for including the dataset.

Unfortunately, I cannot reproduce the error. Here is the command and verbose output:

$ ./NGmerge \
  -1 reads_NC_008261.1.100.101.10_R1.fq.gz \
  -2 reads_NC_008261.1.100.101.10_R2.fq.gz \
  -o merged \
  -t '/' \
  -v
Processing files: reads_NC_008261.1.100.101.10_R1.fq.gz,reads_NC_008261.1.100.101.10_R2.fq.gz
  Fragments (pairs of reads) analyzed: 162830
  Successfully stitched: 126562

As you can see, using -t '/' solves the header issue. I wonder if, when you were dealing with the header issue, you substituted ' ' before every /, thus possibly introducing spaces in some of the quality score strings.

@pjaborges
Copy link
Author

Yes, you are correct.
I introduced spaces in the quality score strings when doing the replace.

It worked perfectly!
Thanks

@Kaderi15
Copy link

Kaderi15 commented Feb 22, 2021

Dear Sir
I got the "Error! @ERR3385738.1.1: not matched in input files" when run NGmerge to merge sequences. Please help me, sir. Sir I have attached my sample sequence files herewith.
Command: $ ./NGmerge -1 CDTC160004_R1.fastq.gz -2 CDTC160004_R2.fastq.gz -t ' ' -o merged -v

CDTC160075_S95_L001_R1_trim.fastq.gz

CDTC160075_S95_L001_R2_trim.fastq.gz

@jsh58
Copy link
Owner

jsh58 commented Feb 22, 2021

Please open a New Issue. This is unrelated to #12

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants