Some confusion about the seq_split and the paried fasta file #42

AntetokounmJie · 2020-07-30T12:34:05Z

Hi sir,

I use seqkit to filt out the sequence that contain "N" in my short reads file.

But i ignore the "paired" issue at the first.

I just use the filtered-not-totally-paired short reads to polish my genome.

The seq_split is working fine with the filtered-not-totally-paired short reads.

There is no bug info during the whole analysis.

But i am not sure that whether this kind of file can generate the right polished genome.

Do i need to filter out the unpaired reads after i filter out the containing "N" reads to get the good-paired NGS_1.fq and NGS_2.fq ?

Thanks a lot.

moold · 2020-07-31T00:59:44Z

Yes, you should filter out the unpaired reads. seq_split dose not check whether your input files are legal if you set paired option.

AntetokounmJie · 2020-07-31T01:21:37Z

Yes, you should filter out the unpaired reads. seq_split dose not check whether your input files are legal if you set paired option.

Thanks a lot
Have a nice day : )

AntetokounmJie closed this as completed Jul 31, 2020

Provide feedback