Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

high rate of reads unmapped: too short #1959

Open
z626093820 opened this issue Sep 28, 2023 · 5 comments
Open

high rate of reads unmapped: too short #1959

z626093820 opened this issue Sep 28, 2023 · 5 comments

Comments

@z626093820
Copy link

When I use STAR for alignment, many reads do not align. How can I resolve this? THANK YOU !

below is my parameter:

STAR --runThreadN 15 --runMode alignReads --quantMode TranscriptomeSAM GeneCounts --sjdbOverhang 149 --twopassMode Basic --outSAMtype BAM SortedByCoordinate --outBAMsortingThreadN 15 --limitSjdbInsertNsj 12115510 --genomeDir /storage/zuozd/genome/ipomoea_batatas_tai6/star-gffreadgtf149/ --readFilesIn /storage/zuozd/20230921bleeding_sap/00cleandata/${i}_1.clean.fq.gz /storage/zuozd/20230921bleeding_sap/00cleandata/${i}_1.clean.fq.gz --readFilesCommand zcat --outFileNamePrefix ${i}

image

@z626093820
Copy link
Author

When I use hisat2 comparison software, the comparison rate is high, why is it
6e1419b13d89744fc5f7ebaec9ac519

@alexdobin
Copy link
Owner

It could be an issue with pairing of reads in FASTQ files. You can test it by mapping Read1 and Read2 separately.

@z626093820
Copy link
Author

z626093820 commented Oct 13, 2023

It could be an issue with pairing of reads in FASTQ files. You can test it by mapping Read1 and Read2 separately.

When I map the Read1 and Read2 files separately, the map rate of the two files reaches more than 80%,how can i slove the problem about the pairing of reads in FASTQ files? THANK YOU

@alexdobin
Copy link
Owner

If you are trimming reads before mapping, I would try to map untrimmed (raw) reads.

@z626093820
Copy link
Author

If you are trimming reads before mapping, I would try to map untrimmed (raw) reads.

When I analyze the new command parameters given by the sequencing company, the mapping rate has reached 70%. Excuse me, which of these parameters have played a key role?THANK YOU!

the low mapping rate command :
STAR --runThreadN 15 --runMode alignReads --quantMode TranscriptomeSAM GeneCounts --sjdbOverhang 149 --twopassMode Basic --outSAMtype BAM SortedByCoordinate --outBAMsortingThreadN 15 --limitSjdbInsertNsj 12115510 --genomeDir /storage/zuozd/genome/ipomoea_batatas_tai6/star-gffreadgtf149/ --readFilesIn /storage/zuozd/20230921bleeding_sap/00cleandata/${i}_1.clean.fq.gz /storage/zuozd/20230921bleeding_sap/00cleandata/${i}_1.clean.fq.gz --readFilesCommand zcat --outFileNamePrefix ${i}

the high mapping rate command:
STAR --runThreadN 15 --runMode alignReads --readFilesCommand zcat --quantMode GeneCounts --outSAMtype BAM SortedByCoordinate --outFilterType BySJout --outFilterMultimapNmax 20 --alignSJoverhangMin 8 --alignSJDBoverhangMin 1 --alignIntronMin 20 --alignIntronMax 1000000 --alignMatesGapMax 1000000 --outFilterMismatchNmax 2 --outReadsUnmapped Fastx --twopassMode Basic --chimSegmentMin 12 --chimJunctionOverhangMin 12 --chimSegmentReadGapMax 3 --alignSJstitchMismatchNmax 5 -1 5 5 --chimOutJunctionFormat 1 --genomeDir /storage/zuozd/genome/ipomoea_batatas_tai6/star-gffreadgtf149/ --readFilesIn /storage/zuozd/20230921bleeding_sap/00cleandata/L58-1_1.clean.fq.gz /storage/zuozd/20230921bleeding_sap/00cleandata/L58-1_2.clean.fq.gz --outFileNamePrefix L58-1

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants