Help! My reads aren't aligning! #118
kclem
started this conversation in
Troubleshooting
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In your use of CRISPResso, you may come across some samples with low alignment rates. This guide will cover what could be causing the low alignment rates, and some troubleshooting steps to help you figure out what is going wrong.
A high alignment rate is critical for accurate CRISPR editing quantification. For this reason, the first figure in the report shows the alignment rate:
The left bar shows the number of reads in the input fastq.
The center bar shows the number of reads after preprocessing (adapter trimming (using Trimmomatic), read merging (using FLASH), and quality filtering). CRISPResso will attempt to align these reads to the amplicon sequences.
--trimmomatic_options_string
setting in CRISPResso. In particular, theMINLEN
andAVGQUAL
Trimmomatic settings are used, reads that are too short or have low quality are discarded.--auto
mode with no amplicon sequence).--min_paired_end_reads_overlap
(default: 10 for a 10bp minimum overlap between R1 and R2),--max_paired_end_reads_overlap
(default: 100 for a maximum 100bp overlap between R1 and R2) and--stringent_flash_merging
(default: false for a lenient flash merging threshold).--min_average_read_quality
,--min_single_bp_quality
. By default, no quality filters are applied.The right bar shows the number of reads that were successfully aligned to an amplicon in CRISPResso. By default, reads must have 60% homology to a reference sequence for successful alignment, meaning that 60% of the bases must match. So if your amplicon sequence is 100bp long, reads containing deletions more than 40bp long will be discarded.
--auto
parameter to detect possible reference amplicon sequences.--split_interleaved_input
.--default_min_aln_score
(default 60 for a minimum of 60% homology between the read and the reference sequence) and--amplicon_min_alignment_score
(default: 60 for a minimum of 60% homology between the read and the reference sequence. This parameter is a comma-separated string with a value for each amplicon (order given by comma-separated amplicon sequences)). The--default_min_aln_score
setting is applied to all amplicons unless they are specifically set to another value using the--amplicon_min_alignment_score
parameter.--default_min_aln_score 0
and--fastq_output
. This will report all alignments (even those with 0 homology) and produce the output file CRISPResso_output.fastq.gz with information for each read in the input fastq including the alignment scores to each amplicon. Use this file to determine whether reads discarded at the 60% threshold should be accepted, and what an acceptable--default_min_aln_score
would be.Beta Was this translation helpful? Give feedback.
All reactions