Skip to content
xbdr86 edited this page Jun 8, 2018 · 6 revisions

The motif-consensus.fa file will have the following structure:

>miRNA_name miRNA_motif
miRNA_consensus_sequence

>hsa-miR-21-5p       TTATCAGACTGAT
TAGCTTATCAGACTGATGTTGA

where miRNA_motif will be the unique n-mer sequence that characterizes the uniquely the miRNA, and miRNA_consensus_sequence the canonical sequence of the miRNA we are comparing all isomiR to.

The following graph represents the percentage of unique motifs that can be generated from miRBase using different n-mer motif sequences in the center: miRBase coverage

In order to increase sensitivity to RNA editing events that may impact the miRNA sequence, the miRNA_motif takes into account the inclusion of ambiguous nucleotides.

Code Bases
R A or G
Y C or T
S G or C
W A or T
K G or T
M A or C
B C or G or T
D A or G or T
H A or C or T
V A or C or G
N any base

To activate this option, change the option ambiguous_letters to True in the config.yaml file. Running QuagmiR on this mode increases its runtime. The activation of this mode also allows the matching of eventual sequence with ambiguous nucleotides from the fastq file.

See on how to design unique motifs from miRBase entries