You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for developping Racon, I use it all the time 😃
I am polishing amplicon sequences (they are short, usually less than 1000bp), and I noticed that racon sometimes introduced small indels at position 500 of the consensus, which I suppose is linked with the fact that 500 is the size of the default window. Do you know where this issue might come from ?
Around position 500 of polished_0.fasta I obtain the sequence "TGTGCAGATTTTTGACAA", which is in none of the reads and should instead be "TGTGTGCGATTTTTGACAA".
Thanks in advance
The text was updated successfully, but these errors were encountered:
Hi,
You are correct. There is unfortunately a side effect of windowing, if the window boundary happens to fall on an indel region.
There are a couple of options you may try:
Run Racon twice, either with the same (default) window size, or with a slightly different window size. If your input data has a bias towards insertions or deletions, then your consensus sequence will change in length, and the window boundaries in the second round should be different than in the first round and you can just run it with the same default window size. If your consensus does not vary much in length, try using a slightly larger/smaller window size for the second round.
Since your target sequences are only ~1000bp in length, you can try to bump up the window size to 1000bp or more, and produce a consensus as a single window. That way you will avoid windowing issues altogether.
Hello,
Thanks for developping Racon, I use it all the time 😃
I am polishing amplicon sequences (they are short, usually less than 1000bp), and I noticed that racon sometimes introduced small indels at position 500 of the consensus, which I suppose is linked with the fact that 500 is the size of the default window. Do you know where this issue might come from ?
Attached is one very small example:
racon_pb.gz
I have v1.4.20 and here is the command lines used:
Around position 500 of polished_0.fasta I obtain the sequence "TGTGCAGATTTTTGACAA", which is in none of the reads and should instead be "TGTGTGCGATTTTTGACAA".
Thanks in advance
The text was updated successfully, but these errors were encountered: