Collisions in the adapter index should be handled more gracefully #734

Closed
marcelm opened this issue Oct 4, 2023 · 0 comments

marcelm (Owner) commented Oct 4, 2023

Given these two prefix adapters (plus some more so that the indexing logic is triggered) and allowing two errors:

^AAGCGCCAT
^AGGCATCAT

Cutadapt does not trim this query sequence at all:

ANGCATCATAAAAAAAAAA

And for this query, the first adapter is trimmed although the second one would have fewer errors:

AAGCATCATAAAAAAAAAA

Here are the alignments for the two adapters against the first query:

AAGCGCCAT
 X  XX
ANGCATCATAAAAAAAAAA

AGGCATCAT
 X
ANGCATCATAAAAAAAAAA
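
The error counts in these alignments can be reproduced with a few lines of Python. This is only an illustrative sketch, not Cutadapt code: it assumes substitutions only (no indels), compares the adapter against the start of the read, and treats `N` as an ordinary mismatching character. The helper names `mismatch_line` and `count_mismatches` are made up for this example. Under these assumptions, the first adapter has three mismatches against the first query (more than the two allowed) and the second has one; against the second query, the counts are two and one.

```python
def mismatch_line(adapter, read):
    """Return a marker line with 'X' under every mismatching position."""
    # zip() stops at the end of the adapter, so only the read prefix is compared.
    return "".join(" " if a == r else "X" for a, r in zip(adapter, read))


def count_mismatches(adapter, read):
    """Count substitutions between the adapter and the read prefix."""
    return sum(a != r for a, r in zip(adapter, read))


adapters = ["AAGCGCCAT", "AGGCATCAT"]
queries = ["ANGCATCATAAAAAAAAAA", "AAGCATCATAAAAAAAAAA"]

for query in queries:
    for adapter in adapters:
        print(adapter)
        print(mismatch_line(adapter, query))
        print(query)
        print("mismatches:", count_mismatches(adapter, query))
        print()
```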

There is a warning when creating the index that there exist two sequences that cannot be matched uniquely. However, for the above sequences, it is clear which one is the better match.

This can lead to incorrect demultiplexing for adapters that can collide (even if the sequence that occurs in the read can unambiguously be assigned).
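
One possible way to handle such collisions is sketched below. It is a toy model under stated assumptions and not Cutadapt's actual index or the fix in the commit referenced in this issue: the index is a plain dictionary over all substitution-only variants of each adapter, the alphabet is restricted to `ACGT` (so it says nothing about the query containing `N`), and all adapters have the same length. The names `variants`, `build_index` and `lookup` are hypothetical. When two adapters produce the same variant, the entry with fewer errors wins, and only exact ties are marked ambiguous.

```python
from itertools import combinations, product

ALPHABET = "ACGT"  # assumption: no wildcard characters such as N


def variants(seq, max_errors):
    """Yield (variant, k) for every sequence at exactly k <= max_errors substitutions."""
    for k in range(max_errors + 1):
        for positions in combinations(range(len(seq)), k):
            # At each chosen position, try every base that differs from the original.
            alternatives = [[c for c in ALPHABET if c != seq[p]] for p in positions]
            for replacement in product(*alternatives):
                chars = list(seq)
                for p, c in zip(positions, replacement):
                    chars[p] = c
                yield "".join(chars), k


def build_index(adapters, max_errors):
    """Map each variant to its best (adapter, errors); collect exact ties separately."""
    index = {}
    ambiguous = set()
    for adapter in adapters:
        for variant, errors in variants(adapter, max_errors):
            previous = index.get(variant)
            if previous is None or errors < previous[1]:
                # New entry, or a strictly better match replaces the old one.
                index[variant] = (adapter, errors)
                ambiguous.discard(variant)
            elif errors == previous[1] and previous[0] != adapter:
                # Two adapters match equally well: this variant is truly ambiguous.
                ambiguous.add(variant)
    return index, ambiguous


def lookup(index, ambiguous, read, length):
    """Return (adapter, errors) for the read prefix, or None if absent or ambiguous."""
    prefix = read[:length]
    if prefix in ambiguous:
        return None
    return index.get(prefix)


index, ambiguous = build_index(["AAGCGCCAT", "AGGCATCAT"], max_errors=2)
print(lookup(index, ambiguous, "AAGCATCATAAAAAAAAAA", 9))  # ('AGGCATCAT', 1)
print(lookup(index, ambiguous, "ACGCGTCATAAAAAAAAAA", 9))  # None (two errors to both)
```

In this model, the second query from above resolves to the second adapter with one error, while a prefix such as `ACGCGTCAT`, which is two substitutions away from both adapters, stays unassigned.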

marcelm closed this as completed in 511a8d2 on Oct 4, 2023