-
Notifications
You must be signed in to change notification settings - Fork 193
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
N-containing reads never align, so it can appear that we never succeed in including them #167
Comments
The problem is that the mapper cannot align sequences that have too many Ns in them! As a result we can include these particular edits in the graph, but no alignment will exactly match them. This breaks path labeling. So we can take the failed alignment and include it, yielding: vg mod -i 'gi|157734152:29655295-29712160-failed-edit.gam' 'gi|157734152:29655295-29712160-failed-edit.vg' But if you notice, these alternative paths are exactly equivalent, and a round of normalization will merge them back together. Then, if we align again we'll see that the N-containing fragment does not map, and conclude that our editing process failed as we can't get a perfect walk through the graph to match our input sequence. |
Two possible solutions.
The latter looks somewhat attractive. However, it isn't easy, because the editing semantics need to be adjusted to keep track of the reference-matching bits and also to tag the additional pieces that the edits add. |
Both solutions could be implemented. Neither would hurt. The concern is that there may still be parts of the graph that can't be aligned to (as in 1) but this problem "goes away" in 2. However... we still need to test for path inclusion if we want to debug graph construction. So. Hm. |
This has been resolved for some time. |
This demonstrates a failure (via msga) of path editing:
I've attached the graph and the alignment generating this at this point in the process.
failed-edit.gam.json.txt
failed-edit.vg.json.txt
Not clear what's going on, but now it's testable.
The text was updated successfully, but these errors were encountered: