Depad2 #11
Merged
…le in a padded reference
Note: this does not (yet) preserve additional @SQ line tags (e.g. AS, SP, UR).
Hi Heng,
This is a replacement/update for #6.
I've worked on the handling of superfluous CIGAR P operators in 'samtools depad', and found and fixed a buffer overflow when CIGAR D operators are used.
I also realized you were right originally not to accept CIGAR P operators in the input padded reference BAM file (much like it does not make sense to use CIGAR I in this case either), and restored that check.
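To illustrate the restored check, here is a minimal Python sketch (not the C code in this pull request) that rejects a read whose CIGAR contains I or P when the alignment is against a padded reference. The function names `parse_cigar` and `check_padded_reference_cigar` are hypothetical and chosen only for this example.

```python
import re

# CIGAR operation letters as listed in the SAM specification.
_CIGAR_FULL = re.compile(r"(?:\d+[MIDNSHP=X])+")
_CIGAR_ITEM = re.compile(r"(\d+)([MIDNSHP=X])")


def parse_cigar(cigar):
    """Split a CIGAR string such as '3M2D4M' into (length, op) pairs."""
    if cigar == "*":  # '*' means the CIGAR is unavailable
        return []
    if not _CIGAR_FULL.fullmatch(cigar):
        raise ValueError(f"malformed CIGAR: {cigar!r}")
    return [(int(n), op) for n, op in _CIGAR_ITEM.findall(cigar)]


def check_padded_reference_cigar(cigar):
    """Raise ValueError if a padded-reference CIGAR uses I or P.

    Hypothetical helper: against a padded reference, inserted bases
    line up with pad columns of the reference, so neither operator
    should be needed in the input.
    """
    for length, op in parse_cigar(cigar):
        if op in ("I", "P"):
            raise ValueError(
                f"CIGAR operator {op} not expected against a padded reference: {cigar!r}"
            )
```

Operators such as M and D pass through untouched; only I and P trigger the error.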
The command line API has been extended to support SAM or BAM for input or output, and you can (and should) provide a padded reference FASTA file.
If a padded reference FASTA file is provided, the reference lengths in the header (including the @SQ lines in the SAM header) are updated. If not, a warning is given. My preferred way to handle the case where no FASTA file is given would be to use any embedded references, which requires enhancements to the BAI file (see mailing list). Alternatively, a pre-scan of the file could be made to find the embedded references.
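As a rough illustration of the header update, the Python sketch below derives an unpadded length from a padded reference sequence and rewrites the LN field of an @SQ line. It assumes that pad columns in the padded FASTA are written as `-` or `*`; that convention and the helper names (`PAD_CHARS`, `unpadded_length`, `padded_to_unpadded`, `update_sq_line`) are assumptions for this example, not taken from the patch.

```python
# Assumption: pad columns in the padded reference are '-' or '*'.
PAD_CHARS = frozenset("-*")


def unpadded_length(padded_seq):
    """Count the columns of a padded sequence that hold a real base."""
    return sum(1 for base in padded_seq if base not in PAD_CHARS)


def padded_to_unpadded(padded_seq):
    """Map each padded column to the number of real bases before it."""
    mapping = []
    seen = 0
    for base in padded_seq:
        mapping.append(seen)
        if base not in PAD_CHARS:
            seen += 1
    return mapping


def update_sq_line(sq_line, new_length):
    """Return an @SQ header line with its LN field replaced.

    Other tab-separated fields are passed through unchanged in this sketch.
    """
    fields = sq_line.rstrip("\n").split("\t")
    fields = [f"LN:{new_length}" if f.startswith("LN:") else f for f in fields]
    return "\t".join(fields)
```

For example, a padded sequence `AC--GT*A` has eight columns but five real bases, so an `@SQ` line with `LN:8` would be rewritten with `LN:5`.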
Peter