SAM format screwup #9

Open
calkan opened this issue Dec 18, 2017 · 7 comments

@calkan
Collaborator

calkan commented Dec 18, 2017

After generating SAM with lordfast, converting it to BAM produces:

[W::sam_parse1] mapped query cannot have zero coordinate; treated as unmapped
....

25479 272 22816 0 50 54M * 0 0 TATATCGATATCTACACTCTAGATGCTCATGCTAGTCTCACAGCGACAGCTCAT * AS:i:0
12838 272 47754 0 50 60M * 0 0 CGAGTGATATGCAGTGAGCGAGATCACGCACTGCACAGCTGAGCAGAGCAGACTCATCTC * AS:i:0

and so on. Are you reporting the mapping position as 0-based? SAM defines it as 1-based; BAM stores it as 0-based.
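To make the 0-based versus 1-based question concrete, here is a minimal Python sketch. It is not lordfast's code: the helper names `to_sam_pos` and `is_mapped_with_zero_pos` are hypothetical, and the only facts it relies on are that SAM text uses a 1-based POS and that FLAG bit 0x4 marks an unmapped read.

```python
FLAG_UNMAPPED = 0x4  # SAM FLAG bit: segment unmapped


def to_sam_pos(zero_based_pos):
    """Convert a 0-based leftmost coordinate to the 1-based POS used in SAM text."""
    if zero_based_pos < 0:
        raise ValueError("coordinate must be non-negative")
    return zero_based_pos + 1


def is_mapped_with_zero_pos(sam_line):
    """Return True for a record whose FLAG says 'mapped' but whose POS is 0.

    This is the condition that the sam_parse1 warning above complains about.
    """
    fields = sam_line.rstrip("\n").split("\t")
    flag = int(fields[1])
    pos = int(fields[3])
    return (flag & FLAG_UNMAPPED) == 0 and pos == 0
```

With the first record quoted above (FLAG 272, POS 0), `is_mapped_with_zero_pos` returns True, because 272 does not have the 0x4 bit set.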

@calkan
Collaborator Author

calkan commented Dec 18, 2017

I added +1 to the coordinates, but I still get the same warning, and also this:

[E::sam_parse1] CIGAR and query sequence are of different length
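The check behind this error can be sketched in Python as follows. This is an illustration, not the htslib implementation: the function names are hypothetical, and the only rule assumed is the one from the SAM specification, that the lengths of the M, I, S, = and X operations must sum to the length of SEQ.

```python
import re

# One CIGAR operation: a length followed by an operator character.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")
# Operators that consume bases of the query sequence.
QUERY_CONSUMING = frozenset("MIS=X")


def cigar_query_length(cigar):
    """Return the number of query bases a CIGAR string accounts for."""
    if not re.fullmatch(r"(\d+[MIDNSHP=X])+", cigar):
        raise ValueError(f"malformed CIGAR: {cigar!r}")
    return sum(int(length) for length, op in CIGAR_OP.findall(cigar)
               if op in QUERY_CONSUMING)


def cigar_matches_seq(cigar, seq):
    """Return True when CIGAR and SEQ agree in length ('*' means not stored)."""
    if cigar == "*" or seq == "*":
        return True
    return cigar_query_length(cigar) == len(seq)
```

For example, `cigar_matches_seq("54M", seq)` holds for the 54-base read quoted in the first comment, while a record carrying `60M` with that same read would fail the check.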

@haghshenas
Collaborator

Thank you for reporting.
I'm on it.

@calkan
Collaborator Author

calkan commented Dec 19, 2017

this is probably related; the location is reported as a value just below UINT_MAX:

611456 0 32008 4294967294 50 ....

many more occurrences of near-UINT_MAX positions like this.
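The value 4294967294 equals 2^32 - 2, which is what a small negative number looks like when stored in an unsigned 32-bit integer. The Python sketch below only demonstrates that arithmetic; whether lordfast actually computes the position this way is an assumption, and the helper names are hypothetical.

```python
UINT32_MOD = 1 << 32          # number of distinct 32-bit values
INT32_MAX = (1 << 31) - 1     # largest POS a signed 32-bit field can hold


def as_uint32(value):
    """Reinterpret an integer as an unsigned 32-bit value (wraps on underflow)."""
    return value % UINT32_MOD


def as_int32(value):
    """Reinterpret the low 32 bits of an integer as a signed 32-bit value."""
    value %= UINT32_MOD
    return value - UINT32_MOD if value > INT32_MAX else value


def fits_sam_pos(pos):
    """Return True when pos lies in the range the SAM specification allows for POS."""
    return 0 <= pos <= INT32_MAX
```

Here `as_uint32(-2)` gives 4294967294 and `as_int32(4294967294)` gives -2, so the reported position is consistent with a coordinate that went two below zero before being printed as unsigned.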

@calkan
Collaborator Author

calkan commented Dec 19, 2017

use this: https://genome.sph.umich.edu/wiki/BamUtil:_validate

many occurrences of:

Record 15831
FAIL_PARSE: position, 4294967280, does not fit in a 32 bit signed int.

also 3 CIGAR fails according to the summary:

Number of records read = 4614502
Number of valid records = 4611875

Error Counts:
FAIL_PARSE: 2624
INVALID_CIGAR: 3
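A similar tally can be approximated with a few lines of Python when bamUtil is not at hand. This is a rough sketch, not a reimplementation of bamUtil: the function name and the counter keys (`pos_out_of_range`, `cigar_seq_mismatch`) are hypothetical and do not correspond to bamUtil's error codes, and only two checks are performed.

```python
import re
from collections import Counter

INT32_MAX = 2**31 - 1
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def tally_sam_problems(lines):
    """Count records, out-of-range POS values, and CIGAR/SEQ length mismatches."""
    counts = Counter()
    for line in lines:
        # Skip header lines and blank lines.
        if line.startswith("@") or not line.strip():
            continue
        counts["records"] += 1
        fields = line.rstrip("\n").split("\t")
        pos, cigar, seq = int(fields[3]), fields[5], fields[9]
        if not 0 <= pos <= INT32_MAX:
            counts["pos_out_of_range"] += 1
        if cigar != "*" and seq != "*":
            # Sum the lengths of the operations that consume query bases.
            query_len = sum(int(n) for n, op in CIGAR_OP.findall(cigar)
                            if op in "MIS=X")
            if query_len != len(seq):
                counts["cigar_seq_mismatch"] += 1
    return counts
```

It can be run over a file with `tally_sam_problems(open(path))`, where `path` is the SAM file to inspect.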

@haghshenas
Collaborator

Could you try the latest version on master and check if the problem persists?

@calkan
Collaborator Author

calkan commented Dec 21, 2017

CIGAR error is gone but this one is still here:

Record 953384
FAIL_PARSE: position, 4294967295, does not fit in a 32 bit signed int.

Record 1002084
FAIL_PARSE: position, 4294967295, does not fit in a 32 bit signed int.

@haghshenas
Collaborator

Hi Can,

This issue should be fixed now. Can you confirm?
