Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

seqan::UnexpectedEnd #9

Closed
eldariont opened this issue Mar 2, 2017 · 3 comments
Closed

seqan::UnexpectedEnd #9

eldariont opened this issue Mar 2, 2017 · 3 comments
Assignees

Comments

@eldariont
Copy link

Hi Jongkyu,

when I run Vaquita:
vaquita -cg hg19.fa input.bam > output.vcf

I'm getting the following exception:

[2017-3-2 15:15:51] [START] Split-read analysis
[2017-3-2 15:15:57] [END] Split-read analysis (6 seconds.)
[2017-3-2 15:15:57] [START] Paired-end analysis
[2017-3-2 15:15:57] [END] Paired-end analysis (0 seconds.)
[2017-3-2 15:15:57] [START] Clipped-read analysis
[2017-3-2 15:15:57] [END] Clipped-read analysis (0 seconds.)
[2017-3-2 15:15:57] [END] BREAKPOINT IDENTIFICATION (0 seconds.)
[2017-3-2 15:15:57] Found breakpoints
[2017-3-2 15:15:57] 24428 from split-read evidences.
[2017-3-2 15:15:57] 0 from read-pair evidences.
[2017-3-2 15:15:57] 1708878 from soft-clipped evidences.
[2017-3-2 15:15:57] [START] BREAKPOINT MERGING
[2017-3-2 15:15:57] [START] From split-read evidences.
[2017-3-2 15:15:57] [END] From split-read evidences. (0 seconds.)
[2017-3-2 15:15:57] [START] From read-pair evidences.
[2017-3-2 15:15:57] [END] From read-pair evidences. (0 seconds.)
[2017-3-2 15:15:57] [START] From soft-clipped evidences.
terminate called after throwing an instance of 'seqan::UnexpectedEnd'
  what():  Unexpected end of input.
Aborted

I already tried with different input files and recreated hg19.fa.fai but all with the same result. Any idea why this is happening? 🙂

@xenigmax
Copy link
Collaborator

xenigmax commented Mar 2, 2017

Hi David, thank you for the reporting.

What I think weird is this part since it's not possible.

[2017-3-2 15:15:57] 0 from read-pair evidences

What kind of dataset did you use?
Vaquita needs short & paired-end dataset.
eg. The current version does not support long-reads dataset.

hg19.fa must contain all the reference sequences that were used in making .bam files.
Sometimes it can be different. For example, one can consider mitochondrial genome in alignment while your hg19 does not contain it.

@xenigmax xenigmax closed this as completed Mar 2, 2017
@xenigmax xenigmax reopened this Mar 2, 2017
@xenigmax
Copy link
Collaborator

xenigmax commented Mar 2, 2017

Regarding 2 you can check it like this

shell > samtools view -H input.bam

@hd VN:1.5 SO:coordinate
@sq SN:chr10 LN:135534747
@sq SN:chr11 LN:135006516
@sq SN:chr11_gl000202_random LN:40103
.
.
@sq SN:chrX LN:155270560
@sq SN:chrY LN:59373566
@rg ID:1 LB:lib1 PL:illumina SM:20 PU:unit1

All the name after "SN" should be listed like this.

shell> cat hg19.fa | grep ">"
>chr10
>chr11
>chr11_gl000202_random
.
.
>chrX
>chrY

@xenigmax xenigmax self-assigned this Mar 2, 2017
@eldariont
Copy link
Author

Thanks for the super quick reply!
Yeah, the cause of the error is that I used long read data. I didn't catch that Vaquita isn't for long reads just yet. So I will try again with short paired end reads.

Thanks again!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants