-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Altered genotype?/handling translocations #17
Comments
Hi, As for your first question, Jasmine does not alter genotypes; it just copies them from the input VCFs. What does the genotype look like in the individual sample's VCF file? If it's different than what Jasmine is reporting, that would indicate a bug in Jasmine, but if it is also heterozygous that indicates Sniffles is mis-genotyping it. For your second question, I have not found a way to handle translocations when using tabix. I typically remove them as well. Best, |
Hi, thank you very much for your comment! Attached are the original and merged vcf files (VCF was rejected by Github! So I converted them to txt.) sixindivi.head.txt For example, at ssa01:132539, the genotype is 1/1 in the original tanner file, but the same locus is 1|0 in the merged vcf. The command I ran was : Now I am running: |
Hi, It looks like based on the IDLIST field that the merged entry you are looking at includes the entries with ID 7 from all six input VCFs. This doesn't really make sense based on what those entries look like, so I'm thinking that the merged VCF is somehow the Brian VCF merged with itself six times since it's the only sample with a heterozygous duplication with an ID of 7 at or near that position. What are the contents of the file you are passing to Jasmine as the file_list parameter? Relatedly, when merging SVs, the merged variant's position is taken from the first input file, and does not necessarily match the positions in the other entries that got merged with it. The IDLIST INFO field parameter can help with identifying which entries were merged to form each output variant. Melanie |
Oh, well... it is possible Let me see... The input files are
This is the head of output file:
So, seemingly, the first six lines of the output imply that Jasmine read six different files. But, just after that, did Jasmine start only analyzing Brian (the last sample)? |
Hi, Thanks for the detailed information! It is indeed a bug in Jasmine - in the preprocessing step of converting duplications to insertions, it writes the updated files to its output directory but preserves the old filenames. Since all of the base filenames are the same, it writes them all to the same file and they overwrite each other. I'll work on a fix for it, but in the meantime, it should work correctly if you rename the input files to be different from each other (e.g., LLsal_minimap.SVs.phased.vcf and so on). Sorry for the trouble, and I hope that fixes it! |
Oh, that makes sense!!! Thank you very much! |
The issue has been fixed as of the latest commit: b0ca6a3 It will probably take a few days for the new release v1.1.1 to make it to bioconda though, so for now you could either rename the files as I mentioned before, or download Jasmine from source. Thanks again for pointing this out! |
Sorry again - Looks like the new Jasmine does not recognize Samtools (at least Samtools is ready in my base environment).
Exception in thread "main" java.io.IOException: Cannot run program "samtools": error=2, No such file or directory |
Thank you for bringing this to my attention! It looks like I hadn't added samtools or Iris as dependencies for Jasmine in bioconda. That's in the process of being added now, but in the meantime running Melanie |
Hi, thank you very much for developing a nice tool. I have two questions regarding Jasmine.
I used Bam -> Sniffle -> Jasmine to obtain a master VCF file with multiple individuals.
(1) Altered genotype?
This locus looks like all heterozygous in six individuals after Jasmine VCF., but in IGV of the Bam files. It looks like del/del in one and ref/ref in another. How can we interpret it? Am I misinterpreting the VCF result?
I first asked the Sniffle group, but they answered that Jasmine can alter genotypes - do you have any idea how it happens?
(2) Translocations
Translocations cannot be indexed by Tabix, because the described “ending position" is smaller than the starting position.
Currently, I remove translocations from the VCF file. If you are nicely handling translocations with Tabix, please let me know how to do so.
Thank you very much for your help!
The text was updated successfully, but these errors were encountered: