-
Notifications
You must be signed in to change notification settings - Fork 216
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
vcf2maf can not report VEP custom database annotation? #274
Comments
I'm running into a similar issue when trying to add additional columns from VCF files generated from VEP utilizing dbNSFP plugin. It seems like |
I've created pull request #282 which addresses this issue because it's one we've also run into. The issue is that VEP saves your custom annotations into the
In @tangwei1129 's example, you'd specify I'm not familiar with the output from the dbNSFP plug-in. @djb17, can you post a simple example? |
Hi Alan,
Super helpful. Much appreciated.
should I download a new version of vcf2maf to test the functionality?
Thanks
Alan Hoyle ***@***.***> 于2021年4月14日周三 下午12:22写道:
… I've created a pull request: #282
<#282> which addresses this issue
because it's one we've also run into.
The issue you're running into is that VEP saves your annotations into the
INFO column, but only inside the CSQ= subsection. VCF2MAF doesn't handle
those in its default configuration, as it hardcodes the CSQ fields into
an @ann_cols variable. The pull request addresses this by doing the
following:
1. adding --vep-custom STRING and --vep-config FILE options that get
passed through into subprocess like: vep --custom STRING --config FILE
subprocess
2. adding --retain-ann COMMA,SEP,VALUES which allows you to specify
which CSQ values to pass into the MAF.
In @tangwei1129 <https://github.com/tangwei1129> 's example, you'd
specify --vep-custom
/path/VEP/customDB/gnomad.genomes.r2.1.1.sites.liftover_grch38.vcf.gz,gnomADg,vcf,exact,0,AF_AFR,AF_AMR,AF_ASJ,AF_EAS,AF_FIN,AF_NFE,AF_OTH,variant_type,segdup
--retain-ann
gnomADg_AF_AFR,gnomADg_AF_AMR,gnomADg_AF_ASJ,gnomADg_AF_EAS,gnomADg_AF_FIN,gnomADg_AF_NFE,gnomADg_AF_OTH,gnomADg_variant_type,gnomADg_segdup
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#274 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AB4EHU42OBB6SOOIDLCONULTIW6MVANCNFSM4UBFMOUA>
.
|
I'm not 100% certain, but I think the easiest way would be to download my forked version to test it out. I'm not an expert in github, so there's probably some way that you could download and test from the pull request. Note that I'm not one of the main developers/maintainers, but we use it at our center. Link to my fork: |
thank you, i will test your fork version.
Alan Hoyle ***@***.***> 于2021年4月14日周三 下午1:56写道:
… I'm not 100% certain, but I think the easiest way would be to download my
forked version to test it out. I'm not an expert in github, so there's
probably some way that you could download and test from the pull request.
Note that I'm not one of the main developers/maintainers, but we use it at
our center.
Link to my fork:
https://github.com/alanhoyle/vcf2maf
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#274 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AB4EHU7GWBLKVFPUQIX2OL3TIXJM3ANCNFSM4UBFMOUA>
.
|
@tangwei1129 , any feedback on my fork/pull request? #282 Did it work for you? |
No, not yet.
Previously, we tweeted a little bit down the pipeline to make it work.
I will keep you posted when we go back to change it.
Many thanks.
Alan Hoyle ***@***.***> 于2021年4月19日周一 下午4:49写道:
… @tangwei1129 <https://github.com/tangwei1129> , any feedback on my
fork/pull request? #282 <#282> Did
it work for you?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#274 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AB4EHU4HEP6FPNOIID2Q2VTTJSJPNANCNFSM4UBFMOUA>
.
|
I tested #282 successfully as follows and merged it into the perl vcf2maf.pl --ncbi-build GRCh38 --input-vcf tests/test_b38.vcf --output-maf test_b38.vep.maf --vep-custom tests/test_b38.gnomad.exomes.r2.1.1.sites.vcf.gz,gnomAD,vcf,exact,,AC --retain-ann gnomAD_AC --ref-fasta ~/.vep/homo_sapiens/102_GRCh38/Homo_sapiens.GRCh38.dna.toplevel.fa.gz |
I have tested it worked. |
I don't think my implementation will handle multiple |
Great additions. |
I have a question about VCF2MAF, I used VEP to annotate the VCF file with new annotation of gnomAD genomes data.
$ vep -i Sample_15120+Sample_15121.FINALmutect2.vcf --vcf -o example.vcf --offline --cache --force_overwrite --fork 16 --dir_cache /path/VEP/customDB --species human --assembly GRCh38 --fasta customDB/GRCh38.d1.vd1.fa --af_gnomad --custom /path/VEP/customDB/gnomad.genomes.r2.1.1.sites.liftover_grch38.vcf.gz,gnomADg,vcf,exact,0,AF_AFR,AF_AMR,AF_ASJ,AF_EAS,AF_FIN,AF_NFE,AF_OTH,variant_type,segdup
And the output vcf has the information with the gnomad genome info.
Then I used the vcf2maf to convert to maf. But those information from gnomad genome did not show up
$ vcf2maf.pl --input-vcf example.vcf --output-maf example.maf --inhibit-vep --ref-fasta customDB/GRCh38.d1.vd1.fa --tumor-id Sample_15120 --normal-id Sample_15121 --vep-forks 16
Even I used the –retain-info to specify the INFO I want, it only added to the header, but no information still for last three columns.
$ vcf2maf.pl --input-vcf example.vcf --output-maf example1.maf --inhibit-vep --ref-fasta customDB/GRCh38.d1.vd1.fa --tumor-id Sample_15120 --normal-id Sample_15121 --vep-forks 16 --retain-info gnomADg,gnomADg_variant_type,gnomADg_segdup
but if I --retain-info CSQ, it will product all the information without separation.
Do you know how to fix the problem to have custom database shown up in the MAF file?
Thank you very much,
Wei
The text was updated successfully, but these errors were encountered: