Use local copy of VEP config #231
wait, but the point of the requester pays flag there is because if you start a cluster with |
oh or is the issue if you do a bucket other than |
The requestor pays arguments to The VEP data is downloaded using In the example, the requestor pays flag was necessary because our code loaded VEP config directly from I did start a cluster with this branch and was able to run VEP on an imported VCF.
|
This is why I wanted to get your review. It looks like Hail always requires you to provide configuration. There is no default value for the And yet, some recently added documentation claims
|
You were able to run it without a requester pays flag? I think that's probably not intentional behavior on Hail's part. @johnc1231? |
Yes. The init script downloads VEP data using gsutil, which is not affected by the
I don't think it would make sense to always require an additional requester pays argument when starting a cluster with |
Hail is also moving to use |
To this end, do we even need a config anymore? Or should we just drop it and go to the default |
I think we should do that at some point. But |
Currently, VEP configuration is downloaded from Hail's US bucket.
gnomad_methods/gnomad/utils/vep.py
Lines 65 to 74 in 2090cbb
This bucket is requester pays, so reading from it requires that either --requester-pays-allow-all or --requester-pays-allow-buckets hail-us-vep is specified when starting the cluster. If someone isn't aware of that, it's easy to start a cluster (which takes a fair amount of time with VEP) that can't run the gnomAD VEP utilities.
hailctl dataproc's VEP init scripts download these configuration files and link them to /vep_data/vep-gcloud.json. If VEP configuration were loaded from the local copy downloaded by the init scripts instead of the hail-us-vep bucket, then the requester pays arguments would not be necessary.
Resolves #226
Resolves #211
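As a rough illustration of the idea in the description, the sketch below reads the VEP configuration from the local copy rather than from the bucket. The path /vep_data/vep-gcloud.json is taken from the description; the helper name read_local_vep_config and its error message are hypothetical, and this is not necessarily how the change in this PR is implemented.

```python
import json
import os

# Location the hailctl dataproc VEP init scripts link the config to,
# according to the PR description.
LOCAL_VEP_CONFIG = "/vep_data/vep-gcloud.json"


def read_local_vep_config(path: str = LOCAL_VEP_CONFIG) -> dict:
    """Load the VEP configuration from a local JSON file.

    Hypothetical helper for illustration only. Reading the local copy
    avoids any access to the requester pays bucket.
    """
    if not os.path.exists(path):
        # The file is only present on clusters started with VEP enabled.
        raise FileNotFoundError(
            f"No VEP config found at {path}; was the cluster started with VEP?"
        )
    with open(path) as f:
        return json.load(f)
```

Failing early with a clear message when the file is missing is one way to surface a cluster that was started without VEP, instead of hitting a bucket permission error later.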