Issues with VEP Script #18
-
Title: Using Hugging Face Transformers

Hello, I hope this post finds you well. I'm currently facing challenges running the VEP inference script. When attempting to run it on GPU, I hit an error related to mixed precision training, so I'm interested in exploring the possibility of running the code on CPU instead. This is the error I get:

ValueError: FP16 Mixed precision training with AMP or APEX (
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:

I've successfully run the simple_example script, indicating that the library is working well in my environment. Thank you in advance for any guidance or insights you can provide!
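(For anyone hitting the same issue: below is a minimal sketch of forcing a CPU-only run, an assumed workaround not taken from this thread. It hides all CUDA devices before torch is imported; note that the fp16 flag must also be disabled, as the reply below explains, since Hugging Face Transformers rejects fp16 on CPU.)

```python
# Hypothetical sketch: force CPU execution by hiding CUDA devices before
# torch is imported. This alone does not fix the fp16 error; the fp16
# argument must also be turned off (see the reply below).
import os
os.environ["CUDA_VISIBLE_DEVICES"] = ""

import torch
print(torch.cuda.is_available())  # False: computations fall back to CPU
```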
Replies: 2 comments
-
Hi, sorry for your troubles! fp16 should certainly be an optional argument. For now, you can remove this line or set it to False: gpn/gpn/msa/inference.py, line 39 in 05b23c5. Additionally, in the notebook you might need to replace `{torchrun_path} --nproc_per_node={n_gpu}` with `{python_path}` (which could just be `python`). Let me know if you get it to work.
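(Editorial aside: a minimal sketch of what "making fp16 optional" could look like, assuming the inference script builds a Hugging Face TrainingArguments object. The argument names below are standard Transformers parameters; the output directory and batch size are hypothetical placeholders.)

```python
# Hypothetical sketch: enable fp16 only when a CUDA device is available,
# so the same inference setup works on both GPU and CPU.
import torch
from transformers import TrainingArguments

use_cuda = torch.cuda.is_available()

training_args = TrainingArguments(
    output_dir="vep_results",        # hypothetical output directory
    per_device_eval_batch_size=8,    # hypothetical batch size
    fp16=use_cuda,                   # mixed precision only on GPU; False on CPU
)
```

For a single-process CPU run, the launch command in the notebook would likewise drop the torchrun wrapper, i.e. invoke the script with plain `python` instead of `{torchrun_path} --nproc_per_node={n_gpu}`, as described above.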
-
Hello gonzalo,