-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how to unleash my full GPU power #7
Comments
In my environment (RTX3090 x 2),
|
yes, both two of my GPUs are detected, same code. But whatever i input a <1000 sequence or a ~1200 sequence, only one GPU's memory is fullfilled, the other is cold. And when a ~1200 sequence is uploaded, there would be a 'ResourceExhausted' error. |
There is a discussion about the use of multiple GPUs, google-deepmind/alphafold#149. #!/bin/bash
export TF_FORCE_UNIFIED_MEMORY=1
export XLA_PYTHON_CLIENT_MEM_FRACTION=4.0
colabfold-conda/bin/python3.7 runner.py
|
I'm using g4dn.12xlarge instance on AWS.
|
I'm completely ignorant about AWS, but which OS does the AWS use, Linux or Windows? |
Thank you for your reply. I’m using a Deep Learning image named |
Adding an environment variable |
|
i noticed that 'runner.py' seems to use both of the GPUs, but why only NO.0 GPU memory is fulfilled ? |
The first line here was the trick for me on top of other things in this thread. In total:
Taken from what's in run_docker.py from alphafold:
With 2x1080TI that was OK for a 1500-residue tetramer plus running another 400-residue prediction at the same time... however the load wasn't split between the two GPUs evenly and it was one of those alphafold predictions that gives two monomers with nearly identical coordinates and blows up the relaxation step! |
Hi, there. I have 2 RTX2080Ti(11G) GPUs with CUDA-11.2, when i input a sequence less than 1000 amino acids, it can run normally but only one of my GPU works. When i tried a ~1200 sequence or complex it will throw 'ResourceExhausted' error. So the problem is how do i let all my GPUs work and be able to calculate larger sequence or complex?
The text was updated successfully, but these errors were encountered: