Troubleshooting

The current CUDA 11 instructions need a quite recent glibc version (>=2.29). Not all systems provide such a recent version. However, you can manually compile it and force crYOLO to use it:

Download and compile a recent glibc (>= 2.29)

bash $

wget http://ftp.gnu.org/gnu/libc/glibc-2.34.tar.xz tar xvf glibc-2.34.tar.xz mkdir glibc-2.34/build cd glibc-2.34/build sudo mkdir /opt/glibc-2.34 ../configure --prefix=/opt/glibc-2.34 make -j 8 sudo make install

Add environment variable for the cryolo environment ( I assume the environment name is "cryolo"):

bash $

conda activate cryolo conda env config vars set LD_PRELOAD=/opt/glibc-2.34/lib/libm.so.6

Reload your environment

bash $

conda deactivate conda activate cryolo

Now you should be able to run cryolo with CUDA 11.

Thanks to Wolfgang Lugmayr for the instructions!

Note

Since crYOLO 1.7.4 this problem is solved. Multithreading replaced multiprocessing.

On some machines crYOLO freezes during or at the end of training. The problem comes together with multiprocessing and is deeply in one of the libraries we use. You can solve it by using multithreading instead of multiprocessing. There for you can either use the option --use_multithreading or make it a permanent change by changing the environment variables in your crYOLO environment:

bash $

conda activate cryolo conda env config vars set CRYOLO_MP_START="fork" conda env config vars set CRYOLO_USE_MULTITHREADING="True"

You need to reactivate your environment to make the changes working by

bash $

conda activate cryolo

Now you use multithreading instead of multiprocessing.

crYOLO has memory problems

crYOLO can crash during training because of memory problems. In those cases you can try the following:

Reduce the batch_size. I recommend to reduce it by 1 stepwise. I would not choose a value below 3. You find the batch_size in your configuration file or in the Training options tab of the config Action
Reduce the input_size. Instead of 1024 you can choose any multiple of 32. Therefore 31*32=992 would next smaller input size. Don't go too low (< 768) as you might become problem with very small particles. You find the input_size in your configuration file or in the Model options tab of the config Action.

I need more help

Find help at our mailing list!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

troubleshooting.rst

troubleshooting.rst

Troubleshooting

crYOLO has memory problems

I need more help

Files

troubleshooting.rst

Latest commit

History

troubleshooting.rst

File metadata and controls

Troubleshooting

crYOLO has memory problems

I need more help