New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ERROR Trying to train 3DSSD #426
Comments
It seems your cuda environment has something wrong. Please check the compatibility of your cuda, GPU driver and pytorch version. |
Thanks! I'll look into it. However, I strictly followed the installation steps. |
But there may still exist some problems when you need to decide which version of cuda/pytorch to be installed. You can check whether |
Does mmdet3D have specific requirements for cuda and pytorch version? am currently using cuda 11.0 and pytorch 1.7.1 |
Ah yes, if you use pytorch 1.7, maybe you need to update to the latest master or at least 0.12.0 because we fix some errors of |
I checked my versions and everything seems to be alright. Do you suggest using another pytorch and cuda version? |
Can you run |
Traceback (most recent call last): I reinstalled mmdet3d, but I get this error. However, I have mmdet3d in my env. |
After conda list: mmdet3d 0.12.0 dev_0 |
Try to run |
/home/rmoreira/.conda/envs/thesis/lib/python3.8/site-packages/torch/cuda/init.py:52: UserWarning: CUDA initialization: Found no NVIDIA driver on your system. Please check that you have an NVIDIA GPU and installed a driver from http://www.nvidia.com/Download/index.aspx (Triggered internally at /opt/conda/conda-bld/pytorch_1607370172916/work/c10/cuda/CUDAFunctions.cpp:100.)
TorchVision: 0.8.2 Since I am using a slurm server, I have to run each python script with srun. That is why in beginning it didn't find any GPU. |
Also, when I use python in cmd, I successfully import mmcv, mmdet and mmdet3d. |
Then please use srun to run the command. |
I got an error. However, I reinstalled mmdetedction3d and got a new error:
Thanks for your help. |
You need to use srun to compile mmcv/mmdet/mmdet3d. Otherwise, the local environment without GPU will only compile the lite CPU version packages. |
Ok. I'm running this command:
Thus, I need to reinstall mmcv and mmdet with srun right? Can I use srun pip....? |
Typically I use srun python -m pip install ... But you can have a try. |
Ok, I will try! Thank you! I'll soon give some feedback. |
@Tai-Wang I manage to run the demo.py! Thank you! I'll take this opportunity to show you the results. I found the first warning about incompatibility strange...
While I got this using PointPillars...
Thanks! |
Never mind about the first error. I forgot I changed the 3DSSD config file |
* fix ci * add nvidia key * remote torch * recover pytorch
* docs(docs/zh_cn): add doc and link checker * docs(REAME): update * docs(docs/zh_cn): update * docs(benchmark): update table * docs(zh_cn/benchmark): update link * CI(docs): update link check * ci(doc): update checker * docs(zh_cn): update * style(ci): remove useless para * style(ci): update * docs(zh_cn): update * docs(benchmark.md): fix mobilnet link error * docs(docs/zh_cn): add doc and link checker * docs(REAME): update * docs(docs/zh_cn): update * docs(benchmark): update table * docs(zh_cn/benchmark): update link * CI(docs): update link check * ci(doc): update checker * docs(zh_cn): update * style(ci): remove useless para * style(ci): update * docs(zh_cn): update * docs(benchmark.md): fix mobilnet link error * docs(zh_cn/do_regression_test.md): rebase * docs(docs/zh_cn): add doc and link checker * Update README_zh-CN.md * Update README_zh-CN.md * Update index.rst * Update check-doc-link.yml * [Fix] Fix ci (open-mmlab#426) * fix ci * add nvidia key * remote torch * recover pytorch * ci(codecov): ignore ci * docs(zh_cn): add get_started.md * docs(zh_cn): fix review advice * docs(readthedocs): update * docs(zh_CN): update * docs(zh_CN): revert * fix(docs): review advices * fix(docs): review advices * fix(docs): review Co-authored-by: q.yao <streetyao@live.com>
* refactor(onnx2ncnn.cpp): split it to shape_inference, pass and utils * refactor(onnx2ncnn.cpp): split it to shape_inference, pass and utils * refactor(onnx2ncnn.cpp): split code * refactor(net_module.cpp): fix build error * ci(test_onnx2ncnn.py): add generate model adn run * ci(onnx2ncnn): add ncnn backend * ci(test_onnx2ncnn): add converted onnx model` * ci(onnx2ncnn): fix ncnn tar * ci(backed-ncnn): simplify dependency install * ci(onnx2ncnn): fix apt install * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * Update backend-ncnn.yml * fix(ci): add include algorithm * Update build.yml * parent aa85760 author q.yao <streetyao@live.com> 1651287879 +0800 committer tpoisonooo <khj.application@aliyun.com> 1652169959 +0800 [Fix] Fix ci (open-mmlab#426) * fix ci * add nvidia key * remote torch * recover pytorch refactor(onnx2ncnn.cpp): split it to shape_inference, pass and utils * fix(onnx2ncnn): review * fix(onnx2ncnn): build error Co-authored-by: q.yao <streetyao@live.com>
Hello,
I had this error when trying to execute train.py for 3DSSD model:
Hope anyone has some knowledge about this.
Thanks in advance!
The text was updated successfully, but these errors were encountered: