Install DALI from source got error "file format not recognized" and "syntax error" #5269

Open
1 task done
NaufalRezkyA opened this issue Jan 7, 2024 · 3 comments
Labels
question Further information is requested

Comments

NaufalRezkyA commented Jan 7, 2024

Describe the question.

Hi, I tried to install DALI from source but got an error while building it. Any suggestions? Thank you.
Machine: Ubuntu 20.04, CUDA 11.2, Docker 24.0.7

Commands I used:

git clone https://github.com/NVIDIA/DALI.git
git submodule sync --recursive
git submodule update --init --recursive
sudo ./build.sh

Error while building DALI:

.....
[100%] Built target lint
++ grep '^processor' /proc/cpuinfo
++ wc -l
+ make -j48
[  0%] Built target nvcv_util_compat
[  0%] Built target cocoapi
[  0%] Built target create_nvcv_types_exports_file
[  0%] Built target create_cvcuda_exports_file
[  0%] Built target DALI_PROTO
[  1%] Built target dynlink_cuda
[  1%] Built target nvcv_util_sanitizer
[  1%] Built target CAFFE2_PROTO
[  2%] Built target TF_PROTO
[  2%] Built target CAFFE_PROTO
[  3%] Built target ffts_static
[  3%] Built target NVCUVID_GEN
[  3%] Built target gtest
[  4%] Built target cvcuda_version_file
[  4%] Built target dynlink_nvml
[  6%] Built target benchmark
[  6%] Built target nvcv_util
[  8%] Built target dali_core
[ 11%] Built target nvcv_types_priv
[ 11%] Linking CXX shared library ../../../../lib/libnvcv_types.so
/opt/rh/devtoolset-10/root/usr/libexec/gcc/x86_64-redhat-linux/10/ld:/opt/dali/third_party/cvcuda/src/util/stubs/libdl-2.17_stub.so: file format not recognized; treating as linker script
/opt/rh/devtoolset-10/root/usr/libexec/gcc/x86_64-redhat-linux/10/ld:/opt/dali/third_party/cvcuda/src/util/stubs/libdl-2.17_stub.so:1: syntax error
collect2: error: ld returned 1 exit status
make[2]: *** [lib/libnvcv_types.so.0.4.0] Error 1
make[1]: *** [third_party/cvcuda/src/nvcv_types/CMakeFiles/nvcv_types.dir/all] Error 2
make: *** [all] Error 2

Check for duplicates

  • I have searched the open bugs/issues and have found no duplicates for this bug report
@NaufalRezkyA NaufalRezkyA added the question Further information is requested label Jan 7, 2024
Contributor

JanuszL commented Jan 8, 2024

Hi @NaufalRezkyA,

Thank you for reporting this. This is an unfortunate implicit dependency of cv-cuda. Before cloning the DALI repo, please install Git LFS, because some cv-cuda source artifacts are stored with it.
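For anyone hitting the same linker error, here is a minimal sketch of how one might check whether a checkout still contains Git LFS pointer files instead of real binaries. It relies on two facts: an LFS pointer is a small text file that starts with `version https://git-lfs.github.com/spec/v1`, and an ELF shared library starts with the bytes `\x7fELF`. The function names `classify` and `find_lfs_pointers` are hypothetical helpers written for this illustration; they are not part of DALI or cv-cuda. That the `.so` stubs under `third_party/cvcuda` are the affected files is an assumption based on the path in the error log above.

```python
from pathlib import Path

# First bytes of a Git LFS pointer file (a small text stand-in for the real blob).
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"
# Magic number at the start of every ELF binary, including shared libraries.
ELF_MAGIC = b"\x7fELF"


def classify(path):
    """Return 'lfs-pointer', 'elf', or 'other' based on the file's first bytes."""
    with open(path, "rb") as f:
        head = f.read(len(LFS_POINTER_PREFIX))
    if head.startswith(LFS_POINTER_PREFIX):
        return "lfs-pointer"
    if head.startswith(ELF_MAGIC):
        return "elf"
    return "other"


def find_lfs_pointers(root, pattern="*.so"):
    """List files under root that match pattern and are still LFS pointers."""
    return sorted(
        p
        for p in Path(root).rglob(pattern)
        if p.is_file() and classify(p) == "lfs-pointer"
    )
```

Calling `find_lfs_pointers("third_party/cvcuda")` from the DALI source root should return an empty list once Git LFS is installed and the repository has been cloned again. Any path it does return is a text pointer, which would explain why `ld` reports "file format not recognized" and then a syntax error on line 1.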

@jantonguirao jantonguirao assigned JanuszL and unassigned jantonguirao Jan 8, 2024
@NaufalRezkyA
Author

Hi @JanuszL, thanks, it works now after I installed git lfs and cloned the repository again.

But how do I access the system after the installation is finished?
I tried to access the image like this, but there is no dali or cuda in the image (I also tried the tags cu123_x86_64.deps, cuda123_x86_64.toolkit, and x86_64.deps):

cc@naufal-gpufs-dali2-rtx:~$ docker container run -it nvidia/dali:cu123_x86_64.build /bin/bash
[root@72d834abfbc6 dali]# python
Python 3.8.18 (default, Jan  8 2024, 07:39:02) 
[GCC 10.2.1 20210130 (Red Hat 10.2.1-11)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import nvidia.dali
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ModuleNotFoundError: No module named 'nvidia'
>>> exit()
[root@72d834abfbc6 dali]# nvidia-smi
bash: nvidia-smi: command not found
REPOSITORY    TAG                      IMAGE ID       CREATED        SIZE
nvidia/dali   cu123_x86_64.build       98e04cb8b610   4 hours ago    13.8GB
nvidia/dali   cu123_x86_64.deps        839b5e412643   4 hours ago    13.7GB
nvidia/dali   cuda123_x86_64.toolkit   ff83953c31df   4 hours ago    7.79GB
nvidia/dali   x86_64.deps              a6a97a9147a8   4 hours ago    6.41GB
hello-world   latest                   d2c94e258dcb   8 months ago   13.3kB

Contributor

JanuszL commented Jan 8, 2024

Hi @NaufalRezkyA,

By default, the build process mounts your DALI source code directory inside the container, so the build artifacts should end up in a build-docker-* directory within that source tree, if that is what you are looking for.
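As a rough sketch of how one might locate those artifacts, the snippet below searches every `build-docker-*` directory under the source root for wheel files. It assumes that the build leaves `.whl` files somewhere beneath those directories, which this thread does not confirm. `find_wheels` is a hypothetical helper name, not a DALI utility.

```python
from pathlib import Path


def find_wheels(source_root="."):
    """Collect *.whl files found under any build-docker-* directory."""
    root = Path(source_root)
    wheels = []
    for build_dir in sorted(root.glob("build-docker-*")):
        if build_dir.is_dir():
            # Search recursively: the exact layout below build-docker-* is
            # an assumption, so no fixed subdirectory is hard-coded here.
            wheels.extend(sorted(build_dir.rglob("*.whl")))
    return wheels
```

Running `find_wheels()` from the DALI source directory on the host returns the matching paths, or an empty list if the build produced no wheels there. A wheel found this way could then be installed with `pip install` in an environment that has a matching Python and CUDA setup.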

3 participants