You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please make sure that this is a bug. As per our GitHub Policy, we only address code/doc bugs, performance issues, feature requests and build/installation issues on GitHub. tag:bug_template
System information
Have I written custom code (as opposed to using a stock example script provided in TensorFlow):
OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Redhat 7
TensorFlow installed from (source or binary): source
TensorFlow version (use command below): 1.15.3+nv20.07
Python version: 3.8.5
Bazel version (if compiling from source): 0.26.1
GCC/Compiler version (if compiling from source):9.3.0
CUDA/cuDNN version:Cuda 11.0.207 cuDNN 8.0.1.13
GPU model and memory: Nvidia A100
Describe the current behavior
(...) external/local_config_cuda/crosstool/clang/bin/crosstool_wrapper_driver_is_not_gcc -shared -o bazel-out/k8-py2-opt/bin/tensorflow/python/_tf_stack.so '-Wl,-rpath,$ORIGIN/,-rpath,$ORIGIN/..' -Wl,--version-script bazel-out/k8-py2-opt/bin/tensorflow/python/_tf_stack-version-script.lds -Wl,-no-as-needed -Wl,-z,relro,-z,now '-Wl,--build-id=md5' '-Wl,--hash-style=gnu' -no-canonical-prefixes -fno-canonical-system-headers -B/usr/bin -Wl,--gc-sections -Wl,@bazel-out/k8-py2-opt/bin/tensorflow/python/_tf_stack.so-2.params)
Execution platform: @bazel_tools//platforms:host_platform
/usr/bin/ld.gold: --push-state: unknown option
/usr/bin/ld.gold: use the --help option for usage information
collect2: error: ld returned 1 exit status```
**Describe the expected behavior**
Fixed ld.gold versions work
**Code to reproduce the issue**
Compile with standard Redhat's GCC
**Other info / logs**
The solution is to use a properly patched libtool, and remove the hardcoded path to /usr/bin. The fine gentlemen of the EasyBuild project have a patch that does exactly this: https://github.com/easybuilders/easybuild-easyconfigs/blob/master/easybuild/easyconfigs/t/TensorFlow/TensorFlow-1.13.1_remove_usrbin_from_linker_bin_path_flag.patch
The text was updated successfully, but these errors were encountered:
someone at NVidia can't fix a bug where things are hardcoded.
It is more than that. nvidia-tensorflow is build using Ubuntu 18.04 system so all dependencies - linked, libraries, and their versions depend on this.
Making it distribution independent is not an easy task - you can read more about this in PEP. We are aware of this limitation, we have a few ideas on how to address it but it is hard to commit to any official timeline.
Still, you can use a containerized environment.
Please make sure that this is a bug. As per our GitHub Policy, we only address code/doc bugs, performance issues, feature requests and build/installation issues on GitHub. tag:bug_template
System information
Describe the current behavior
The text was updated successfully, but these errors were encountered: