-
Notifications
You must be signed in to change notification settings - Fork 74k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error polling for event status: failed to query event: CUDA_ERROR_MISALIGNED_ADDRESS #3224
Comments
@Waffleboy, what type of GPU do you have? |
@zheng-xq GeForce GTX 860M/PCIe/SSE2, thanks! |
Please look at my comment in the other thread, and see if that fixes your problem. Thanks. |
Hi, thanks for your reply! I ran ./configure to do what you said, but now i get this strange error:
I have double and triple checked that the files are there, and that it was cuDNN v4. Should i ignore this and select default? |
Normally the versions are like 4.0.7. You can check the name of the library for the version number. ls /usr/local/cuda-7.5/lib64/libcuda.so.* |
libcuda returns nothing, but libcudnn returns 2 files. Running with system default (ie, not manually typing v4) allows me to continue and pick 5.0 for compute capability. is this ok?
|
Either system default or 4.0.7 is fine. Please let us know whether that makes a difference for you. |
I can't even generate the bottlenecks now.. It instantly fails =/
|
@martinwicke, @vrv, have you seen this error before, on Ubuntu 16.04? ImportError: /storage/git/tensorflow/bazel-bin/tensorflow/examples/image_retraining/retrain.runfiles/org_tensorflow/tensorflow/python/_pywrap_tensorflow.so: undefined symbol: _ZNK6google8protobuf7Message11GetTypeNameEv From the following link, it seems to be a compiler version related issue. @Waffleboy, which gcc version do you have? |
I dowongraded to 4.9 to use another library. Is there a way to link tensorflow only to gcc5? |
In the "configure", you should be able to specify which version of gcc you want to use. |
Thanks, that worked :) |
I also meet this error many times recently. I'm using 4 old titanx cards to run tf benchmark code. I use the version from a patch #11392 . I'm using cuda 8.0 and cudnn 6.0 on ubuntu 16.04.
|
Though I know that Titan X doesn't support GPU Direct RDMA, but could you confirm from your log? Successful GDR initialisation will print a line of log like Reproducing the issue using gRPC will do the same work. |
I have meet same error when using official gRPC protocal. |
I tried gcc-4.9, but still got |
I have the same error: 2017-09-06 18:35:49.879762: E tensorflow/stream_executor/cuda/cuda_event.cc:49] Error polling for event status: failed to query event: CUDA_ERROR_ILLEGAL_ADDRESS nvidia driver version: 375.20 |
The same issue when running on AWS with DL AMI, python3.6, TF 1.8.0 |
Summary:
Trying inceptionv3, was working fine all the way until I downgraded gcc 5+ to gcc4.9 to use Theano with keras: following this example http://deeplearning.net/software/theano/install_ubuntu.html
Now hitting this error before training starts (bottlenecks generate fine) whenever i run
bazel-bin/tensorflow/examples/image_retraining/retrain --image_dir
Cant figure out the problem. Sidenote that might help: bottlenecks generated alot faster when i used gcc 4.9 instead, but now the training crashes and i cant even run.
Environment info
Operating System:
Ubuntu 16.04
Installed version of CUDA and cuDNN:
(please attach the output of
ls -l /path/to/cuda/lib/libcud*
):ls: cannot access '/path/to/cuda/lib/libcud*': No such file or directory
It's installed in /usr/local/cuda and /usr/local/cuda-7.5 instead.
CUDA 7.5, CuDNN v4.
Install steps:
CUDA:
bash cuda_7.5.18_linux.run --override
CUDNN:
Tried both:
and from here:
http://askubuntu.com/questions/767269/how-can-i-install-cudnn-on-ubuntu-16-04
If installed from binary pip package, provide:
python -c "import tensorflow; print(tensorflow.__version__)"
.If installed from sources, provide the commit hash:
Steps to reproduce
What have you tried?
1.literally every other stack overflow / github question. eg, #2810
2.reinstalling cuda 7.5 and cudnn v4, running ./configure. no luck.
Logs or other output that would be helpful
(If logs are large, please upload as attachment).
The text was updated successfully, but these errors were encountered: