-
Notifications
You must be signed in to change notification settings - Fork 18.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ubuntu 14.04, CUDA 7.0: make runtest core dumped #2432
Comments
Note: there are other failures prior to the 'make runtest' core dumped, they all seem to be related to caffe::FloatGPU or DoubleGPU: |
FWIW Similar situation (ubuntu 14.04, CUDA 7.0, cudnn-6.5-linux-x64-v2, default ATLAS. [----------] 8 tests from CuDNNNeuronLayerTest/0, where TypeParam = float |
OK, in my case the issue was my own fault. Looking around a bit more, someone indicated the invalid device comment was a dependency between CUDA and the GPU. I misread the comments in the Makefile.config. Although I had installed Cuda 7.0, I comment out the -gencode arch-compute_50 lines becaue my GPU only supports Cuda 5.2. Once I put those back in all tests passed. |
Please ask usage questions on the caffe-users list. Thanks! |
Hi dberwick, Hi Ning, Can you please kindly advise if there's a fix or workaround available? |
Refer to http://caffe.berkeleyvision.org/installation.htmlContributions simplifying and improving our build system are welcome!cuDNN acceleration switch (uncomment to build with cuDNN).USE_CUDNN := 1 CPU-only switch (uncomment to build without GPU support).CPU_ONLY := 1To customize your choice of compiler, uncomment and set the following.N.B. the default for Linux is g++ and the default for OSX is clang++CUSTOM_CXX := g++CUDA directory contains bin/ and lib/ directories that we need.CUDA_DIR := /usr/local/cuda On Ubuntu 14.04, if cuda tools are installed via"sudo apt-get install nvidia-cuda-toolkit" then use this instead:CUDA_DIR := /usrCUDA architecture setting: going with all of them.For CUDA < 6.0, comment the *_50 lines for compatibility.CUDA_ARCH := -gencode arch=compute_20,code=sm_20 BLAS choice:atlas for ATLAS (default)mkl for MKLopen for OpenBlasBLAS := atlas Custom (MKL/ATLAS/OpenBLAS) include and lib directories.Leave commented to accept the defaults for your choice of BLAS(which should work)!BLAS_INCLUDE := /path/to/your/blasBLAS_LIB := /path/to/your/blasThis is required only if you will compile the matlab interface.MATLAB directory should contain the mex binary in /bin.MATLAB_DIR := /usr/localMATLAB_DIR := /Applications/MATLAB_R2012b.appNOTE: this is required only if you will compile the python interface.We need to be able to find Python.h and numpy/arrayobject.h.PYTHON_INCLUDE := /usr/include/python2.7 Anaconda Python distribution is quite popular. Include path:Verify anaconda location, sometimes it's in root.ANACONDA_HOME := $(HOME)/anacondaPYTHON_INCLUDE := $(ANACONDA_HOME)/include \
We need to be able to find libpythonX.X.so or .dylib.PYTHON_LIB := /usr/lib PYTHON_LIB := $(ANACONDA_HOME)/libUncomment to support layers written in Python (will link against Python libs)WITH_PYTHON_LAYER := 1Whatever else you find you need goes here.INCLUDE_DIRS := $(PYTHON_INCLUDE) /usr/local/include Uncomment to use
|
I got it working now. I did nothing new except rebooting my Ubuntu desktop, everything started to work after that! Thanks, |
ubuntu 14.04, CUDA 7.0, cudnn-6.5-linux-x64-v2, 2xGeForce GTX 970 with 4G each, default ATLAS.
caffe master cloned; make all, make test successful, however, make runtest core dumped (see below):
Any idea? How to workaround this?
...
[----------] 3 tests from DeconvolutionLayerTest/3, where TypeParam = caffe::DoubleGPU
[ RUN ] DeconvolutionLayerTest/3.TestSetup
[ OK ] DeconvolutionLayerTest/3.TestSetup (0 ms)
[ RUN ] DeconvolutionLayerTest/3.TestSimpleDeconvolution
[ OK ] DeconvolutionLayerTest/3.TestSimpleDeconvolution (0 ms)
[ RUN ] DeconvolutionLayerTest/3.TestGradient
F0508 13:09:00.207708 13415 math_functions.cu:81] Check failed: error == cudaSuccess (4 vs. 0) unspecified launch failure
*** Check failure stack trace: ***
@ 0x2afa1b93bdaa (unknown)
@ 0x2afa1b93bce4 (unknown)
@ 0x2afa1b93b6e6 (unknown)
@ 0x2afa1b93e687 (unknown)
@ 0x2afa1d390478 caffe::caffe_gpu_memcpy()
@ 0x2afa1d33d18e caffe::SyncedMemory::gpu_data()
@ 0x2afa1d33e472 caffe::Blob<>::gpu_data()
@ 0x2afa1d372861 caffe::DeconvolutionLayer<>::Forward_gpu()
@ 0x456e91 caffe::Layer<>::Forward()
@ 0x45a555 caffe::GradientChecker<>::CheckGradientSingle()
@ 0x45d39b caffe::GradientChecker<>::CheckGradientExhaustive()
@ 0x6e3ccc caffe::DeconvolutionLayerTest_TestGradient_Test<>::TestBody()
@ 0x721b83 testing::internal::HandleExceptionsInMethodIfSupported<>()
@ 0x7187c7 testing::Test::Run()
@ 0x71886e testing::TestInfo::Run()
@ 0x718975 testing::TestCase::Run()
@ 0x71bcb8 testing::internal::UnitTestImpl::RunAllTests()
@ 0x71bf47 testing::UnitTest::Run()
@ 0x44607a main
@ 0x2afa1df61ec5 (unknown)
@ 0x44b3a9 (unknown)
@ (nil) (unknown)
make: *** [runtest] Aborted (core dumped)
The text was updated successfully, but these errors were encountered: