
Create CPU only Version #3

Closed
sguada opened this issue Nov 25, 2013 · 16 comments

@sguada
Contributor

sguada commented Nov 25, 2013

No description provided.

@ghost ghost assigned sguada Nov 25, 2013
@Yangqing
Member

IMHO I wouldn't put this at a high priority. It would also be fairly nontrivial to implement, requiring a lot of #ifdef macros as well as a substantial rewrite of the syncedmem class. We would also need to consider how caffe::Caffe::mode should behave... All of this might make the codebase rather messy, so I am not sure it is the right move - at the end of the day, it probably does not hurt too much to just ship with the CUDA libraries - unlike MKL, they are freely redistributable.

@sguada
Contributor Author

sguada commented Nov 26, 2013

I agree that it is not high priority, but I think that separating the CPU and GPU parts of the code would help maintenance and would eventually make it possible to build a CPU-only version without needing CUDA to compile or run it.

@Yangqing
Member

The current code actually allows one to run without a physical GPU (as long as the cuda runtime is distributed - again, that is freely allowed), which is what I am planning to deploy on the ICSI cluster.

For developers working on caffe, I assume they will at least have the cuda compilers.

Yangqing


@tdomhan
Contributor

tdomhan commented Jan 23, 2014

On Mac OS X 10.9 the cuda libraries are linked against libstdc++, while everything else on the system is linked against libc++ by default, due to the switch from gcc to clang. As a result, compiling caffe on OS X 10.9 is a huge mess right now: you need to make sure that every library you link against is itself linked against libstdc++, which mostly means compiling all of those libraries yourself with the correct flags. The worst part is that you won't notice during compilation, but only when you try to run the program afterwards.
If there were a CPU-only version, you could at least compile the CPU code without hassle, until nvcc works with libc++ instead of libstdc++.

@junwang4

Hi tdomhan,
Have you succeeded in installing it on Mac OS X 10.9? I got an error:

clang: error: unsupported option '-dumpspecs'

and then compilation stops.

@shelhamer
Member

The OS X 10.9 situation hasn't changed since @tdomhan's nice summary of the issue. OS X 10.9 is not currently a feasible compilation target for GPU mode. The -dumpspecs error you mention is due to CUDA/clang incompatibility, but it is not currently possible to compile with gcc either.

For CUDA compatibility on OS X it seems to mostly be a matter of waiting for a new version of CUDA to ship that links to libc++.

@sguada
Contributor Author

sguada commented Feb 1, 2014

The problem with compiling with gcc seems to be that, at link time, it detects duplicate functions between the cpp and the cuda code. If you figure out how to fix that, then we will be a bit closer to compiling with gcc on OS X 10.9:

duplicate symbol caffe::LRNLayer::LRNLayer(caffe::LayerParameter const&) in:
src/caffe/layers/lrn_layer.o
src/caffe/layers/lrn_layer.cuo

Sergio


@tdomhan
Contributor

tdomhan commented Feb 1, 2014

I managed to build and run caffe under Mac OS X. I'll post the details later. It's a little hacky of course.

@tdomhan
Contributor

tdomhan commented Feb 5, 2014

Alright, I finally had time to add instructions for OS X 10.9, see: f0f594c
Hope this helps.

@junwang4

junwang4 commented Feb 7, 2014

Thanks, Tobias! Following your instructions, I succeeded in installing Caffe on OS X 10.9.

I did a test on the MNIST demo, but found that the GPU setting (running time: 275s) has no advantage over the CPU setting (running time: 284s) on the latest iMac. I only changed the setting "solver_mode: 1/0" in data/lenet_solver.prototxt. Do you have any comparison between the GPU and the CPU settings?
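For reference, this is the switch being toggled - a hedged excerpt of what the solver prototxt would look like, assuming the numeric encoding mentioned above (later Caffe releases accept the symbolic names CPU/GPU instead):

```
# data/lenet_solver.prototxt (excerpt; other solver fields elided)
# solver mode: 0 = CPU, 1 = GPU
solver_mode: 1
```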

@Yangqing
Member

Yangqing commented Feb 7, 2014

The MNIST demo is not likely to show the advantage of GPU over CPU: the model is very small, and the overhead of e.g. data transfer and CPU-side control code is large enough that GPU and CPU take approximately the same time. For larger models like ImageNet the GPU advantage will become clearer.

Yangqing


@junwang4

junwang4 commented Feb 7, 2014

Yangqing,
Thanks for your clarification! Is there any comparison data of the running time between the CPU vs. GPU on ImageNet or CIFAR?

@Yangqing
Member

Yangqing commented Feb 7, 2014

I don't have a detailed analysis yet. On ImageNet with GPUs, a full forward+backward pass using Alex Krizhevsky's network takes about 7ms, and forward only takes about 2.5ms, on my desktop with a K20 (when computations are carried out in batched fashion). CPUs are roughly 10 times slower than that, but given the variety of specific CPU types it is hard to say exactly what the speed is.

Yangqing


@junwang4

junwang4 commented Feb 7, 2014

@kloudkl
Contributor

kloudkl commented Feb 10, 2014

A not very accurate benchmark of the CPU and GPU modes was done in #85. Even though the result is not fair to the GPU, it still shows a great speed advantage.

@shelhamer
Member

Done in #561!

mitmul pushed a commit to mitmul/caffe that referenced this issue Sep 30, 2014
Split source files between CUDA and CPU code. Pave the way for BVLC#3 and BVLC#122.
puzzledqs referenced this issue in puzzledqs/caffe Oct 8, 2014
mtourne pushed a commit to mtourne/caffe that referenced this issue Jun 27, 2016
dtmoodie referenced this issue in dtmoodie/caffe Oct 11, 2016
Add include and lib required for building with mxGPUArray support
coder-james pushed a commit to coder-james/caffe that referenced this issue Nov 28, 2016
mbassov added a commit to mbassov/caffe that referenced this issue Aug 28, 2017
dillonfzw added a commit to bluemindor/caffe that referenced this issue Dec 3, 2017
cepiross pushed a commit to cepiross/caffe that referenced this issue May 13, 2018
Fixed test exec error: lrn_ristretto_layer.cpp:16] LRN layer only supports minifloat
dkoes added a commit to gnina/caffe that referenced this issue Jun 4, 2018
twmht pushed a commit to twmht/caffe that referenced this issue Aug 20, 2018
fzd9752 pushed a commit to fzd9752/caffe that referenced this issue Mar 13, 2019

7 participants