This is not the original Caffe Readme but an installation guide for windows version.
You can download the windows x64 standalone package and run directly on MNIST dataset.
You may need the followings to build the code:
- Windows 64-bit
- MS Visual Studio 2012
- CUDA toolkit 6.5
- Other dependencies which you can directly download from here.
Currently it can be built by VS2012 for x64 flatform only. This is because the dependencies mentioned above is cross-compiled to support x64 only. If you want to build on 32bit windows, you need to rebuild your own 3rd-party libraries.
- Check out the code and switch to windows branch
- Download the dependency file and extract the folders inside to project root directory.
- Include any .cpp you want to build in the
./tools
directory to MainCaller.cpp. - Open the solution file in
./build/MSVC
- Switch build target to x64 platform (ONLY release mode is OK).
- Build the code and you may find the
./bin/MainCaller.exe
- Suppose you choose to build train_net.cpp which is the default one in MainCaller.cpp
- If you do not have GPU, please change it to CPU in
lenet_solver.prototxt
- Add cudnn64_65.dll to
./bin/
- Goto directory
./examples/mnist
- Double click
get_mnist_leveldb.bat
to download the dataset in leveldb format.(ignore) - Double click
train_lenet.bat
to see the training progress .
- I have trained mnist on K40 with this windows version, take 40 seconds. The original windows-caffe taken about 5 minutes to train mnist.
- It takes obvious longer time when you compile for the first time. Therefore please refrain from using
clean & rebuild
. - To support different GPU compute capabilities, the code is built for several compute capability versions. If you know the exact version of your GPU device, you may remove the support to other versions to speed up the compiling procedure. You may wish to take a look at #25 for more details.
I have trained on ImageNet with this windows porting as well. The speed is much slower than the one built on Ubuntu. 20 iterations take 79s on Windows, whereas same number of iterations take about 30s on Ubuntu (on GTX Titan).- The above issue has been solved since the upgrade of GPU driver to 340.62 and CUDA to 6.5. The same hardware is now running 26 sencods for 20 iterations on Windows. Hooray!!
- Please create new issues in github if you find any bug.
- If you have new pull requests, they are very welcome.
- Before you do that, you may wish to read this wiki page for more information.
Please let me know if you found issues.
Thanks @happynear and NIU ZHIHENG (https://github.com/niuzhiheng/caffe) for contribution caffe on windows.