NVIDIA has discontinued support and maintenance for this repository. Everything is provided as-is with no further updates being accepted. Thanks for all the contributions and engagement!

Caffe

Caffe is a deep learning framework made with expression, speed, and modularity in mind. It is developed by the Berkeley Vision and Learning Center (BVLC) and community contributors.

NVCaffe

16 bit (half) floating point train and inference support.
Mixed-precision support. It allows to store and/or compute data in either 64, 32 or 16 bit formats. Precision can be defined for every layer (forward and backward passes might be different too), or it can be set for the whole Net.
Layer-wise Adaptive Rate Control (LARC) and adaptive global gradient scaler for better accuracy, especially in 16-bit training.
Integration with cuDNN v8.
Automatic selection of the best cuDNN convolution algorithm.
Integration with v2.2 (or higher) of NCCL library for improved multi-GPU scaling.
Optimized GPU memory management for data and parameters storage, I/O buffers and workspace for convolutional layers.
Parallel data parser, transformer and image reader for improved I/O performance.
Parallel back propagation and gradient reduction on multi-GPU systems.
Fast solvers implementation with fused CUDA kernels for weights and history update.
Multi-GPU test phase for even memory load across multiple GPUs.
Backward compatibility with BVLC Caffe and NVCaffe 0.15 and higher.
Extended set of optimized models (including 16 bit floating point examples).
Experimental feature (no official support) Multi-node training (since v0.17.1, NCCL 2.2 and OpenMPI 2 required).
Experimental feature (no official support) TRTLayer (since v0.17.1, can be used as inference plugin).

License and Citation

Caffe is released under the BSD 2-Clause license. The BVLC reference models are released for unrestricted use.

Please cite Caffe in your publications if it helps your research:

@article{jia2014caffe,
  Author = {Jia, Yangqing and Shelhamer, Evan and Donahue, Jeff and Karayev, Sergey and Long, Jonathan and Girshick, Ross and Guadarrama, Sergio and Darrell, Trevor},
  Journal = {arXiv preprint arXiv:1408.5093},
  Title = {Caffe: Convolutional Architecture for Fast Feature Embedding},
  Year = {2014}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4,839 Commits
3rdparty		3rdparty
classification		classification
cmake		cmake
data		data
docker		docker
docs		docs
examples		examples
include/caffe		include/caffe
matlab		matlab
models		models
packaging/deb		packaging/deb
python		python
scripts		scripts
src		src
tools		tools
.Doxyfile		.Doxyfile
.gitignore		.gitignore
.travis.yml		.travis.yml
CMakeLists.txt		CMakeLists.txt
CONTRIBUTING.md		CONTRIBUTING.md
CONTRIBUTORS.md		CONTRIBUTORS.md
INSTALL.md		INSTALL.md
LICENSE		LICENSE
Makefile		Makefile
Makefile.config.example		Makefile.config.example
NVCaffe-User-Guide.pdf		NVCaffe-User-Guide.pdf
NVIDIA_CLA_v1.0.1.docx		NVIDIA_CLA_v1.0.1.docx
README.md		README.md
common_plot.py		common_plot.py
plot_loss.py		plot_loss.py
plot_top1.py		plot_top1.py
plot_top5.py		plot_top5.py
plot_train_loss.py		plot_train_loss.py

License

NVIDIA/caffe

Folders and files

Latest commit

History

Repository files navigation

NVIDIA has discontinued support and maintenance for this repository. Everything is provided as-is with no further updates being accepted. Thanks for all the contributions and engagement!

Caffe

NVCaffe

License and Citation

About

Resources

License

Stars

Watchers

Forks

Languages