Permalink
Commits on Jun 20, 2017
  1. Update README.md

    cypof committed on GitHub Jun 20, 2017
Commits on Jun 14, 2017
  1. Merge pull request #5687 from BVLC/readme_list_branches

    List branches in readme
    cypof committed on GitHub Jun 14, 2017
Commits on Jun 13, 2017
  1. List branches in readme

    cypof committed on GitHub Jun 13, 2017
Commits on Apr 15, 2017
  1. Merge pull request #5530 from willyd/nccl-py3

    Explicit std::string to bp::object conversion
    cypof committed on GitHub Apr 15, 2017
Commits on Apr 14, 2017
  1. Docker update to cuDNN 6

    cypof committed Apr 14, 2017
Commits on Apr 13, 2017
  1. Merge pull request #5527 from willyd/nccl-py3

    Added support for python 3 and NCCL
    cypof committed on GitHub Apr 13, 2017
Commits on Apr 12, 2017
  1. Fix log parsing #5422

    cypof committed Apr 12, 2017
  2. Merge pull request #5474 from willcrichton/master

    Fixed memory leaks in cudnn conv and relu
    cypof committed on GitHub Apr 12, 2017
  3. Merge pull request #5408 from cypof/multi_infer

    Init test network on all GPUs
    cypof committed on GitHub Apr 12, 2017
  4. Merge pull request #5455 from cypof/remove_shared_parallel

    Remove missed legacy parallel code
    cypof committed on GitHub Apr 12, 2017
Commits on Mar 27, 2017
Commits on Mar 14, 2017
Commits on Mar 13, 2017
  1. Merge pull request #5393 from jfolz/master

    Multi GPU training from Python can use any solver
    cypof committed on GitHub Mar 13, 2017
Commits on Feb 8, 2017
Commits on Jan 23, 2017
  1. Merge pull request #5215 from cypof/fix_restore

    Restore can be invoked on rank > 0
    cypof committed on GitHub Jan 23, 2017
Commits on Jan 20, 2017
  1. Merge pull request #5153 from cypof/docker

    Docker refresh: simplified & update to 16.04, cuda8, cudnn5, nccl
    cypof committed on GitHub Jan 20, 2017
Commits on Jan 14, 2017
  1. Merge pull request #5075 from tsocha/master

    Fix mkl issue #4836
    cypof committed on GitHub Jan 14, 2017
Commits on Jan 6, 2017
  1. Python Multi-GPU

    cypof committed Nov 23, 2016
  2. Switched multi-GPU to NCCL

    cypof committed Jan 6, 2017
Commits on Oct 1, 2016
  1. NV changed path to cudnn

    cypof committed Oct 1, 2016
Commits on Sep 9, 2016
Commits on Aug 14, 2015
Commits on Aug 9, 2015
  1. Multi-GPU

    - Parallelize batches among GPUs and tree-reduce the gradients
    - The effective batch size scales with the number of devices
    - Batch size is multiplied by the number of devices
    - Split batches between GPUs, and tree-reduce the gradients
    - Detect machine topology (twin-GPU boards, P2P connectivity)
    - Track device in syncedmem (thanks @thatguymike)
    - Insert a callback in the solver for minimal code change
    - Accept list for gpu flag of caffe tool, e.g. '-gpu 0,1' or '-gpu all'.
      Run on default GPU if no ID given.
    - Add multi-GPU solver test
    - Deterministic architecture for reproducible runs
    cypof committed with shelhamer May 19, 2015
  2. Allocate host memory through cudaMallocHost

    thanks to discussion by @thatguymike and @flx42
    cypof committed with shelhamer May 19, 2015
  3. Add DataReader for parallel training with one DB session

    - Make sure each solver accesses a different subset of the data
    - Sequential reading of DB for performance
    - Prefetch a configurable amount of data to host memory
    - Distribute data to solvers in round-robin way for determinism
    cypof committed with shelhamer May 19, 2015
  4. Persistent prefetch thread

    cypof committed with shelhamer May 19, 2015
  5. Change the way threads are started and stopped

    - Interrupt the thread before waiting on join
    - Provide a method for looping threads to exit on demand
    - CHECK if start and stop succeed instead of returning an error
    cypof committed with shelhamer Apr 28, 2015
  6. Thread-local Caffe

    cypof committed with shelhamer Apr 28, 2015