This repository has been archived by the owner on Aug 3, 2021. It is now read-only.

NVIDIA/torch-nccl

 
 

nccl.torch

Torch7 FFI bindings for the NVIDIA NCCL library.

Installation

Collective operations supported

  • allReduce
  • reduce
  • broadcast
  • allGather
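To make the data movement of these four collectives concrete, here is a minimal pure-Lua sketch that models each device buffer as a plain Lua array. It is only an illustration of the semantics, not the nccl.torch implementation: the helper names (`sumAll` and the local functions below) are hypothetical, the root argument is an assumption of this sketch, and the reduction is assumed to be an element-wise sum.

```lua
-- Pure-Lua model of the four collectives. Each "device buffer" is a plain
-- Lua array; this illustrates data movement only, not the nccl.torch API.

-- Element-wise sum of all buffers (the reduction assumed in this sketch).
local function sumAll(bufs)
  local out = {}
  for i = 1, #bufs[1] do
    local s = 0
    for d = 1, #bufs do s = s + bufs[d][i] end
    out[i] = s
  end
  return out
end

-- allReduce: every device ends up with the reduced result (in place).
local function allReduce(bufs)
  local total = sumAll(bufs)
  for d = 1, #bufs do
    for i = 1, #total do bufs[d][i] = total[i] end
  end
end

-- reduce: only the root device receives the reduced result.
local function reduce(bufs, root)
  local total = sumAll(bufs)
  for i = 1, #total do bufs[root][i] = total[i] end
end

-- broadcast: the root's contents are copied to every other device.
local function broadcast(bufs, root)
  for d = 1, #bufs do
    if d ~= root then
      for i = 1, #bufs[root] do bufs[d][i] = bufs[root][i] end
    end
  end
end

-- allGather: every device receives the concatenation of all inputs.
local function allGather(bufs)
  local outs = {}
  for d = 1, #bufs do
    outs[d] = {}
    for src = 1, #bufs do
      for i = 1, #bufs[src] do
        outs[d][#outs[d] + 1] = bufs[src][i]
      end
    end
  end
  return outs
end

-- Two "devices", two elements each.
local bufs = { {1, 2}, {10, 20} }
allReduce(bufs)
print(bufs[1][1], bufs[2][2])  -- prints 11 and 22
```

In this model, allReduce and reduce differ only in who receives the result, and allGather is the only operation whose output is larger than each input.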

Example usage

The argument to a collective call should be a table of contiguous tensors, each located on a different device. For example, to perform an in-place allReduce on a table of tensors:

require 'nccl'
nccl.allReduce(inputs)

where inputs is a table of contiguous tensors of the same size, each located on a different device.
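A fuller sketch of how such an inputs table might be built is given here. It assumes cutorch is installed and at least one GPU is visible. The tensor size (1024) and fill values are arbitrary choices for illustration, and the final comment assumes the reduction is a sum, which this README does not state.

```lua
require 'cutorch'
require 'nccl'

-- Build one contiguous CUDA tensor per visible GPU; all have the same size.
local nGPU = cutorch.getDeviceCount()
local inputs = {}
for dev = 1, nGPU do
  cutorch.setDevice(dev)                          -- allocate on this device
  inputs[dev] = torch.CudaTensor(1024):fill(dev)  -- arbitrary size and value
end

-- In-place allReduce across the table of tensors.
nccl.allReduce(inputs)

-- Wait for outstanding GPU work before reading results on the host.
cutorch.synchronizeAll()

-- Assumption: if the reduction is a sum, every element on every device
-- would equal 1 + 2 + ... + nGPU.
for dev = 1, nGPU do
  cutorch.setDevice(dev)
  print(dev, inputs[dev][1])
end
```

Filling each tensor with its device index makes it easy to check by eye that every device received the same reduced value.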
