Convolution op for Theano based on CuFFT using scikits.cuda
Switch branches/tags
Nothing to show
Clone or download
Latest commit ce1182b Jul 22, 2014
Failed to load latest commit information.
.gitignore Update the readme. Jul 22, 2014


Current status

This have been merged in Theano. Don't use this repo anymore.



Convolution op for Theano based on CuFFT using scikits.cuda

This is an experiment in implementing an FFT-based convolution op for Theano, using scikits.cuda. It was inspired by this paper, which shows promising speedups for FFT-based convolutions in the Torch7 framework:

Currently this is barely functional. Input is welcome!


Current status of this repo

The implementation gives the same result as a valid convolution using Theano's own conv2d. With the implementation of NativeBatchedComplexDot op, the performance seems to be quite good (several times faster than Theano's own conv2d in many cases).

The next step is to do some proper performance testing for different input/filter sizes, with a comparison to Theano's own convop and to the cuda-convnet wrappers (where applicable).