GitHub - jklebes/matlabCUDAconvolution: matlab wrapper for CUDA 2D and 3D GPU-accelerated convolution

C++/CUDA GPU-accelerated convolution in 2D and 3D.

Based on NVIDIA cuda-samples convolutionFFT2D combined with matlab mexGPUexample.m.

I provide compiled .mexw64 files from a Windows 10 and compiled .mexa64 files from unix, which should run out of the box.
If this doesn't work for you due to different machine, a new mex compilation will be attempted and the NVIDIA CUDA toolbox - including an nvcc compiler, supported C++ compiler, and library cuFFT - must be installed.

Run functions CUDAconvolution(data, kernel) or CUDAconvolution3D(data, kernel) analogous to matlab conv2, convn.

The method is convolution by FFT, pointwise multiply, and inverse FFT.

This method is much faster in the case of medium to large kernels; outperforms matlab starting at kernel size ~12 x 12 x 12 and speedup is more than 1000x at convolution 900x900x200 with 100x100x100 kernel (test3d.mlx). Execution time should be constant and is <1s on my machine up to GPU memory limit.

Data should be the bigger array, as an array cut to original dimensions of data is returned.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
CUDAconvolution.m		CUDAconvolution.m
CUDAconvolution3D.m		CUDAconvolution3D.m
README.md		README.md
convolutionFFT2D_common.h		convolutionFFT2D_common.h
convolutionFFT3D_common.h		convolutionFFT3D_common.h
mexGPUconvolution.cu		mexGPUconvolution.cu
mexGPUconvolution.mexa64		mexGPUconvolution.mexa64
mexGPUconvolution.mexw64		mexGPUconvolution.mexw64
mexGPUconvolution3D.cu		mexGPUconvolution3D.cu
mexGPUconvolution3D.mexa64		mexGPUconvolution3D.mexa64
mexGPUconvolution3D.mexw64		mexGPUconvolution3D.mexw64
test3D.mlx		test3D.mlx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases 6

Packages

Languages

jklebes/matlabCUDAconvolution

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases 6

Packages 0

Languages

Packages