Adding SparseLinear with CUDA #223
Conversation
lib/THCUNN/SparseLinear.cu (Outdated)

      nnz,
      &pBufferSize
    );
    cudaMalloc((void**)&pBuffer, pBufferSize);
Could you avoid cudaMalloc and cudaFree here, and instead preallocate a buffer that is passed in? cudaFree causes a device synchronization, which stops us from doing multi-GPU.
The buffer size is not known ahead of time, so I'm not sure how I would preallocate it. Would using a THCudaStorage work?
I think since this is part of an nn layer, you can initialize the buffer in nn, keep it around, and pass it in. You can call THCudaTensor_resize(), which will only reallocate if it needs a bigger buffer.
Similar lines of code:
https://github.com/torch/nn/blob/master/SpatialConvolution.lua#L51
https://github.com/torch/nn/blob/master/SpatialConvolution.lua#L109
https://github.com/torch/nn/blob/master/SpatialConvolution.lua#L180
and follow the "columns" variable in here:
https://github.com/torch/cunn/blob/master/lib/THCUNN/SpatialConvolutionMM.cu
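The grow-only pattern the comment describes can be sketched roughly as follows. This is an illustrative fragment against the old THC API, not code from this PR; the helper name and the float-sized workspace are assumptions:

```cuda
// Hypothetical sketch: the nn layer owns a persistent workspace tensor
// and grows it with a resize instead of cudaMalloc/cudaFree per call.
// THCudaTensor_resize1d only reallocates when the storage must grow,
// so no cudaFree (and no device synchronization) happens on the hot path.
static void ensureBuffer(THCState *state, THCudaTensor *buffer,
                         size_t bytesNeeded) {
  long elems = (long)((bytesNeeded + sizeof(float) - 1) / sizeof(float));
  if (THCudaTensor_nElement(state, buffer) < elems) {
    THCudaTensor_resize1d(state, buffer, elems);
  }
  // THCudaTensor_data(state, buffer) can then be passed to cusparse
  // in place of the per-call cudaMalloc'd pBuffer.
}
```

The layer-side Lua code would create the buffer tensor once (as SpatialConvolution does with its `columns` buffer in the links above) and pass it into every forward call.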
Force-pushed from 53e46ee to 37e29f6.
Should be fixed; you merged the addition in nn half an hour ago, Soumith. Thanks!
Force-pushed from 3239d64 to 1a09fac.
This is now updated with the batch version of sparse linear given in this commit.
lib/THCUNN/SparseLinear.cu (Outdated)

    csr_int = THCudaIntTensor_newWithSize1d(state, batchnum+1);
    init_cusparse();
    for (h = 0; h < batchnum+1; h++) {
      THCudaIntTensor_set1d(state, csr_int, h, 1 + nnz * h);
Make this for loop a simple, stupid CUDA kernel.
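A minimal sketch of such a kernel, filling the same one-based CSR row pointers as the loop above (the kernel and launcher names are illustrative, not from this PR):

```cuda
// Illustrative kernel: csrRowPtr[h] = 1 + nnz * h for h in [0, batchnum],
// replacing the host-side per-element THCudaIntTensor_set1d loop.
__global__ void fillCsrRowPtr(int *csrRowPtr, int nnz, int batchnum) {
  int h = blockIdx.x * blockDim.x + threadIdx.x;
  if (h <= batchnum) {
    csrRowPtr[h] = 1 + nnz * h;  // one-based indexing for cusparse
  }
}

// Launch over batchnum+1 elements; d_ptr is the tensor's device pointer.
static void fillCsrRowPtrOnDevice(int *d_ptr, int nnz, int batchnum) {
  int n = batchnum + 1;
  int threads = 256;
  int blocks = (n + threads - 1) / threads;
  fillCsrRowPtr<<<blocks, threads>>>(d_ptr, nnz, batchnum);
}
```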
This has been updated to work with the PR at torch/nn#698.
lib/THCUNN/SparseLinear.cu (Outdated)

      thrust::copy(ptr, ptr+THCudaTensor_nElement(state, tensor), std::ostream_iterator<float>(std::cout, "\t"));
      printf("\n");
    }
    void printCuda(THCState *state, THCudaIntTensor *tensor, char* str) {
This function seems to be double-declared here
Fixed nits.
Thanks Zeming!
Adding SparseLinear with CUDA. Most of the functions are directly converted from SparseLinear.c. Depending on how well the THCudaBlas operations pipeline, it may be more efficient to write custom kernels for most of them. updateOutput uses cusparse.
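The updateOutput path boils down to a sparse-times-dense multiply. A hedged sketch of the core call using the legacy cuSPARSE csrmm interface (the surrounding handle/descriptor setup is assumed; cusparseScsrmm was later deprecated in favor of the generic SpMM API, and the function and argument choices here are illustrative, not lifted from this PR):

```cuda
// Illustrative only: C(m x n) = alpha * A(m x k, CSR) * B(k x n) + beta * C,
// i.e. output = sparse_input * weight, via legacy cuSPARSE.
// All pointers are device pointers; error checking omitted for brevity.
// The descriptor's index base must match the one-based row pointers
// built above (CUSPARSE_INDEX_BASE_ONE).
void sparseUpdateOutput(cusparseHandle_t handle, cusparseMatDescr_t descr,
                        int batchnum, int outDim, int inDim, int nnz,
                        const float *csrVal, const int *csrRowPtr,
                        const int *csrColInd, const float *weight,
                        float *output) {
  const float alpha = 1.0f, beta = 0.0f;
  cusparseScsrmm(handle, CUSPARSE_OPERATION_NON_TRANSPOSE,
                 batchnum, outDim, inDim, nnz,
                 &alpha, descr, csrVal, csrRowPtr, csrColInd,
                 weight, inDim,      // B with column-major leading dim k
                 &beta, output, batchnum);  // C with leading dim m
}
```

Note that cuSPARSE expects the dense operands in column-major layout, so leading dimensions are the column heights (inDim and batchnum here), not the row widths.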