support sgd & dot & embedding with rsp #72

eric-haibin-lin · 2017-06-07T22:32:48Z

No description provided.

compiles on GPU update check alloc: Checkpoint. Pass elem-sum gpu test bug fix for copyfromto. sparse sgd test pass on gpu inefficient implementation for csr copy update submodule fix lint Simple bind with infer storage type (#32) * Symbol binding for sparse tensor development. (#31) * Initial checkin * Add init functions for simple bind in graph_executor * Add simple_bind c_api * Add simple bind c-api * Assign zeros to in_args, arg_grads, and aux_states * Add simple_bind2 python interface * Fix python interface bugs * Interface changes * Fix * Fix core dump * Add bind_ith_exec c_api * Change simple_bind2 * Fix seg fault * Finish simple_bind * Change _bind_ith_exec * Refactor simple_bind initialization flow for bind * Consolidate bind and simple_bind graph init flow * Fix bug * Clean up * Add comments * Clean up * Clean up * Minor correction * Rename APIs in graph executor * Refactor * Rebase * Delete deprecated functions * Move more front-end work to backend * Bug fix * Fix failed tests * Minor fix * Fix lint * Fix lint * Revert unnecessary changes * Revert * Revert * Clean up * Fix lint Conflicts: python/mxnet/symbol.py src/executor/graph_executor.cc * Add inferstorage to graph executor * re-enable tests for sparse embedding with simple_bind * type switch fix in sparse embedding" ; change `default` to `default_storage` for cast storage op (#33) * change default to default_storage * disable cpp test build temporarily attempt to fix windows build error, and fix lint (#34) update nnvm submodule (#37) Scipy build (#38) * update nnvm submodule * add scipy pip install for dockerfile Python3 unit tests (#39) * change xrange to range for python3 compatiblity" * remove more xrange from tests replace long with int for python3 (#40) fix the rest of TShape constructor errors (#41) fix lint (#42) fix wrong usage of mshadow::Shape1" (#43) implementation for Csr slice on cpu (#36) * CPU implementation for CSR remove seg_len from csr slice add some docs for slice csr change indptr, values, etc to be private member bug fix in sparse embedding update nnvm submoduel fix lint update unit test for sparse nd" * add const for SliceCsrIndPtr kernel Fix sparse dot according to the new RSP definition (#35) * Fix csr dot dns * Fix sparse dot * Add fallback and test cases for dot(csr, dns)=dns * Add int type switch * Fix * Fix * Fix update mshadow submodule (#44) Fix dns to rsp (#46) fix lint (#47) add runtime storage fallback detection" (#48) * add runtime storage fallback detection" * replace cast storage ex with cast storage impl Fm example (#45) * update csr slice logic to avoid confusion. add more exmaples. * add hint to module.update * more testcases(fallback) for sparse_nd * add to_csr() and to_rsp() method. More unit test (fallback now) * add fm test. fix lint * register sparse sgd under Optim.SGD * update dmlc-core submoduel * change indptr to _indptr temporarily. add const ref to fname fix lint fix lint; (#51) Guard gpu cast storage (#50) * Clean up * Fix typo Rearrange unit test files (#52)

…ix truediv for python3

* remove pyc files * add verbose for travis nosetests

* update Makefile * refactor test_sparse_operator * change `default_storage` back to `default` * remove unused cpp tests

* copied csv iter to libsvm iter test libsvm iter draft handle round batch == false for csr batch loader code refactoring add get stype, shape interface to iiter separate class for sparse iter add missing file fix mem corruption' rename variables add comments also read label from libsvm add test. update docs. update submodule Conflicts: python/mxnet/sparse_ndarray.py * update submodule * fix lint * update test * revert naming change

* add benchmark scritp for dot add gpu option for bench add get_data funciton for benchmark print t_sparse, too; add comment change nnz to dnesity add backward * add comment

* introduce CSRNDarray and rowsparseNDarray to python frontend api * temporarily disable fm_module test

* Init checkin * Fix * Adjust dot parallelization methods * Set num_omp_threads for benchmark from command line * Fix omp thread number * Clean up * Add scipy as dot baseline * Fix format

* Initial checkin * Fix bugs * Add unit test for sparse_retain * Add example and modify test

eric-haibin-lin · 2017-06-08T17:30:45Z

@reminisce could you review?

reminisce · 2017-06-08T17:25:57Z

src/operator/tensor/indexing_op.h

    auto req_type = req;
-    size_t segment_start = i * segment_len;
-    size_t segment_end = (i + 1) * segment_len;
+    auto segment_start = i * segment_len;


segment_start might be greater than num_rows if there are more threads than num_rows. Do we need a check here?

reminisce · 2017-06-08T17:27:09Z

src/operator/tensor/indexing_op.h

-        if (req_type == kWriteTo) req_type = kAddTo;
-        KERNEL_ASSIGN(dst_val[j * width + k], req_type, src[y * width + k]);
+      for (index_t k = 0; k < num_cols; k++) {
+        KERNEL_ASSIGN(dst_val[j * num_cols + k], req_type, src[y * num_cols + k]);


Since j*num_cols and y*num_cols are not dependent on the loop, is it more efficient to calculate them outside the loop?

reminisce · 2017-06-08T17:28:00Z

src/operator/tensor/indexing_op.h

  (*out_attrs)[0] = kRowSparseStorage;
-  (*out_attrs)[1] = kRowSparseStorage;
+  (*out_attrs)[1] = kDefaultStorage;


Need to use ASSIGN_CHECK?

reminisce · 2017-06-08T17:29:36Z

src/operator/tensor/indexing_op.h

+  auto num_cols = out->shape_[1];
+  // mutate req to avoid extra space allocation
+  auto out_req = req;
+  if (out_req == kWriteTo) out_req = kAddTo;


Why out_req can be mutated here?

support sgd(rsp, rsp) support dot(csr, rsp) when rsp is full add ref to const ndarray params support sparse embedding with rsp weight' fix lint modify embedding backward to produce dense grad remove invalid_rid for rsp->dns remove previous embedding op changes pass sparse embedding test add STORAGE_TYPE_ASSIGN_CHECK remove backward storage infer

eric-haibin-lin · 2017-06-09T03:37:29Z

closing this one with #75 for replacement

eric-haibin-lin and others added 16 commits May 27, 2017 05:22

fix lint. add scipy for python_test. fix scipy.sparse import error. f…

8652043

…ix truediv for python3

fix travis test (#54)

4e2f40f

* remove pyc files * add verbose for travis nosetests

cleanup some testing code and enums (#57)

be1e63b

* update Makefile * refactor test_sparse_operator * change `default_storage` back to `default` * remove unused cpp tests

add benchmark scritp for dot (#59)

965dfd7

* add benchmark scritp for dot add gpu option for bench add get_data funciton for benchmark print t_sparse, too; add comment change nnz to dnesity add backward * add comment

update fm test (#62)

cbfa792

introduce CSRNDarray and rowsparseNDarray to python frontend api (#58)

86d55df

* introduce CSRNDarray and rowsparseNDarray to python frontend api * temporarily disable fm_module test

fix lint (#64)

3e486d5

fix typo. disable libsvm io test (#65)

7c975a5

Improve dot (#61)

5c0685b

* Init checkin * Fix * Adjust dot parallelization methods * Set num_omp_threads for benchmark from command line * Fix omp thread number * Clean up * Add scipy as dot baseline * Fix format

sparse_retain op (#66)

c109eb6

* Initial checkin * Fix bugs * Add unit test for sparse_retain * Add example and modify test

add storage cast for outputs that have non-default storage (#67)

a7c8fe8

fix gpu build (#69)

39d12bc

Fix test_sparse_retain python3 issue (#68)

d27027a

update nnvm submodule (#71)

78a87ad

eric-haibin-lin force-pushed the walk-around branch 2 times, most recently from fef9f36 to 56afd54 Compare June 8, 2017 00:12

reminisce reviewed Jun 8, 2017

View reviewed changes

eric-haibin-lin force-pushed the master branch 2 times, most recently from 8c77488 to ac54762 Compare June 9, 2017 01:33

eric-haibin-lin added 2 commits June 9, 2017 03:32

revert nnvm

d6833f8

eric-haibin-lin force-pushed the walk-around branch from 0792835 to d6833f8 Compare June 9, 2017 03:34

eric-haibin-lin mentioned this pull request Jun 9, 2017

support sgd & dot & embedding with rsp #75

Merged

eric-haibin-lin closed this Jun 9, 2017

eric-haibin-lin deleted the walk-around branch June 15, 2017 20:35

eric-haibin-lin pushed a commit that referenced this pull request Apr 4, 2018

Change license file to be detectable by github (#72)

5b69ffd

eric-haibin-lin pushed a commit that referenced this pull request Apr 4, 2018

fix coreml tools (#72)

fd76749

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support sgd & dot & embedding with rsp #72

support sgd & dot & embedding with rsp #72

eric-haibin-lin commented Jun 7, 2017

eric-haibin-lin commented Jun 8, 2017

reminisce Jun 8, 2017

reminisce Jun 8, 2017

reminisce Jun 8, 2017

reminisce Jun 8, 2017 •

edited

Loading

eric-haibin-lin commented Jun 9, 2017

support sgd & dot & embedding with rsp #72

support sgd & dot & embedding with rsp #72

Conversation

eric-haibin-lin commented Jun 7, 2017

eric-haibin-lin commented Jun 8, 2017

reminisce Jun 8, 2017

Choose a reason for hiding this comment

reminisce Jun 8, 2017

Choose a reason for hiding this comment

reminisce Jun 8, 2017

Choose a reason for hiding this comment

reminisce Jun 8, 2017 • edited Loading

Choose a reason for hiding this comment

eric-haibin-lin commented Jun 9, 2017

reminisce Jun 8, 2017 •

edited

Loading