[SYSTEMDS-3091] MatrixBlock Initialized with Zeros is Sparse by ywcb00 · Pull Request #1362 · apache/systemds

ywcb00 · 2021-08-11T13:48:01Z

Hi,
This PR tries to remove the performance bottleneck inside the federated ctable instruction (caused by aggregating the sparse partial result matrices using a dense plus operator) by constructing a MatrixBlock as a sparse matrix if all the values are initialized to zero.

Thanks for review :)

mboehm7 · 2021-08-12T15:58:51Z

Since the implications of these changed defaults can be large, I would appreciate if we could address this ctable performance locally by passing the right parameters from outside MatrixBlock. Would that be possible or did you encounter any issues there?

… value is 0" This reverts commit 6bd5ffa.

…se matrices

ywcb00 · 2021-08-13T09:03:21Z

Yes, it is possible to address it locally.
I've changed it accordingly :)

However, I think we shouldn't forget the change of the MatrixBlock right away, because it accelerates the CTable Instruction by a factor of 100 in my experiments. Maybe it's the same with other instructions. 🤔

mboehm7 · 2021-08-13T21:44:59Z

LGTM - great catch @ywcb00 and thanks for fixing this performance issue. For local CP and distributed Spark operations we actually decide based on estimated sparsity if we do a sparse hash aggregation or an accumulation into a dense matrix. Likely the federated ctable operations was implemented for correctness so far. In your experiments, however, please test both sparse ctable scenario (like creating permutation matrices) and dense scenarios (like confusion matrices with moderate number of classes). In any case, the revised change is much better, because the previous commit affected the defaults, partially without considering the value, and unintended sparse accumulation can be vastly slower due to binary search and potential shifting. But you're right reevaluated these defaults (e.g., with our revived perftest suite). This reminds me, if feel it's beneficial, please add your federated algorithm tests to this perftest so we can automatically run them before releases.

fix(MatrixBlock.java): create matrix sparse if initialization value is 0

6bd5ffa

ywcb00 added 2 commits August 13, 2021 00:47

Revert "fix(MatrixBlock.java): create matrix sparse if initialization…

47f126e

… value is 0" This reverts commit 6bd5ffa.

fix(CtableFEDInstruction.java): create sparse matrices instead of den…

62a884f

…se matrices

asfgit closed this in 59267c9 Aug 13, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYSTEMDS-3091] MatrixBlock Initialized with Zeros is Sparse#1362

[SYSTEMDS-3091] MatrixBlock Initialized with Zeros is Sparse#1362
ywcb00 wants to merge 3 commits intoapache:masterfrom
ywcb00:exp/improve/runtime/instructions/fed/ctable/plusagg

ywcb00 commented Aug 11, 2021

Uh oh!

mboehm7 commented Aug 12, 2021

Uh oh!

ywcb00 commented Aug 13, 2021

Uh oh!

mboehm7 commented Aug 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ywcb00 commented Aug 11, 2021

Uh oh!

mboehm7 commented Aug 12, 2021

Uh oh!

ywcb00 commented Aug 13, 2021

Uh oh!

mboehm7 commented Aug 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants