[SYSTEMDS-2972] Initial Multi-Threaded transformencode #1261
ilovemesomeramen wants to merge 11 commits into apache:master
Conversation
Added sparse support
# Conflicts:
#	src/main/java/org/apache/sysds/runtime/transform/encode/ColumnEncoderDummycode.java
#	src/test/resources/datasets/homes3/homes.tfspec_dummy_all.json
phaniarnab left a comment
Thanks for the patch @ilovemesomeramen. LGTM.
I have no comments that must be addressed before merging. However, I have a few comments and suggestions for future discussions and commits.
I fixed a few formatting issues before merging.
```java
	synchronized(sparseBlock.get(r)) {
		sparseBlock.set(r, c, v);
	}
}
else {
	denseBlock.set(r, c, v);
}
```
Is the denseBlock/sparseBlock guaranteed to be allocated here?
Why synchronize only the sparse case?
Yes, it must be allocated before.
Since dense blocks are just a collection of arrays, and nothing should be reading at the moment we write to the block, we can write concurrently without any synchronization. The sparse blocks, on the other hand, are only row-independent, so we need to synchronize over rows.
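The reasoning above can be illustrated with a small standalone sketch. This is a toy model, not the SystemDS classes: `denseBlock`, `sparseBlock`, `set`, and the one-task-per-column write pattern are hypothetical stand-ins chosen to mimic how independent column encoders write different columns of the same rows.

```java
import java.util.*;
import java.util.concurrent.*;

// Toy model of the two write paths discussed above (all names hypothetical).
public class BlockWriteSketch {
	static double[][] denseBlock;                      // plain arrays: cell writes are independent
	static List<TreeMap<Integer, Double>> sparseBlock; // one sorted row each: inserts mutate shared row state

	static void set(int r, int c, double v, boolean sparse) {
		if(sparse) {
			// sparse rows are only independent of each other, so lock the row being modified
			TreeMap<Integer, Double> row = sparseBlock.get(r);
			synchronized(row) {
				row.put(c, v);
			}
		}
		else {
			// distinct (r,c) writers never touch the same array slot: no lock needed
			denseBlock[r][c] = v;
		}
	}

	// one task per column (as a column encoder would do); all tasks share the same rows,
	// so without the per-row lock the sparse writes would race
	public static long fill(int rows, int cols, boolean sparse) throws Exception {
		denseBlock = new double[rows][cols];
		sparseBlock = new ArrayList<>();
		for(int r = 0; r < rows; r++)
			sparseBlock.add(new TreeMap<>());
		ExecutorService pool = Executors.newFixedThreadPool(4);
		List<Callable<Object>> tasks = new ArrayList<>();
		for(int c = 0; c < cols; c++) {
			final int fc = c;
			tasks.add(() -> {
				for(int r = 0; r < rows; r++)
					set(r, fc, 1.0, sparse);
				return null;
			});
		}
		pool.invokeAll(tasks);
		pool.shutdown();
		// count the written cells to check that no write was lost
		long count = 0;
		if(sparse)
			for(TreeMap<Integer, Double> row : sparseBlock)
				count += row.size();
		else
			for(double[] row : denseBlock)
				for(double v : row)
					if(v != 0)
						count++;
		return count;
	}

	public static void main(String[] args) throws Exception {
		System.out.println(fill(8, 16, false)); // 128
		System.out.println(fill(8, 16, true));  // 128
	}
}
```

With the row lock, both paths deterministically end up with all rows*cols cells written; removing the `synchronized` block can silently drop concurrent inserts into the same row.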
```java
public void setApplyBlockSize(int blk) {
	APPLY_BLOCKSIZE = blk;
}
```
Can you please add a test with non-zero APPLY_BLOCKSIZE?
Yes, sure. I have it in my local test cases and forgot to add it.
```java
for(ColumnEncoderComposite encoder : _columnEncoders) {
	List<Callable<Object>> partialBuildTasks = encoder.getPartialBuildTasks(in, blockSize);
	if(partialBuildTasks == null) {
		partials.add(null);
		continue;
	}
	partials.add(pool.invokeAll(partialBuildTasks));
}
for(int e = 0; e < _columnEncoders.size(); e++) {
	List<Future<Object>> partial = partials.get(e);
	if(partial == null)
		continue;
	tasks.add(new ColumnMergeBuildPartialTask(_columnEncoders.get(e), partial));
}
```
Discussion: This logic of creating tasks (column-wise row partition) restricts us from more sophisticated task creation with an arbitrary number of columns. This may not be a problem though.
Since this PR, I have done a lot more testing, and this partial building is rather complicated, especially since a ton of intermediates are created, increasing GC pressure. At the moment, partial building is not really viable in most scenarios. This will be good to discuss on Friday.
```java
int blockSize = BUILD_BLOCKSIZE <= 0 ? in.getNumRows() : BUILD_BLOCKSIZE;
List<Callable<Integer>> tasks = new ArrayList<>();
```
Discussion: What if we need column-specific block sizes in the future?
That's a good point.
This should not be a problem: since the encoders are independent, we can just call each encoder with the block size we need.
So in the future we might get an array of block sizes, which we then just need to match to the encoders.
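A minimal sketch of that idea, assuming a hypothetical `resolveBlockSizes` helper: each encoder gets its own entry in a block-size array, and a non-positive entry keeps today's behavior of one block spanning all rows (mirroring the existing `BUILD_BLOCKSIZE <= 0 ? in.getNumRows() : BUILD_BLOCKSIZE` logic).

```java
import java.util.*;

// Hypothetical sketch: per-encoder block sizes instead of one global BUILD_BLOCKSIZE.
public class BlockSizeSketch {
	// blockSizes[e] <= 0 keeps the current default: one block spanning all rows
	public static int[] resolveBlockSizes(int[] blockSizes, int numRows) {
		int[] resolved = new int[blockSizes.length];
		for(int e = 0; e < blockSizes.length; e++)
			resolved[e] = blockSizes[e] <= 0 ? numRows : blockSizes[e];
		return resolved;
	}

	public static void main(String[] args) {
		// encoder 0 keeps the single-block default, encoder 1 builds in 64-row blocks
		System.out.println(Arrays.toString(resolveBlockSizes(new int[] {0, 64}, 1000))); // [1000, 64]
	}
}
```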
```java
_encoder.mergeBuildPartial(_partials, 0, _partials.size());
return 1;
```
What is the significance of this hard-coded 1?
I missed that. It should be `null`, and the task should be a `Callable<Object>`.
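A self-contained sketch of the corrected task, assuming that fix. `MergeTarget` is a hypothetical stand-in for the encoder's `mergeBuildPartial`; the point is only the task shape: a `Callable<Object>` that returns `null` because the result value carries no meaning.

```java
import java.util.*;
import java.util.concurrent.*;

// Sketch of the corrected merge task (MergeTarget is a hypothetical stand-in).
public class MergePartialSketch {
	// stands in for the encoder's mergeBuildPartial(partials, start, end)
	static class MergeTarget {
		final Map<String, Integer> dict = new HashMap<>();
		void mergeBuildPartial(List<Map<String, Integer>> partials, int start, int end) {
			for(int i = start; i < end; i++)
				for(Map.Entry<String, Integer> e : partials.get(i).entrySet())
					dict.merge(e.getKey(), e.getValue(), Integer::sum);
		}
	}

	static class ColumnMergeBuildPartialTask implements Callable<Object> {
		private final MergeTarget _encoder;
		private final List<Map<String, Integer>> _partials;

		ColumnMergeBuildPartialTask(MergeTarget encoder, List<Map<String, Integer>> partials) {
			_encoder = encoder;
			_partials = partials;
		}

		@Override
		public Object call() {
			_encoder.mergeBuildPartial(_partials, 0, _partials.size());
			return null; // no meaningful result: the task only signals completion
		}
	}

	public static int run() throws Exception {
		MergeTarget enc = new MergeTarget();
		List<Map<String, Integer>> partials = new ArrayList<>();
		partials.add(Map.of("a", 2));
		partials.add(Map.of("a", 1, "b", 3));
		Object res = new ColumnMergeBuildPartialTask(enc, partials).call();
		assert res == null;
		return enc.dict.get("a"); // merged count for key "a": 2 + 1 = 3
	}

	public static void main(String[] args) throws Exception {
		System.out.println(run()); // 3
	}
}
```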
This PR adds basic Multithreading capability to the transform encode implementation.
Each ColumnEncoder can be executed on a separate thread, or can be split up into even smaller subjobs which only apply to a certain row range.
Initial benchmarks with 16 CPUs show up to a 50x speed improvement compared to the old SystemML implementation.
Currently this code is dormant, which means a call to `transformencode` in a DML script still uses the single-threaded implementation. This will be changed when further improvements and testing are complete.
Large matrices (e.g., 1000000x1000) are still not viable due to suspected thread starvation. This will be addressed in a future PR with some sort of access partitioning (Radix/Range).
This PR also brings back sparse support for large dummycoded matrices, which was accidentally removed in a prior PR.
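The row-range splitting mentioned above can be sketched as follows. This is an illustrative helper, not the PR's code: `rowBlocks` is a hypothetical name, and the non-positive-means-whole-input convention is borrowed from the `BUILD_BLOCKSIZE` logic shown earlier.

```java
import java.util.*;

// Hypothetical sketch: cut one encoder's work into [start, end) row ranges
// that independent threads can process as subjobs.
public class RowRangeSketch {
	public static List<int[]> rowBlocks(int numRows, int blockSize) {
		// non-positive block size: a single block spanning all rows
		int blk = blockSize <= 0 ? numRows : blockSize;
		List<int[]> ranges = new ArrayList<>();
		for(int start = 0; start < numRows; start += blk)
			ranges.add(new int[] {start, Math.min(start + blk, numRows)});
		return ranges;
	}

	public static void main(String[] args) {
		for(int[] r : rowBlocks(10, 4))
			System.out.println(r[0] + ".." + r[1]); // 0..4, 4..8, 8..10
	}
}
```

Each range would then back one `Callable` submitted to the pool, which is what makes per-encoder parallelism finer-grained than one thread per column.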