Initial planning phase: efficient data management for persistent workers #672

Closed
wlandau opened this issue Jan 13, 2019 · 3 comments

wlandau commented Jan 13, 2019

Following up on #561 (comment). My thoughts so far:

  • Minimize the data shuffling required to load the dependencies of each target on each persistent worker.
  • Maybe assign each target to the worker that already has the most of its dependency data loaded (see the sketch after this list). There is a tradeoff with load balancing: the worker with the best data affinity is not always the one that frees up first, especially when target runtimes vary.
  • Maybe implement a mechanism that shares memory across workers. Maybe POSIX threads for local parallelism?
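
As a rough illustration of the affinity idea, here is a minimal sketch (hypothetical helper, not drake's internal API) that scores each persistent worker by how much of a target's dependency data it already holds in memory and picks the best match:

```r
# Hypothetical sketch: `worker_loaded` maps worker IDs to the names of objects
# already in each worker's memory, and `dep_sizes` gives the size in bytes of
# each dependency. Neither structure exists in drake; they are assumptions
# for illustration only.
assign_target <- function(deps, worker_loaded, dep_sizes) {
  # Bytes of the target's dependency data each worker already holds.
  overlap <- vapply(
    worker_loaded,
    function(loaded) sum(dep_sizes[intersect(deps, loaded)]),
    numeric(1)
  )
  # Send the target to the worker with the most overlap (ties broken by order).
  names(which.max(overlap))
}

worker_loaded <- list(w1 = c("raw_data", "model_a"), w2 = c("summary_stats"))
dep_sizes <- c(raw_data = 5e8, model_a = 2e7, summary_stats = 1e3)
assign_target(c("raw_data", "summary_stats"), worker_loaded, dep_sizes) # "w1"
```

A real scheduler would also weigh worker availability and expected target runtime, which is where the load-balancing tradeoff above comes in.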

@mschubert, is this different from what you have in mind?

@wlandau wlandau changed the title Efficient distribution of dependency data shared across persistent workers Planning phase: efficient distribution of dependency data shared across persistent workers Jan 13, 2019
@wlandau wlandau added this to To do in Improve performance Jan 13, 2019
@wlandau wlandau moved this from To do to In progress in Improve performance Jan 15, 2019
@wlandau wlandau moved this from In progress to To do in Improve performance Jan 15, 2019
@wlandau wlandau changed the title Planning phase: efficient distribution of dependency data shared across persistent workers Initial planning phase: efficient data management for persistent workers Jan 20, 2019

wlandau commented Apr 3, 2019

Through #800 and related issues, I am reducing the size of the common data sent to the workers. We could also reduce the data sent along with each individual target and use the dependency data each worker already holds to compute sensible worker affinities. @mschubert, were you thinking about something similar for #561 (comment)?
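
For instance, here is a hedged sketch (hypothetical names, not drake's or clustermq's actual code) of shipping only the dependencies a worker does not already hold, based on hashes the worker reports back:

```r
# Hypothetical sketch: `dep_hashes` are the current hashes of a target's
# dependencies, `dep_values` holds the corresponding objects, and
# `worker_hashes` is what the worker reports it already has cached.
payload_for_worker <- function(dep_hashes, dep_values, worker_hashes) {
  # A dependency must travel over the socket if the worker lacks it
  # or holds a stale copy.
  stale <- vapply(
    names(dep_hashes),
    function(d) is.na(worker_hashes[d]) || worker_hashes[d] != dep_hashes[d],
    logical(1)
  )
  dep_values[names(dep_hashes)[stale]]
}

dep_hashes <- c(raw_data = "abc123", model_a = "def456")
worker_hashes <- c(raw_data = "abc123") # the worker already caches raw_data
names(payload_for_worker(dep_hashes, list(raw_data = 1, model_a = 2), worker_hashes))
# "model_a"
```

This would be complementary to the worker-affinity idea above: the better the affinity, the smaller the payload that remains to ship.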


wlandau commented Jul 12, 2019

Looks like this is a major focus of clustermq going forward: #933 (comment), mschubert/clustermq#151, mschubert/clustermq#154, mschubert/clustermq#160, mschubert/clustermq#161.

@wlandau wlandau closed this as completed Jul 12, 2019
Improve performance automation moved this from To do to Done Jul 12, 2019

wlandau commented Jul 12, 2019

Will reopen if anything needs to happen in drake.
