This repository has been archived by the owner on Mar 12, 2018. It is now read-only.

Interfaces for Dynamic Optimization #31

Closed
johnyangk opened this issue Dec 31, 2016 · 11 comments

Comments

@johnyangk
Contributor

Currently, we assume that the compiler optimization happens just once, before the job commences.

Let's allow the optimization to happen multiple times, at runtime. We need to carefully think about how the interfaces between different components in the system should change.

The execution flow might look like this: The engine feeds runtime metrics into the compiler optimizer, which outputs a new IR for the compiler backend. The compiler backend then manipulates the JobDAG, with which the engine resumes execution.
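To make this concrete, here is a rough sketch of what the interfaces in this feedback loop could look like. All of the type and method names below are hypothetical, just to illustrate the flow, not actual code:

```java
// A minimal sketch of the runtime re-optimization loop described above.
// Every name here (IRDag, JobDAG, RuntimeMetrics, Optimizer, Backend)
// is hypothetical and only illustrates the flow.
final class IRDag {}          // compiler-side intermediate representation
final class JobDAG {}         // engine-facing execution plan
final class RuntimeMetrics {} // stats the engine collects while running

interface Optimizer {
  // Rewrites the IR using metrics fed back by the engine at runtime.
  IRDag optimize(IRDag current, RuntimeMetrics metrics);
}

interface Backend {
  // Lowers the (re-)optimized IR into a JobDAG with which the engine resumes.
  JobDAG lower(IRDag ir);
}
```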

@johnyangk
Contributor Author

OK, so I talked with Eunji (our new lab member) and got some great insights on this.

In traditional compilers, JIT compilation is usually used for code blocks that are executed multiple times (think loops). The compiler generates a complete execution plan prior to execution, and then, as the iterations go on, re-writes the plan based on the obtained stats.

In contrast, the DAGs we execute do not contain cycles, and runtime optimization (e.g., dynamic partitioning) of an operator usually depends on the results of previous operators. This lets us get away with a simpler design where the compiler generates the execution plan piece by piece, without having to re-write the plan across multiple layers (IR/runtime-logical/runtime-physical).

For example, let's say we're applying a dynamic partitioning optimization to a MapReduce application. First, the compiler generates a partial execution plan for the Map and dynamically added KeyHistogram operators. After that, it can generate the rest of the execution plan for Reduce. We do not need to do any re-writes during the process.
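For illustration, the runtime-dependent piece of the plan can be as small as choosing the Reduce partition count from the collected histogram. This is a hypothetical helper with a made-up threshold parameter, not project code:

```java
import java.util.Map;

// A toy sketch of the decision the KeyHistogram operator enables: once Map
// has run, the key-frequency histogram determines how Reduce is partitioned.
final class PartitionPlanner {
  // Pick enough partitions so that, assuming an even spread, no partition
  // holds more than maxRecordsPerPartition records.
  static int choosePartitionCount(final Map<String, Long> keyHistogram,
                                  final long maxRecordsPerPartition) {
    final long total =
        keyHistogram.values().stream().mapToLong(Long::longValue).sum();
    return (int) Math.max(1, (total + maxRecordsPerPartition - 1)
        / maxRecordsPerPartition);
  }
}
```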

I know Optimus does re-writing, but my feeling is that this has more to do with the system's legacy code than with the actual requirements.

Of course, we might want to do a re-write while executing an operator (e.g., change the number of reducers while executing the reducer), especially for streaming applications. So I guess we need the re-writing mechanism after all? Then we really need to put in the effort to make re-writing across multiple layers easy. What are your thoughts?

@johnyangk
Contributor Author

Hmm... I guess we need the re-writing mechanism after all. Even the Beam applications that we have now (MLR, ALS) have implicit conditional cycles/loops. It'd be great if we could explicitly express the cycles/loops in our IR and apply traditional compiler techniques to them.

@bgchun

bgchun commented Mar 30, 2017

How do we handle cycles/loops currently?
It'd be nice to have cycles/loops in our IR.

@johnyangk
Contributor Author

Currently, we do not have cycles/loops in our IR. I'll file an issue for it.
Note that Beam also does not have cycles/loops in its language, so I suppose we need to identify them ourselves in our Beam frontend.

@bgchun

bgchun commented Mar 30, 2017

Then, how do we handle cycles (iterations) now?

@johnyangk
Contributor Author

Right now, we just have a long DAG with duplicate sets of operators.
Because Beam does not have the concept of a conditional loop, we use a fixed number of iterations for ALS and MLR.
For example, if we have 2 operators in a loop and there are 10 iterations, we simply have a long chain of 20 operators.
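A toy illustration of this unrolling (the operator names are made up):

```java
import java.util.ArrayList;
import java.util.List;

// A fixed-iteration loop body is duplicated once per iteration,
// yielding one long chain of operators.
final class UnrollExample {
  static List<String> unroll(final List<String> body, final int iterations) {
    final List<String> chain = new ArrayList<>();
    for (int i = 0; i < iterations; i++) {
      for (final String op : body) {
        chain.add(op + "#" + i); // the copy of this operator for iteration i
      }
    }
    return chain;
  }

  public static void main(final String[] args) {
    // 2 operators in the loop body, 10 iterations -> a chain of 20 operators.
    System.out.println(unroll(List.of("ComputeGradient", "UpdateModel"), 10).size());
  }
}
```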

@bgchun

bgchun commented Mar 30, 2017

@johnyangk Do you plan to add a loop to the current IR? This could be a good topic to discuss in our meeting.

@johnyangk
Contributor Author

Sure, this is a good discussion topic. If we have concrete optimization techniques to use for loops, then we can make this a high-priority task, since this is something the Optimus paper did not have. However, if we don't, then I think it'd be better to postpone this task.

@bgchun

bgchun commented Mar 31, 2017

If we decide to add a loop, there are two potential approaches (the first one is sketched after this list):

  • Add a high-level loop construct
  • Add a low-level jump/condition construct to create a loop
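For the first option, the construct might wrap a sub-DAG together with a termination condition, rather than exposing raw jump/condition vertices. A rough sketch with hypothetical names:

```java
import java.util.function.Predicate;

// Sketch of a high-level loop construct: the loop body is a nested sub-DAG,
// and a predicate over the loop state decides whether to iterate again.
// All names are hypothetical.
final class SubDag {} // placeholder for a nested DAG of operators

final class LoopVertex<S> {
  final SubDag body;             // operators repeated each iteration
  final Predicate<S> continueIf; // checked on the loop state after each pass
  final int maxIterations;       // safety bound if convergence never triggers

  LoopVertex(final SubDag body, final Predicate<S> continueIf,
             final int maxIterations) {
    this.body = body;
    this.continueIf = continueIf;
    this.maxIterations = maxIterations;
  }
}
```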

johnyangk pushed a commit that referenced this issue Apr 6, 2017
This PR:

- receives optimization policies as a parameter
  - it can be either "pado", "disaggregation", or "runtime_opt" (runtime_opt to be supported through #31)
- receives the parameter as an argument following the program class name argument
- adds tests that run different policies

Resolves #52
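For context, a hypothetical sketch of how a launcher might read the policy argument that follows the program class name; the actual argument handling in the PR may differ:

```java
// Hypothetical launcher argument handling; not the PR's actual code.
public final class PolicyArgExample {
  public static void main(final String[] args) {
    final String programClassName = args[0];
    final String policy = args.length > 1 ? args[1] : "pado"; // assumed default
    switch (policy) {
      case "pado":
      case "disaggregation":
      case "runtime_opt": // to be supported through #31
        System.out.println("Running " + programClassName + " with " + policy);
        break;
      default:
        throw new IllegalArgumentException("Unknown policy: " + policy);
    }
  }
}
```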
@wonook wonook self-assigned this Apr 10, 2017
@wonook
Contributor

wonook commented Apr 10, 2017

Loops will be handled with #121

@johnyangk johnyangk removed the major label Apr 15, 2017
@wonook wonook changed the title from "Interfaces for Runtime Optimization" to "Interfaces for Dynamic Optimization" Jun 11, 2017
@jooykim jooykim added this to the Sailfish on Vortex milestone Jun 28, 2017
@wonook
Contributor

wonook commented Aug 29, 2017

See issues marked with DynOpt

@wonook wonook closed this as completed Aug 29, 2017
wonook added a commit that referenced this issue Feb 28, 2018 (same description as the Apr 6, 2017 commit above)
wonook added a commit that referenced this issue Feb 28, 2018 (same description as the Apr 6, 2017 commit above)