Add `localContextPropagation` to Task.Options, implement tracing Local vars #444

leandrob13 · 2017-10-01T02:46:17Z

Fixes #246.

Update:

This PR introduces the concept of Local, inspired by Twitter's Local, which is a ThreadLocal that can be transported over asynchronous boundaries by supporting implementations — in Twitter's case being their Promise implementation.

So we are introducing in monix-execution:

monix.execution.misc.Local — our own implementation of Local, inspired by Twitter's implementation
monix.execution.schedulers.TracingScheduler — wraps any Scheduler reference into an implementation that can transport locals over asynchronous boundaries
monix.eval.TaskLocal: the pure, Task-enabled Local

Besides Local which has an implementation that literally keeps its state into a ThreadLocal, the challenge is to transport these locals over asynchronous boundaries. So we've got:

Task which is now capable of transporting these locals over the async boundaries managed by its own run-loop, provided that it gets executed with Options.localContextPropagation set to true (it's set to false by default)
TracingScheduler which works for Future and all abstractions that need an ExecutionContext for managing their async boundaries

An interesting implementation detail is that Task does not have this propagation enabled by default. This is because, for now at least, this is for users that want the propagation of locals and that know what they are doing.

One way of doing that is to use executeWithOptions:

task.executeWithOptions(_.enableLocalContextPropagation)
  // triggers the actual execution
  .runAsync

Another possibility is to use runAsyncOpt:

implicit val opts = Task.defaultOptions.copy(localContextPropagation = true)

// Options passed implicitly
val f = task.runAsyncOpt

In effect what ThreadLocal is for threads, TaskLocal is for tasks. Full example:

import monix.eval.{Task, TaskLocal}

val local = TaskLocal(0)

val task: Task[Unit] =
  for {
    value1 <- local.read // value1 == 0
    _ <- local.write(100)
    _ <- Task.shift      // async boundary
    value2 <- local.read // value2 == 100
    _ <- Task.shift      // async boundary
    value3 <- local.bind(200)(local.read.map(_ * 2)) // value3 == 200 * 2
    _ <- Task.shift      // async boundary
    value4 <- local.read // value4 == 100
    _ <- local.clear
    _ <- Task.shift      // async boundary
    value5 <- local.read // value5 == 0
  } yield {
    // Should print 0, 100, 400, 100, 0
    println("value1: " + value1)
    println("value2: " + value2)
    println("value3: " + value3)
    println("value4: " + value4)
    println("value5: " + value5)
  }

import monix.execution.Scheduler.Implicits.global
val opts = Task.defaultOptions.enableLocalContextPropagation

// Actual execution
val f = task.runAsyncOpt(global, opts)

This sample doesn't seem like much, but Local is a thread-safe variable that can be used in the context of Future, and TaskLocal is meant for thread-safe and pure variables that can be used in the context of Task.

Original:

Follow-up PR to compare the original TaskRunLoop implementation with the one adapted for Local propagation as proposed in #429. The benchmarks are presented below.

Tested on a 2.5 GHz Intel Dual Core i5, 16GB of RAM, SSD hard disk.
Benchmarks:

Master:

Prev:
TaskFlatMapLongLoopBenchmark.async   10000  thrpt   20   256.402 ±  18.459  ops/s
TaskFlatMapLongLoopBenchmark.eval    10000  thrpt   20  4598.770 ± 134.952  ops/s
TaskFlatMapLongLoopBenchmark.now     10000  thrpt   20  4825.676 ± 121.606  ops/s

Next:
TaskFlatMapLongLoopBenchmark.async   10000  thrpt   20   219.106 ±   8.732  ops/s
TaskFlatMapLongLoopBenchmark.eval    10000  thrpt   20  4609.198 ±  94.487  ops/s
TaskFlatMapLongLoopBenchmark.now     10000  thrpt   20  5270.542 ± 118.069  ops/s

feature/tracedOptions:

Prev:
TaskFlatMapLongLoopBenchmark.async   10000  thrpt   20   288.565 ±  41.238  ops/s
TaskFlatMapLongLoopBenchmark.eval    10000  thrpt   20  4630.486 ±  42.603  ops/s
TaskFlatMapLongLoopBenchmark.now     10000  thrpt   20  4767.912 ± 224.114  ops/s

Next:
TaskFlatMapLongLoopBenchmark.async   10000  thrpt   20   205.929 ±  20.505  ops/s
TaskFlatMapLongLoopBenchmark.eval    10000  thrpt   20  4633.065 ±  71.458  ops/s
TaskFlatMapLongLoopBenchmark.now     10000  thrpt   20  5259.749 ± 108.196  ops/s

…es through a Local

…h trampolined execution

…dRunLoop.startAsFuture

…LocalContext

…monix into feature/LocalContext

… traced scheduler

…LocalContext

codecov · 2017-10-15T00:20:01Z

Codecov Report

Merging #444 into master will increase coverage by 0.1%.
The diff coverage is 97.45%.

@@            Coverage Diff            @@
##           master     #444     +/-   ##
=========================================
+ Coverage   89.21%   89.32%   +0.1%     
=========================================
  Files         351      356      +5     
  Lines        9525     9609     +84     
  Branches     1269     1264      -5     
=========================================
+ Hits         8498     8583     +85     
+ Misses       1027     1026      -1

alexandru

Changes look good, but I prefer that Options parameter to have a default value, not a globally implicit one. This is because we are breaking source compatibility otherwise.

alexandru · 2017-10-15T09:10:03Z