add TFParallel.run() API #473

leewyang · 2019-11-04T22:29:55Z

... to simplify parallel execution use-cases. The new API mirrors the TFCluster.run API and map_fn signature. It sets up the Spark executor environment for TensorFlow execution (including GPU allocation) without setting up a TensorFlow cluster (via TF_CONFIG and/or cluster_spec). The TFNodeContext still carries some basic knowledge about the other executors/nodes, for data sharding. This API is an optional helper, specifically for GPU allocation; for CPU use-cases, users can still manually craft their own parallel RDD.mapPartitions() implementations as before.
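To illustrate the data-sharding point, here is a minimal sketch of a map_fn with the `(args, ctx)` shape described above, where each independent node picks its own slice of the input. It does not import the library or Spark: `NodeContext`, its field names `executor_id` and `num_workers`, the `shard` helper, and the `files` argument are all hypothetical stand-ins chosen for illustration, not the actual TFNodeContext definition.

```python
from collections import namedtuple

# Hypothetical stand-in for the context object handed to map_fn.
# The field names are assumptions for illustration only.
NodeContext = namedtuple("NodeContext", ["executor_id", "num_workers"])


def shard(items, ctx):
    """Return the items owned by this node, assigned round-robin by index."""
    return [x for i, x in enumerate(items) if i % ctx.num_workers == ctx.executor_id]


def map_fn(args, ctx):
    """Runs independently on each node; no TensorFlow cluster is assumed."""
    files = shard(args["files"], ctx)
    # A real map_fn would build and run a single-node TensorFlow job
    # over `files` here; this sketch only returns the node's share.
    return files


if __name__ == "__main__":
    args = {"files": ["part-%05d" % i for i in range(7)]}
    for executor_id in range(3):
        print(executor_id, map_fn(args, NodeContext(executor_id, 3)))
```

Because the nodes do not communicate, each one only needs its own index and the total node count to choose a disjoint share of the data.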

I confirm that this contribution is made under the terms of the license found in the root directory of this repository's source tree and that I have the authority necessary to make this contribution on behalf of its copyright owner.


Lee Yang added 2 commits November 4, 2019 13:43

minor edits

8b35d7f

add new TFParallel module to simplify parallel training/inferencing…

531000d

… use-case

anttisaukko approved these changes Nov 5, 2019


leewyang merged commit df468ff into master Nov 5, 2019

leewyang deleted the leewyang_parallel branch November 5, 2019 16:46

3 participants
