
update en doc for Distributed Training #9130

Merged

Conversation

putcn (Contributor) commented Mar 15, 2018

fix: #8911

abhinavarora (Contributor) previously approved these changes Mar 15, 2018

abhinavarora left a comment:


LGTM! Just a minor typo.

.. image:: src/ps_en.png
:width: 500

- Data shard: the training data is split into multiple partitions; the trainers use these partitions of the whole dataset to do the training job.
- Trainer: each trainer reads its data shard and trains the neural network. The trainer then uploads the calculated "gradients" to the parameter servers and waits for the parameters to be optimized on the parameter-server side. When that finishes, the trainer downloads the optimized parameters and continues its training.
- Parameter server: every parameter server stores part of the whole neural network model's parameters. The parameter servers perform the optimization calculations when gradients are uploaded from trainers, and then send the updated parameters back to the trainers (a toy sketch of this data flow follows the list).
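The following is a hypothetical, self-contained sketch of the data flow described in the list above. It is not PaddlePaddle code: the ``ParameterServer`` class, the ``trainer_step`` function, and the linear-regression problem are illustrative stand-ins, written in plain NumPy, for the real trainer and parameter-server components.

.. code-block:: python

    import numpy as np

    np.random.seed(0)

    # A toy dataset: linear regression y = X @ w_true + noise.
    X = np.random.randn(400, 3)
    w_true = np.array([2.0, -1.0, 0.5])
    y = X @ w_true + 0.01 * np.random.randn(400)

    NUM_TRAINERS = 4

    # Data shard: split the training data into one partition per trainer.
    shards = list(zip(np.array_split(X, NUM_TRAINERS),
                      np.array_split(y, NUM_TRAINERS)))

    class ParameterServer:
        """Stores the model parameters and applies the SGD update."""

        def __init__(self, dim, lr=0.1):
            self.w = np.zeros(dim)
            self.lr = lr

        def apply_gradients(self, grads):
            # Optimization happens on the server: average the uploaded
            # gradients and take one SGD step.
            self.w -= self.lr * np.mean(grads, axis=0)

        def pull(self):
            return self.w.copy()

    def trainer_step(shard, w):
        """Trainer: compute the gradient on its own shard with the current parameters."""
        Xs, ys = shard
        pred = Xs @ w
        return 2.0 * Xs.T @ (pred - ys) / len(ys)  # gradient of mean squared error

    pserver = ParameterServer(dim=3)
    for step in range(100):
        w = pserver.pull()                            # trainers download parameters
        grads = [trainer_step(s, w) for s in shards]  # each trainer uploads its gradient
        pserver.apply_gradients(grads)                # server optimizes, stores new params

    print("learned parameters:", pserver.pull())      # close to w_true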

PaddlePaddle supports both synchronous stochastic gradient descent (SGD) and asynchronous SGD.
The training of synchronous random gradient descent for neural network can be archieved by cooperation of trainers and parameter servers.
abhinavarora commented on this line:

achieved
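To make the synchronous/asynchronous distinction in the quoted paragraph concrete, here is a hypothetical sketch, again in plain NumPy rather than the PaddlePaddle API. The ``sync_sgd_step`` and ``async_sgd_step`` functions are illustrative names: synchronous SGD waits for every trainer and applies one combined update, while asynchronous SGD applies each trainer's gradient as soon as it arrives, so trainers may compute on slightly stale parameters.

.. code-block:: python

    import numpy as np

    def sync_sgd_step(w, trainer_grads, lr=0.1):
        # Barrier: collect gradients from all trainers, then do a single
        # averaged update that every trainer will see next round.
        return w - lr * np.mean(trainer_grads, axis=0)

    def async_sgd_step(w, trainer_grad, lr=0.1):
        # No barrier: update immediately with whichever gradient arrived,
        # while the other trainers keep working on older parameters.
        return w - lr * trainer_grad

    w = np.zeros(3)
    grads = [np.array([0.2, -0.1, 0.3]), np.array([0.1, 0.0, 0.2])]

    w_sync = sync_sgd_step(w, grads)      # one update per round
    w_async = w
    for g in grads:                       # several smaller updates per round
        w_async = async_sgd_step(w_async, g)

    print(w_sync, w_async)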

putcn (Contributor, Author) commented Mar 15, 2018

thanks @abhinavarora, typo fixed.

abhinavarora (Contributor) left a comment:


LGTM!

@abhinavarora abhinavarora merged commit e382e42 into PaddlePaddle:develop Mar 15, 2018
@putcn putcn deleted the translate-distributed-training branch April 25, 2018 00:21

Successfully merging this pull request may close these issues.

Translation Plan - Distributed Training - Chinese-to-English - Overview