Fix some bug of sequence_parallel #746

GhostScreaming · 2022-09-16T16:11:38Z

Add sequence_parallel option for GPTModel
When mp=1, the sequence_parallel option should always be set to False.
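
The mp=1 rule can be pictured as a small guard applied when the configuration is read. This is only a sketch under assumptions: the helper name `resolve_sequence_parallel` and its parameters `mp_degree` and `sequence_parallel` are hypothetical and are not taken from the FleetX code base.

```python
def resolve_sequence_parallel(mp_degree: int, sequence_parallel: bool) -> bool:
    """Return the effective sequence_parallel setting.

    Hypothetical helper for illustration; not the FleetX implementation.
    """
    if mp_degree < 1:
        raise ValueError("mp_degree must be >= 1")
    # Rule stated in this PR: with mp=1 the option is always forced to False,
    # whatever the user requested.
    if mp_degree == 1:
        return False
    # For mp > 1 the requested value is kept unchanged.
    return sequence_parallel
```

With this guard, a configuration that requests sequence_parallel together with mp=1 would quietly fall back to the non-sequence-parallel path instead of failing later.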

1. Add a sequence parallel strategy for GPTModelHybrid.
2. Outputs have been checked layer by layer in both the forward and backward passes, and the loss curve over the first 5000 steps matches that of the reference run.
3. Performance improves by about 10% with the sequence_parallel strategy compared with pretrain_gpt_1.3B_mp8.
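
The PR does not say how the layer-by-layer check was carried out. One way such a comparison might look is sketched below, assuming each run's per-layer outputs (or gradients) have been dumped as flat lists of floats keyed by layer name. The function name `compare_layer_outputs`, the dictionary layout, and the tolerances are all assumptions made for illustration.

```python
import math


def compare_layer_outputs(reference, candidate, rel_tol=1e-5, abs_tol=1e-6):
    """Return the names of layers whose values differ beyond tolerance.

    Hypothetical helper: `reference` and `candidate` map a layer name to a
    flat list of floats recorded from the two runs being compared.
    """
    mismatched = []
    for name, ref_values in reference.items():
        cand_values = candidate.get(name)
        # A missing layer or a length mismatch counts as a difference.
        if cand_values is None or len(cand_values) != len(ref_values):
            mismatched.append(name)
            continue
        # Compare element by element with relative and absolute tolerances.
        same = all(
            math.isclose(r, c, rel_tol=rel_tol, abs_tol=abs_tol)
            for r, c in zip(ref_values, cand_values)
        )
        if not same:
            mismatched.append(name)
    return mismatched
```

An empty result would indicate that every recorded layer agrees within the chosen tolerances; the same helper could be applied to forward activations and to backward gradients separately.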


ForFishes

LGTM

GhostScreaming added 4 commits September 16, 2022 07:14

Add sequence_parallel_utils.py file

f73dabe

Merge branch 'develop' of https://github.com/PaddlePaddle/FleetX into…

d1cc3b7

… sequence_parallel

Fix some bug of sequence_parallel.

c35dc57

1. Add sequence_parallel option for GPTModel 2. When mp=1, sequence_parallel option should always be set False

ForFishes approved these changes Sep 17, 2022


ForFishes merged commit d6c186d into PaddlePaddle:develop Sep 17, 2022
