Fix the sequence-parallelism for the dense model architecture #4530

Merged: 8 commits, Oct 25, 2023

Commits on Oct 17, 2023

  1. fix the sequence-parallelism for the dense models

    Reza Yazdani committed Oct 17, 2023
    066644d

Commits on Oct 18, 2023

  1. fix the gradient scale for when zero is not enabled

    Reza Yazdani committed Oct 18, 2023
    8d901bf
  2. fix comm group for allreduce

    tohtana committed Oct 18, 2023
    0bb9594
  3. fix format

    tohtana committed Oct 18, 2023
    aaae994

Commits on Oct 21, 2023

  1. 7ae577c

Commits on Oct 25, 2023

  1. 01ccf33
  2. Fix formatting

    samadejacobs committed Oct 25, 2023
    568ae5a
  3. bff46e5