@RezaYazdaniAminabadi
Contributor

This PR addresses #581

template <typename T>
class BertTransformerLayer {
public:
BertTransformerLayer(int layer_id,
Contributor

I think `layer_id` could also be declared as unsigned.

@rocm-mici

Can one of the admins verify this patch?

@FarzanT
Contributor

FarzanT commented Dec 16, 2022

Hi, was this issue fixed? LayerNorm combined with DeepSpeed FP16 still seems to be problematic.

@molly-smith molly-smith self-assigned this Aug 18, 2023
@loadams loadams closed this Sep 21, 2023
@loadams loadams reopened this Sep 21, 2023

8 participants