Skip to content

Conversation

@PT-10
Copy link
Contributor

@PT-10 PT-10 commented Jul 2, 2025

Reference - #473
Included simple question on implementation of SwiGLU activation function as Q151

Copy link
Collaborator

@moe18 moe18 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

had an issue with one of the test cases, also make this question 156

"expected_output": "[[1.7616, 0.2384]]"
},
{
"test": "print(np.round(SwiGLU(np.array([[1, -1, 1000, -1000]])), 4))",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have removed this test in production because the warning makes the test fail
RuntimeWarning: overflow encountered in exp return 1 / (1 + np.exp(-x))

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated

Copy link
Collaborator

@moe18 moe18 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good

@moe18 moe18 merged commit 7939971 into Open-Deep-ML:main Jul 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants