Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding support for more model architectures #47

Merged
merged 15 commits into from
Mar 1, 2023
Merged

Conversation

BlackSamorez
Copy link
Owner

No description provided.

@BlackSamorez BlackSamorez self-assigned this Feb 24, 2023
@BlackSamorez BlackSamorez linked an issue Feb 24, 2023 that may be closed by this pull request
5 tasks
@BlackSamorez
Copy link
Owner Author

Added predefined configs for gpt2 and bert.

@BlackSamorez
Copy link
Owner Author

Refactored head splitting such that it can't assign zero heads to a slice. Instead it returns a tensor of zeros as if one head was assigned to it.
It's necessary because tensors of size 0 usually break split, reshapes, etc...

Copy link
Collaborator

@justheuristic justheuristic left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚀

@BlackSamorez BlackSamorez merged commit 2f8c065 into main Mar 1, 2023
@BlackSamorez BlackSamorez changed the title [WIP] Adding support for more model architectures Adding support for more model architectures Mar 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add more predefined configs
2 participants