-
Notifications
You must be signed in to change notification settings - Fork 95
Issues: microsoft/mup
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Reproducing the validation accuracy vs learning rates curve on ResNet
#67
opened Dec 21, 2023 by
liulei277
_rescale_parameters() inconsistent with the paper for the tied embedding scenario?
#55
opened Jul 12, 2023 by
ofivite
Warmup schedule when changing the number of tokens/steps (GPT-3 experiment detail)
#51
opened Jun 6, 2023 by
sashaDoubov
Are Sequentials with list comprehension handled incorrectly?
#43
opened Apr 17, 2023 by
RobertBaruch
Should
base=None
be used in set_base_shapes
for model used for tuning?
#25
opened Nov 3, 2022 by
callumm-graphcore
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.