Skip to content
This repository has been archived by the owner on Sep 12, 2023. It is now read-only.

New Models #8

Open
StellaAthena opened this issue Feb 3, 2022 · 0 comments
Open

New Models #8

StellaAthena opened this issue Feb 3, 2022 · 0 comments

Comments

@StellaAthena
Copy link

GPT-NeoX 20B is a new language model by EleutherAI trained on the Pile. It is a decoder model that is competent at both English and generating code.

The training code can be found here and the model weights will be released next week. It was trained using PyTorch and DeepSpeed on 96 A100 @ CoreWeave. You can find the compute states here... sorry, you'll have to do the math for the total compute yo

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant