Popular repositories Loading
-
deepseek-v4-mini-pytorch
deepseek-v4-mini-pytorch PublicImplement the DeepSeek-V4 architecture from scratch using PyTorch for efficient research, controlled ablations, and study of sparse MoE and long-context mechanisms.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.