Working on NSA based 1B-2B LLM Pretraining Project..
11785 F25 Team 23
Final Project on NSA based LLM
Popular repositories Loading
Repositories
Showing 3 of 3 repositories
- molf Public
Reference implementation for "Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation"
11785T23/molf’s past year of commit activity - peft Public
Modified version of huggingface/peft (https://github.com/huggingface/peft) for efficient LoRA fine-tuning.
11785T23/peft’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…