Official code for the paper "Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs". We are still reorganizing the code and will release it soon.
imagination-research/EEP