Skip to content

Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

Notifications You must be signed in to change notification settings

imagination-research/EEP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

EEP

The official code of paper "Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs". We are still rearranging our code. We will soon release it.

About

Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published