moe

[ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal, Shiwei Liu, Zhangyang Wang

transformer dropout moe self-slimmable

Updated Feb 28, 2023
Python

yo-ru / moe-bot

Sponsor

Star

Meet Moe, a discord bot, written in modern python!

discord discord-bot discord-api moe discord-py

Updated Apr 24, 2023
Python

facebookresearch / AdaTT

Star

pytorch open-source library for the paper "AdaTT Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations"

moe mtl multitask-learning

Updated Aug 15, 2023
Python

phanirithvij / twist.moe

Star

Batch download high quality videos from https://twist.moe

anime moe anime-downloader twist-moe twist-moe-downloader

Updated Sep 30, 2023
Python

xrsrke / pipegoose

Star

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

transformers moe data-parallelism distributed-optimizers model-parallelism megatron mixture-of-experts pipeline-parallelism huggingface-transformers megatron-lm tensor-parallelism large-scale-language-modeling 3d-parallelism zero-1 sequence-parallelism

Updated Dec 14, 2023
Python

open-compass / MixtralKit

Star

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

moe mistral llm

Updated Dec 15, 2023
Python

pjlab-sys4nlp / llama-moe

Star

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

moe llama mixture-of-experts llm continual-pre-training expert-partition

Updated Feb 26, 2024
Python

przemub / anime_quiz

Star

Anime Themes Quiz for people with taste.

anime moe quiz

Updated Mar 24, 2024
Python

sijinkim / MEPSNet_dev

Star

Restoring Spatially-Heterogeneous Distortions using Mixture of Experts Network (ACCV 2020)

moe super-resolution

Updated Mar 30, 2024
Python

ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.

lm moe

Updated Apr 10, 2024
Python

JunweiZheng93 / MATERobot

Star

Official repository for paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments" at ICRA 2024, Best Paper Finalist on Human-Robot Interaction

moe multi-task-learning material-recognition wearable-robot real-time-vit

Updated Apr 11, 2024
Python

mrzjy / expert_choice_visualization_for_mixtral

Star

A simple project that help visualize expert router choices for text generation

visualization router text-generation transformer moe expert huggingface-transformers large-language-models llm-inference mixtral-8x7b

Updated Apr 17, 2024
Python

davidmrau / mixture-of-experts

Star

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

pytorch moe re-implementation mixture-of-experts sparsely-gated-mixture-of-experts

Updated Apr 19, 2024
Python

kyegomez / MHMoE

Sponsor

Star

Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch

machine-learning ai ml transformers artificial-intelligence moe attention chicken

Updated Apr 27, 2024
Python

ymcui / Chinese-Mixtral

Star

中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）

nlp moe 64k mixture-of-experts 32k large-language-models llm mixtral

Updated Apr 30, 2024
Python

ednial0zavlare / MixKABRN

Star

This is the repo for the MixKABRN Neural Network (Mixture of Kolmogorov-Arnold Bit Retentive Networks), and an attempt at first adapting it for training on text, and later adjust it for other modalities.

ai neural-network model architecture moe bitnet mixture-of-experts ai-models llms retnet retentive-network kolmogorov-arnold-networks

Updated May 14, 2024
Python

Improve this page

Add a description, image, and links to the moe topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the moe topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

moe

Here are 30 public repositories matching this topic...

almajiro / dotfiles

YeonwooSung / Pytorch_mixture-of-experts

Fukuda-B / kawaii_voice_gtts

voytex / MoE_network_driver

VITA-Group / Random-MoE-as-Dropout

yo-ru / moe-bot

facebookresearch / AdaTT

phanirithvij / twist.moe

xrsrke / pipegoose

open-compass / MixtralKit

pjlab-sys4nlp / llama-moe

przemub / anime_quiz

sijinkim / MEPSNet_dev

IBM / ModuleFormer

JunweiZheng93 / MATERobot

mrzjy / expert_choice_visualization_for_mixtral

davidmrau / mixture-of-experts

kyegomez / MHMoE

ymcui / Chinese-Mixtral

ednial0zavlare / MixKABRN

Improve this page

Add this topic to your repo