A simple project that helps visualize expert router choices for text generation
This is the repo for the MixKABRN Neural Network (Mixture of Kolmogorov-Arnold Bit Retentive Networks): an attempt to first adapt it for training on text, and later adjust it for other modalities.
Community implementation of the paper "Multi-Head Mixture-of-Experts" in PyTorch
Meet Moe, a Discord bot written in modern Python!
Restoring Spatially-Heterogeneous Distortions using Mixture of Experts Network (ACCV 2020)
Official repository for the paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments" (ICRA 2024, Best Paper Finalist in Human-Robot Interaction)
Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts"
[arXiv'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
Implementation of MoE-Mamba from the paper "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in PyTorch and Zeta
PyTorch implementation of moe, which stands for mixture of experts
[ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal, Shiwei Liu, Zhangyang Wang
[Preprint] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
PyTorch open-source library for the paper "AdaTT: Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations"
Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
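Most of the projects listed above revolve around the same core mechanism: a learned router that sends each token to a small subset of expert networks. The following is a minimal, self-contained PyTorch sketch of top-k expert routing for illustration only; the class and parameter names (TopKRouter, MoEFeedForward, num_experts, k) are assumptions, not taken from any repository above.

```python
# Minimal sketch of top-k mixture-of-experts routing (illustrative, not from any listed repo).
import torch
import torch.nn as nn
import torch.nn.functional as F


class TopKRouter(nn.Module):
    """Scores experts per token and keeps the k highest-scoring ones."""

    def __init__(self, d_model: int, num_experts: int, k: int = 2):
        super().__init__()
        self.gate = nn.Linear(d_model, num_experts, bias=False)
        self.k = k

    def forward(self, x: torch.Tensor):
        # x: (batch, seq, d_model) -> per-token logits over experts
        logits = self.gate(x)
        topk_vals, topk_idx = logits.topk(self.k, dim=-1)
        # Normalize only the selected experts' scores into mixing weights
        weights = F.softmax(topk_vals, dim=-1)
        return weights, topk_idx


class MoEFeedForward(nn.Module):
    """Feed-forward experts combined according to the router's choices."""

    def __init__(self, d_model: int, d_hidden: int, num_experts: int, k: int = 2):
        super().__init__()
        self.router = TopKRouter(d_model, num_experts, k)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(), nn.Linear(d_hidden, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        weights, idx = self.router(x)  # weights, idx: (batch, seq, k)
        out = torch.zeros_like(x)
        for slot in range(idx.shape[-1]):  # each of the k chosen experts per token
            for e, expert in enumerate(self.experts):
                mask = idx[..., slot] == e  # tokens routed to expert e in this slot
                if mask.any():
                    out[mask] += weights[..., slot][mask].unsqueeze(-1) * expert(x[mask])
        return out


if __name__ == "__main__":
    layer = MoEFeedForward(d_model=64, d_hidden=128, num_experts=4, k=2)
    tokens = torch.randn(2, 10, 64)
    print(layer(tokens).shape)  # torch.Size([2, 10, 64])
```

The router's weights and indices are also exactly what a visualization tool for expert router choices would plot per token; production implementations typically add a load-balancing auxiliary loss and capacity limits omitted here for brevity.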