MoLE (Mix-of-Language-Experts) is a novel LLM architecture for multilingual programming. Quoting the abstract of our LLM4Code'25 paper:
Large language models (LLMs) have demonstrated impressive capabilities in aiding developers with tasks like code comprehension, generation, and translation. Supporting multilingual programming---i.e., coding tasks across multiple programming languages---typically requires either (1) finetuning a single LLM across all programming languages, which is cost-efficient but sacrifices language-specific specialization and performance, or (2) finetuning separate LLMs for each programming language, which allows for specialization but is computationally expensive and storage-intensive due to the duplication of parameters. This paper introduces MoLE (Mix-of-Language-Experts), a novel architecture that balances efficiency and specialization for multilingual programming. MoLE is composed of a base model, a shared LoRA (low-rank adaptation) module, and a collection of language-specific LoRA modules. These modules are jointly optimized during the finetuning process, enabling effective knowledge sharing and specialization across programming languages. During inference, MoLE automatically routes to the language-specific LoRA module corresponding to the programming language of the code token being generated. Our experiments demonstrate that MoLE achieves greater parameter efficiency compared to training separate language-specific LoRAs, while outperforming a single shared LLM finetuned for all programming languages in terms of accuracy.
This repository contains the code for MoLE.
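For intuition, here is a minimal, hypothetical sketch of the idea described in the abstract (not the code in this repository): a frozen base layer is combined with a shared LoRA module and per-language LoRA modules, and each token's update is routed to the expert matching its programming language. All class and parameter names below are illustrative.

```python
import torch
import torch.nn as nn


class LoRA(nn.Module):
    """Low-rank adapter: B(A(x)), initialized to a zero delta."""

    def __init__(self, d_in: int, d_out: int, rank: int = 8):
        super().__init__()
        self.A = nn.Linear(d_in, rank, bias=False)
        self.B = nn.Linear(rank, d_out, bias=False)
        nn.init.zeros_(self.B.weight)  # adapter starts as a no-op

    def forward(self, x):
        return self.B(self.A(x))


class MoLELinear(nn.Module):
    """Frozen base linear layer + shared LoRA + per-language LoRA experts."""

    def __init__(self, base: nn.Linear, num_languages: int, rank: int = 8):
        super().__init__()
        self.base = base.requires_grad_(False)  # base weights stay frozen
        d_in, d_out = base.in_features, base.out_features
        self.shared = LoRA(d_in, d_out, rank)   # knowledge shared across languages
        self.experts = nn.ModuleList(           # one LoRA per programming language
            LoRA(d_in, d_out, rank) for _ in range(num_languages)
        )

    def forward(self, x, lang_ids):
        # x: (batch, seq, d_in); lang_ids: (batch, seq) integer language labels
        out = self.base(x) + self.shared(x)
        for lang, expert in enumerate(self.experts):
            mask = (lang_ids == lang).unsqueeze(-1)  # route tokens to their expert
            out = out + mask * expert(x)
        return out


# Tiny usage example with random data:
layer = MoLELinear(nn.Linear(16, 16), num_languages=3)
x = torch.randn(2, 5, 16)
lang_ids = torch.randint(0, 3, (2, 5))
y = layer(x, lang_ids)  # shape (2, 5, 16)
```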
- Linux operating system
- Python 3.9+
- CUDA 12.3+
The dependencies of MoLE are listed in `requirements.txt`. You can install them by running the following (preferably inside a virtual environment, e.g., one created with `virtualenv`):
pip install -r requirements.txt
`flash-attn` needs to be installed separately. You can follow the instructions at Dao-AILab/flash-attention.
Run `python3 ./scripts/filter_glaive_dataset.py` to download and filter the dataset from HuggingFace. The result is stored at `./datasets/filtered_glaive`.
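If you want to sanity-check the result, the filtered dataset can be loaded back with the HuggingFace `datasets` library (this assumes the script saves it in the `save_to_disk` on-disk format):

```python
# Quick sanity check of the filtered dataset produced by the step above.
from datasets import load_from_disk

ds = load_from_disk("./datasets/filtered_glaive")
print(ds)  # prints the splits, column names, and number of rows
```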
Create a directory `{dir}` to contain the artifacts. Then:

- For the full baseline, copy `./mole/auto_config.json` into `{dir}`.
- For the LoRA baseline, copy `./mole/lora_config.json` into `{dir}`.
- For MoLE, copy `./mole/moe_config.json` into `{dir}`.
Set `{auto|lora|moe}_config.datasets.path_on_disk` to the full path of the processed dataset, then start training with `python3 ./mole/train_{auto|lora|moe}.py <dir>`.
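For example, a small helper like the following (hypothetical, not part of the repository) can patch that field, assuming the config file is plain JSON with a nested `datasets.path_on_disk` key:

```python
# Hypothetical helper: point the config at the processed dataset.
import json

config_path = "{dir}/moe_config.json"  # or auto_config.json / lora_config.json
with open(config_path) as f:
    config = json.load(f)

config["datasets"]["path_on_disk"] = "/full/path/to/datasets/filtered_glaive"

with open(config_path, "w") as f:
    json.dump(config, f, indent=2)
```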
The tokenizer and finetuned model are saved at `{dir}/final`. Load them using `transformers.AutoTokenizer` and `transformers.AutoModel`.
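For example (replace `{dir}` with your artifact directory; `AutoModel` follows the note above, and you may prefer a task-specific Auto class depending on your use case):

```python
# Load the finetuned tokenizer and model from {dir}/final.
from transformers import AutoModel, AutoTokenizer

model_dir = "{dir}/final"  # replace {dir} with your artifact directory
tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoModel.from_pretrained(model_dir)
```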
If you use MoLE in your work, please cite the following paper:
@inproceedings{ZongETAL25MoLE,
  title={Mix-of-Language-Experts Architecture for Multilingual Programming},
  author={Zong, Yifan and Deng, Yuntian and Nie, Pengyu},
  booktitle={International Workshop on Large Language Models for Code},
  year={2025},
}