GaMS-Team/LoRA-Translation-Example

LoRA-Translation-Example

This repository contains example scripts for LoRA-based SFT (supervised fine-tuning) of translation models. Scripts are provided for the following two frameworks:

  • Transformers + DeepSpeed: the script is provided in the transformers_training dir
  • Megatron-Bridge: the script is provided in the megatron_bridge_training dir

Data

The data used in the experiments is provided in the data dir. Both frameworks use the same dataset, but it is provided in different formats:

  • The megatron subdir contains data in the "standard messages" format, which is used by Megatron-Bridge.
  • The transformers subdir contains data in the "prompt-completion" format, which is used by the Transformers script.
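To illustrate how the two layouts relate (the field names below are illustrative assumptions, not taken from the repository's actual data files), a single-turn record in the "messages" format can be converted to the "prompt-completion" format like this:

```python
import json

def messages_to_prompt_completion(record):
    """Convert a chat record in "messages" format to "prompt-completion" format.

    Assumes a single-turn record: one user message followed by one
    assistant message. The field names here are illustrative, not
    taken from the repository's data files.
    """
    messages = record["messages"]
    user = next(m["content"] for m in messages if m["role"] == "user")
    assistant = next(m["content"] for m in messages if m["role"] == "assistant")
    return {"prompt": user, "completion": assistant}

# Example record in the "standard messages" layout
record = {
    "messages": [
        {"role": "user", "content": "Translate to Slovene: Good morning."},
        {"role": "assistant", "content": "Dobro jutro."},
    ]
}

print(json.dumps(messages_to_prompt_completion(record)))
```
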

Environment

We use the official NeMo container, version 26.02, for Megatron-Bridge. Warning: we made two modifications to it:

  • /opt/Megatron-Bridge/src/megatron/bridge/data/datasets/utils.py: we added loss masking based on the chat template and messages instead of the generation keyword.
  • /opt/Megatron-Bridge/src/megatron/bridge/models/gemma/gemma3_provider.py: we use the Gemma 3 version from the main branch, which contains some bug fixes.

Both modified files are bind-mounted into the container in the sbatch script.
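The idea behind the loss-masking modification can be sketched as follows. This is a simplified illustration of masking based on roles from the chat template, not the actual patched utils.py: labels for every token outside the assistant turns are set to -100, the value ignored by the cross-entropy loss.

```python
IGNORE_INDEX = -100  # label value ignored by PyTorch cross-entropy loss

def mask_non_assistant_tokens(token_ids, roles):
    """Build training labels that keep the loss only on assistant tokens.

    token_ids: token ids for the whole rendered conversation.
    roles: parallel list giving the role ("user"/"assistant") that
    produced each token. Simplified sketch, not the patched code.
    """
    return [
        tok if role == "assistant" else IGNORE_INDEX
        for tok, role in zip(token_ids, roles)
    ]

token_ids = [5, 6, 7, 8, 9]
roles = ["user", "user", "assistant", "assistant", "user"]
labels = mask_non_assistant_tokens(token_ids, roles)
# labels: [-100, -100, 7, 8, -100]
```
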

For Transformers + DeepSpeed we provide a Singularity recipe to build an image. The recipe is located at singularity/transformers_deepspeed_recipe.def.

Hardware

The scripts are prepared for the LEONARDO Booster partition.
