Megatron Lite

Megatron Lite is an experimental training runtime and model implementation layer for Megatron. The source lives under experimental/lite/megatron/lite, and the public import path is megatron.lite.

Do not import experimental.lite from user code. Examples and public APIs should refer to megatron.lite.

Scope

This initial drop contains:

A lightweight runtime API in megatron.lite.runtime.
Common training primitives in megatron.lite.primitive.
Lite-only model implementations for Qwen3 MoE and Qwen3.5 MoE.
Hugging Face safetensors load/export helpers for the included models.
Megatron-Core optimizer wrapping for the lite runtime.

This initial drop intentionally does not include:

Hybrid model implementations.
Bridge model/runtime implementations.
FSDP2 optimizer primitives.
Benchmark entrypoints or experiment scripts.

Layout

experimental/lite/
  README.md
  docs/                       Design and usage notes
  megatron/
    lite/
      runtime/                Runtime API, config, backend registry, lite backend
      model/                  Model registry and Qwen model implementations
      primitive/              Parallel, checkpoint, optimizer, module, and op primitives

For local source-tree use:

export PYTHONPATH=/path/to/Megatron-LM/experimental/lite:$PYTHONPATH

Public API

from megatron.lite.runtime import LiteConfig, RuntimeConfig, create_runtime

cfg = RuntimeConfig(
    backend="lite",
    hf_path="/path/to/hf-model",
    backend_cfg=LiteConfig(model_name="qwen3", impl="lite"),
)
runtime = create_runtime(cfg)
handle = runtime.build_model()

Model names currently registered by default:

qwen3: Qwen3 MoE lite implementation. HF model_type values qwen3_moe and qwen2_moe resolve to this model name.
qwen3_moe: compatibility alias for the same Qwen3 MoE lite implementation.
qwen3_5: Qwen3.5 MoE lite implementation.

Name		Name	Last commit message	Last commit date
Latest commit History 8,874 Commits
.agents		.agents
.claude		.claude
.github		.github
.gitlab		.gitlab
docker		docker
docs		docs
examples		examples
experimental/lite		experimental/lite
images		images
megatron		megatron
scripts		scripts
skills		skills
tasks		tasks
tests		tests
tools		tools
.coderabbit.yaml		.coderabbit.yaml
.cursorrules		.cursorrules
.flake8		.flake8
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
.python-version		.python-version
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
SECURITY.md		SECURITY.md
codecov.yml		codecov.yml
functional_tests.md		functional_tests.md
gpt_builders.py		gpt_builders.py
greptile.json		greptile.json
hybrid_builders.py		hybrid_builders.py
mamba_builders.py		mamba_builders.py
model_provider.py		model_provider.py
pretrain_bert.py		pretrain_bert.py
pretrain_gpt.py		pretrain_gpt.py
pretrain_hybrid.py		pretrain_hybrid.py
pretrain_mamba.py		pretrain_mamba.py
pretrain_t5.py		pretrain_t5.py
pretrain_vlm.py		pretrain_vlm.py
pyproject.toml		pyproject.toml
setup.py		setup.py
train_rl.py		train_rl.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Megatron Lite

Scope

Layout

Public API

Docs

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Megatron Lite

Scope

Layout

Public API

Docs

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages