feat: add Magpie TTS CoreML conversion pipeline by Alex-Wengg · Pull Request #24 · FluidInference/mobius

Alex-Wengg · 2026-03-13T19:36:36Z

Summary

Add NVIDIA Magpie TTS Multilingual (357M) CoreML conversion pipeline as a submodule
Complete 4-model pipeline: text encoder, decoder prefill, decoder step (AR), NanoCodec vocoder
9 languages (en, es, de, fr, it, vi, zh, hi, ja), 5 built-in speakers, float16 CoreML
Includes export scripts for embeddings, tokenizers, local transformer weights, and pypinyin/OpenJTalk dictionaries
Pure CoreML inference script (generate_coreml.py) and PyTorch reference (generate_pytorch.py)

Pipeline

Model	Purpose
`text_encoder`	Text → conditioning vectors
`decoder_prefill`	Batch speaker context into KV cache
`decoder_step`	Single AR step with KV cache (~50-200x per utterance)
`nanocodec_decoder`	Codec tokens → 22kHz audio

Source

Model: nvidia/magpie_tts_multilingual_357m
Submodule: smdesai/MagpieTTS

NVIDIA Magpie TTS Multilingual (357M) conversion to CoreML. Pipeline (4 models): - text_encoder: text tokenization and encoding - decoder_prefill: batch speaker context into KV cache - decoder_step: single AR step with KV cache - nanocodec_decoder: codec tokens to 22kHz audio 9 languages (en, es, de, fr, it, vi, zh, hi, ja), 5 speakers. Includes conversion scripts, traceable wrappers, export scripts for embeddings/tokenizers/weights, and CoreML inference script. Source: nvidia/magpie_tts_multilingual_357m

- Fix constants_dir path to use single dirname (script is in coreml/ not coreml/convert/) - Move pyproject.toml and uv.lock to coreml/ directory to follow AGENTS.md structure Fixes Devin review findings: - BUG_pr-review-job-b93938a5c0ea4e7897c7782fcd2dbe59_0002 - BUG_pr-review-job-b93938a5c0ea4e7897c7782fcd2dbe59_0003

All three conversion scripts had incorrect sys.path.insert() that went up two directories instead of one, causing ModuleNotFoundError at runtime. Fixed files: - convert_decoder_prefill.py:18 - convert_decoder_step.py:14 - convert_text_encoder.py:14 Changed from dirname(dirname(__file__)) to dirname(__file__) to correctly resolve to coreml/ directory where traceable/ package lives. Addresses Devin review findings in PR #24

devin-ai-integration

Devin Review found 2 new potential issues.

⚠️ 2 issues in files not directly in the diff

⚠️ README references non-existent `convert/` subdirectory for conversion scripts (`models/tts/magpie/coreml/README.md:79-83`)

The README instructs users to run python convert/convert_text_encoder.py, python convert/convert_decoder_prefill.py, etc. (lines 79-83), and lists a "Conversion Scripts (convert/)" section (line 195). However, these files actually live at the root of the coreml/ directory (e.g., convert_text_encoder.py), not in a convert/ subdirectory — the convert/ directory does not exist. Users following these instructions will get FileNotFoundError. The docstrings inside each convert script also reference the wrong convert/ path (e.g., convert_decoder_prefill.py:7, convert_decoder_step.py:4, convert_nanocodec.py:10, convert_text_encoder.py:4).

⚠️ README references non-existent `extras/` path for export_pypinyin.py (`models/tts/magpie/coreml/README.md:158-160`)

The README Mandarin file table (lines 158-160) references extras/export_pypinyin.py as the generator for mandarin_jieba_dict.json, mandarin_pypinyin_char_dict.json, and mandarin_pypinyin_phrase_dict.json. However, the file is at export_pypinyin.py (root of coreml/), not in an extras/ subdirectory. The script's own docstring (export_pypinyin.py:11) also references the wrong path extras/export_pypinyin.py. Users following these instructions will get a FileNotFoundError.

View 12 additional findings in Devin Review.

- Remove duplicate os.makedirs()/mlmodel.save() in convert_decoder_step.py:90-94 - Fix README to reference coreml/ instead of non-existent convert/ subdirectory - Fix README to reference coreml/export_pypinyin.py instead of extras/ Addresses remaining Devin review findings in PR #24

devin-ai-integration

Devin Review found 1 new potential issue.

View 12 additional findings in Devin Review.

devin-ai-integration · 2026-03-21T02:35:00Z

+        super().__init__()
+        self.snake_channels = original.snake_channels
+        self.snake_act = TraceableSnake(original.snake_act)
+        self.lrelu = nn.LeakyReLU()


🔴 TraceableHalfSnake uses default LeakyReLU slope (0.01) instead of copying the original module's slope

In TraceableHalfSnake.__init__, a fresh nn.LeakyReLU() is created with PyTorch's default negative_slope=0.01, rather than copying the original HalfSnake module's lrelu attribute. NanoCodec is a BigVGAN/HiFi-GAN-based vocoder where the standard LRELU_SLOPE is 0.1 — a 10x difference. The code correctly copies snake_channels and wraps snake_act from original, making the omission of original.lrelu an oversight. Since HalfSnake applies LeakyReLU to half the channels in every activation layer, the wrong slope silently degrades the converted NanoCodec decoder's audio quality.

Suggested change

self.lrelu = nn.LeakyReLU()

self.lrelu = original.lrelu if hasattr(original, 'lrelu') else nn.LeakyReLU()

Was this helpful? React with 👍 or 👎 to provide feedback.

This comment was marked as resolved.

Sign in to view

Alex-Wengg force-pushed the feat/magpie-tts branch from 3be1e15 to d161a71 Compare March 13, 2026 19:45

This comment was marked as resolved.

Sign in to view

Alex-Wengg force-pushed the feat/magpie-tts branch from d161a71 to c04cbcb Compare March 13, 2026 19:50

This comment was marked as resolved.

Sign in to view

Alex-Wengg marked this pull request as draft March 15, 2026 15:17

Alex-Wengg marked this pull request as ready for review March 21, 2026 01:40

This comment was marked as resolved.

Sign in to view

devin-ai-integration Bot reviewed Mar 21, 2026

View reviewed changes

Alex-Wengg merged commit f3e8223 into main Mar 21, 2026

Alex-Wengg deleted the feat/magpie-tts branch March 21, 2026 02:40

Alex-Wengg mentioned this pull request Apr 25, 2026

feat(tts/magpie): add NVIDIA Magpie TTS Multilingual 357M Swift port FluidInference/FluidAudio#541

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Magpie TTS CoreML conversion pipeline#24

feat: add Magpie TTS CoreML conversion pipeline#24
Alex-Wengg merged 4 commits intomainfrom
feat/magpie-tts

Alex-Wengg commented Mar 13, 2026 •

edited by devin-ai-integration Bot

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Mar 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	self.lrelu = nn.LeakyReLU()
	self.lrelu = original.lrelu if hasattr(original, 'lrelu') else nn.LeakyReLU()

Conversation

Alex-Wengg commented Mar 13, 2026 • edited by devin-ai-integration Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Pipeline

Source

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

⚠️ README references non-existent convert/ subdirectory for conversion scripts (models/tts/magpie/coreml/README.md:79-83)

⚠️ README references non-existent extras/ path for export_pypinyin.py (models/tts/magpie/coreml/README.md:158-160)

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Mar 21, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Alex-Wengg commented Mar 13, 2026 •

edited by devin-ai-integration Bot

Loading

⚠️ README references non-existent `convert/` subdirectory for conversion scripts (`models/tts/magpie/coreml/README.md:79-83`)

⚠️ README references non-existent `extras/` path for export_pypinyin.py (`models/tts/magpie/coreml/README.md:158-160`)