
Export T5 (encoder-decoder) to ExecuTorch #36486

Open
wants to merge 1 commit into main

Conversation

guangy10
Contributor

@guangy10 guangy10 commented Mar 1, 2025

What does this PR do?

This PR enables exporting the T5 model to ExecuTorch, which many OSS users have asked for.
We need to export the T5 encoder and decoder separately (i.e. to separate .pte files when lowering to ExecuTorch) and compose the encoder and decoder for a specific task (e.g. summarization) in the ExecuTorch runtime. In this PR, I'm demonstrating the implementation in Python.

The T5 encoder is exported with the "encoder_sequence_length" dim being dynamic. The decoder is exported with the "encoder_sequence_length_dim" dim being dynamic and with cache support.
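
For reference, here is a minimal sketch (not the exact code in this PR) of how the encoder side can be exported with a dynamic sequence-length dimension via torch.export; the EncoderWrapper class and the max bound are illustrative assumptions:

import torch
from torch.export import Dim, export
from transformers import AutoTokenizer, T5ForConditionalGeneration

# Illustrative wrapper for the sketch; the PR's Seq2SeqLMExportableModule handles this internally.
class EncoderWrapper(torch.nn.Module):
    def __init__(self, encoder):
        super().__init__()
        self.encoder = encoder

    def forward(self, input_ids):
        return self.encoder(input_ids=input_ids).last_hidden_state

model = T5ForConditionalGeneration.from_pretrained("google-t5/t5-small").eval()
tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-small")
input_ids = tokenizer("summarize: studies have shown ...", return_tensors="pt").input_ids

# Mark dim 1 (sequence length) of input_ids as dynamic; the upper bound is an assumption.
encoder_seq_len = Dim("encoder_sequence_length", max=4096)
exported_encoder = export(
    EncoderWrapper(model.get_encoder()),
    (input_ids,),
    dynamic_shapes={"input_ids": {1: encoder_seq_len}},
)
# The decoder is exported analogously, with its cross-attention input dynamic along
# the same encoder-sequence-length dim and with KV-cache support.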

Tests:

Test export

RUN_SLOW=1 pytest tests/models/t5/test_modeling_t5.py -s -v -k test_export

Test lower to ExecuTorch

In optimum-executorch, patch in this WIP PR: huggingface/optimum-executorch#30
Users can export the T5 model to two separate .pte files (encoder.pte and decoder.pte) and load them to perform the summarization task as simply as follows:

# Assumed import paths for this snippet: ExecuTorchModelForSeq2SeqLM from
# optimum-executorch (per the WIP PR above) and AutoTokenizer from transformers.
from optimum.executorch import ExecuTorchModelForSeq2SeqLM
from transformers import AutoTokenizer

# Export to encoder.pte / decoder.pte with the XNNPACK backend, then run summarization.
model = ExecuTorchModelForSeq2SeqLM.from_pretrained("google-t5/t5-small", recipe="xnnpack")
generated_text = model.text_generation(
    tokenizer=AutoTokenizer.from_pretrained("google-t5/t5-small"),
    prompt="summarize: Simply put, the theory of relativity states that ...",
)
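
Under the hood, composing the two exported programs follows a standard encoder-decoder generation loop: run the encoder once, then call the decoder autoregressively with the cache. A hedged sketch of that flow (greedy decoding) is below; the encoder/decoder callables and their signatures are assumptions for illustration, not the actual optimum-executorch API:

import torch

def greedy_generate(encoder, decoder, tokenizer, prompt, max_new_tokens=64):
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    # Run the exported encoder once; its sequence-length dim is dynamic.
    encoder_hidden_states = encoder(input_ids)
    # T5 starts decoding from the pad token (its decoder_start_token_id).
    next_token = torch.tensor([[tokenizer.pad_token_id]])
    generated = []
    for step in range(max_new_tokens):
        # Assumed decoder signature: one new token per step plus the cache position.
        logits = decoder(next_token, encoder_hidden_states, torch.tensor([step]))
        next_token = torch.argmax(logits[:, -1, :], dim=-1, keepdim=True)
        if next_token.item() == tokenizer.eos_token_id:
            break
        generated.append(next_token.item())
    return tokenizer.decode(generated, skip_special_tokens=True)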

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@ArthurZucker @amyeroberts @qubvel

@guangy10 guangy10 marked this pull request as ready for review March 1, 2025 01:51
@guangy10 guangy10 mentioned this pull request Mar 1, 2025
Member

@qubvel qubvel left a comment

Hi @guangy10! I'm excited to see a new model compatible with ExecuTorch! By the way, can we generalize this approach to other encoder-decoder models to avoid creating a specific *Exportable module for each model?

@guangy10
Contributor Author

guangy10 commented Mar 4, 2025

> Hi @guangy10! I'm excited to see a new model compatible with ExecuTorch! By the way, can we generalize this approach to other encoder-decoder models to avoid creating a specific *Exportable module for each model?

Starting with T5 to ensure the model works end-to-end with ExecuTorch in optimum-executorch, so the second part on the optimum side is WIP. Yes, I think this can be generalized to other encoder-decoder text models.

@guangy10
Contributor Author

guangy10 commented Mar 6, 2025

@qubvel @ArthurZucker I spent a lot of time today looking into other seq2seq LMs like BART and Pegasus. Their decoder code is implemented differently than T5's, and they don't support the Cache object. I'm implementing a different wrapper module for that kind of decoder, but I'm running into a constraint-violation issue that I need more time to look into. I think that can come in a separate PR once it's ready, basically extending the Seq2SeqLMExportableModule I added in this PR. To unblock the optimum-side work, i.e. huggingface/optimum-executorch#30, can I get this PR reviewed? WDYT?

@guangy10
Contributor Author

guangy10 commented Mar 7, 2025

@GregoryComer @tarun292 @larryliu0820 Can you help review this PR? Basically, I want to standardize the way a seq2seq LM should be exported. Later we will need a C++ runtime that can load and run any seq2seq LM as long as it is exported in this standardized way.

@guangy10
Contributor Author

@ArthurZucker @qubvel Can I get this one reviewed?

@guangy10
Contributor Author

@qubvel @ArthurZucker Sharing more research on the model Hub: https://huggingface.co/models?pipeline_tag=text2text-generation&sort=trending. It shows that pretty much all popular and recent Seq2SeqLM variants are T5-based, so IMO enabling the base T5 via this PR provides the highest ROI for users who want to pull a Seq2SeqLM on-device in their applications via ExecuTorch. Here is an example of such a request from a real-world user on Discord: https://discord.com/channels/1334270993966825602/1334270993966825605/1342205676222414908

I have renamed the module to Seq2SeqLMExportableModule, which can be extended to support other seq2seq LMs like BART and Pegasus in the future, as I proposed in my previous comment.

@guangy10
Contributor Author

cc: @tugsbayasgalan

@guangy10
Contributor Author

@ydshieh Do you mind reviewing this PR?
