add GPT-J ONNX config to Transformers #16274
Conversation
The documentation is not available anymore as the PR was closed or merged.
Thank you for adding this very clean implementation @chainyo 🔥 !
The PR looks good and I've just left a few minor suggestions.
I think one useful check would be to see if you're able to export the model with the LM head and do a simple greedy search decoding with ONNX Runtime of some prompt. We've had some users report issues with TensorRT in #15640 and I'm curious to know if the same exists with ONNX Runtime (I don't think so, but nice to check).
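A sketch of that check might look like the following. The greedy-search loop is the part worth verifying; here the ONNX Runtime call is replaced by a toy next-token function so the example is self-contained (the real session, model path, and input/output tensor names would depend on the actual export and are assumptions).

```python
from typing import Callable, List

def greedy_decode(
    next_token_logits: Callable[[List[int]], List[float]],
    prompt_ids: List[int],
    max_new_tokens: int,
    eos_token_id: int,
) -> List[int]:
    """Greedy search: repeatedly append the argmax token.

    With a real export, `next_token_logits` would wrap an ONNX Runtime
    session call on the last position's logits; the exact feed/fetch
    names depend on how the model was exported.
    """
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        logits = next_token_logits(ids)
        # Pick the highest-scoring token id (greedy step).
        next_id = max(range(len(logits)), key=logits.__getitem__)
        ids.append(next_id)
        if next_id == eos_token_id:
            break
    return ids

# Toy stand-in for the runtime call over a 5-token vocabulary:
# always prefers token (last id + 1) % 5.
def toy_logits(ids: List[int]) -> List[float]:
    target = (ids[-1] + 1) % 5
    return [1.0 if i == target else 0.0 for i in range(5)]

print(greedy_decode(toy_logits, [0], max_new_tokens=3, eos_token_id=4))
# → [0, 1, 2, 3]
```

Running the same loop against the exported model and comparing the generated ids with the PyTorch model's greedy output would surface the kind of divergence reported in #15640.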
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Force-pushed from 2f74d97 to 56a8507
So what is the next step to get it merged?
Hey @chainyo, the last thing to check is that the slow tests pass with: […]
We only run these on […]. Apart from that, the PR looks really good - gently pinging @sgugger or @LysandreJik for final approval
Thanks for your contribution!
* add GPT-J ONNX config to Transformers
* remove token-classification features mapping
* add question-answering features mapping
* add GPT2 config init to GPT2 config + copy shebang for fix-copies

Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
What does this PR do?
I'm looking to contribute to the Transformers repository by adding more OnnxConfig classes for the models available on the Hub. I have created a small organization, ONNXConfig for all, to track the models that need ONNX support.
This is my first contribution since the CamemBERT OnnxConfig some months ago.
I used the GPT2 and GPT-Neo OnnxConfig implementations as examples, but I'm not sure whether everything is correct or whether GPT-J needs anything special to be added. So this PR is a work in progress. If anyone can send me resources to read to understand what it lacks, that would be awesome! 🤗
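For context, the core of such a config is the `inputs` property: an ordered mapping from input tensor names to their dynamic axes, following the pattern of the GPT2/GPT-Neo configs. The snippet below sketches just that mapping with the standard library (the real class subclasses the transformers ONNX config machinery; the axis labels shown follow the GPT2 pattern and are otherwise assumptions).

```python
from collections import OrderedDict
from typing import Dict

def decoder_onnx_inputs() -> "OrderedDict[str, Dict[int, str]]":
    """Dynamic-axes mapping a GPT-style decoder config would declare.

    Axis 0 is the batch dimension and axis 1 the sequence length, so the
    exported graph accepts any batch size and prompt length.
    """
    return OrderedDict(
        [
            ("input_ids", {0: "batch", 1: "sequence"}),
            ("attention_mask", {0: "batch", 1: "sequence"}),
        ]
    )

print(list(decoder_onnx_inputs()))
```

When past key/values are exported as well, the mapping grows extra `past_key_values.*` entries whose sequence axis covers the cached tokens; that is the part where GPT-J might differ from GPT2 and is worth double-checking.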
Who can review?
Models GPT2 / GPT-Neo
@LysandreJik @michaelbenayoun