Remove unnecessary constants for > 2GB ONNX models #1808

fxmarty · 2024-04-10T14:51:36Z

For example, an export directory currently looks like this:

(hf-inf) fxmarty@huggingface:~/qwen_onnx$ ls
added_tokens.json       merges.txt                                                   model.onnx               tokenizer_config.json
config.json             _model_layers.0_self_attn_rotary_emb_Constant_5_attr__value  model.onnx_data          tokenizer.json
generation_config.json  _model_layers.0_self_attn_rotary_emb_Constant_attr__value    special_tokens_map.json  vocab.json

The tensors `_model_layers.0_self_attn_rotary_emb_Constant_5_attr__value` and `_model_layers.0_self_attn_rotary_emb_Constant_attr__value` are already stored in `model.onnx_data`, so these standalone files are redundant.
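One way to check an export directory for leftover files like these is sketched below. This is not the code changed in this PR. It assumes the `onnx` package is installed, that the leftovers can be recognized by the `_attr__value` suffix seen in the listing above (a heuristic), and that only the top-level graph needs scanning (subgraphs are ignored). The helper names `referenced_external_files` and `stray_constant_files` are hypothetical.

```python
import os
from typing import Iterable, List, Set


def referenced_external_files(onnx_path: str) -> Set[str]:
    """Collect the external-data file names that the ONNX graph points to."""
    import onnx  # imported lazily so the pure helper below works without onnx

    # Load only the graph structure; do not pull the large weights into memory.
    model = onnx.load(onnx_path, load_external_data=False)

    tensors = list(model.graph.initializer)
    # Constant nodes can also carry a tensor in one of their attributes.
    for node in model.graph.node:
        for attr in node.attribute:
            if attr.type == onnx.AttributeProto.TENSOR:
                tensors.append(attr.t)

    locations = set()
    for tensor in tensors:
        if tensor.data_location == onnx.TensorProto.EXTERNAL:
            for entry in tensor.external_data:
                if entry.key == "location":
                    locations.add(entry.value)
    return locations


def stray_constant_files(file_names: Iterable[str], referenced: Set[str]) -> List[str]:
    """Return files that look like per-constant dumps but are not referenced.

    Assumption: leftover constants end with "_attr__value", as in the listing.
    """
    return sorted(
        name
        for name in file_names
        if name.endswith("_attr__value") and name not in referenced
    )
```

A possible usage, assuming the export lives in `~/qwen_onnx`:

```python
export_dir = os.path.expanduser("~/qwen_onnx")
referenced = referenced_external_files(os.path.join(export_dir, "model.onnx"))
print(stray_constant_files(os.listdir(export_dir), referenced))
```

Any file reported this way is only a candidate for removal; it is worth confirming the model still loads before deleting anything.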

HuggingFaceDocBuilderDev · 2024-04-10T15:12:10Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

* remove some more unnecessary constants for > 2GB ONNX models
* remove typo

remove some more unnecessary constants for > 2GB ONNX models

bc08ab4

fxmarty requested review from echarlaix, regisss and michaelbenayoun April 10, 2024 14:51

fxmarty mentioned this pull request Apr 10, 2024

The exported ONNX model of Qwen/Qwen1.5-0.5B-Chat does not produce a cache-enabled model. #1747

Closed

4 tasks

regisss approved these changes Apr 10, 2024

remove typo

88db4fa

fxmarty merged commit 4936662 into huggingface:main Apr 10, 2024
41 of 46 checks passed

young-developer pushed a commit to young-developer/optimum that referenced this pull request May 10, 2024

Remove unnecessary constants for > 2GB ONNX models (huggingface#1808)

2e721b3

* remove some more unnecessary constants for > 2GB ONNX models
* remove typo
