phi2 conversion/optimization script #19338

Merged
gh-yewang merged 49 commits into main from wangye/phi2_doc
Feb 5, 2024
Conversation

@gh-yewang
Contributor

@gh-yewang gh-yewang commented Jan 30, 2024

Description

This PR adds:
- an ONNX conversion script for dynamo-exported phi2,
- an optimization script,
- an inference example script.

A readme file is added as documentation. https://github.com/microsoft/onnxruntime/tree/wangye/phi2_doc/onnxruntime/python/tools/transformers/models/phi2#readme

Motivation and Context

@gh-yewang gh-yewang marked this pull request as ready for review January 30, 2024 22:24
Contributor

@github-advanced-security left a comment


CodeQL found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

Contributor

@github-advanced-security left a comment


lintrunner found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

Comment thread onnxruntime/python/tools/symbolic_shape_infer.py Outdated
Comment thread onnxruntime/python/tools/transformers/dynamo_onnx_helper.py Fixed
Comment thread onnxruntime/python/tools/transformers/dynamo_onnx_helper.py Fixed
Comment thread onnxruntime/python/tools/transformers/models/phi2/README.md Outdated
Comment thread onnxruntime/python/tools/transformers/models/phi2/README.md
@gh-yewang
Contributor Author

TODO: add an option to export a model for vllm

Comment thread onnxruntime/python/tools/transformers/models/phi2/README.md Outdated
Comment thread onnxruntime/python/tools/transformers/models/phi2/requirements.txt Outdated

```python
def unroll_function(self, func_name: str) -> None:
    """
    Unrolls the function with the given name in the model.
    """
```
Contributor


Can this be done with the onnx inliner? https://onnx.ai/onnx/api/inliner.html#inline-selected-functions

Contributor Author


Unfortunately, no.

Contributor


Were there any limitations? Could you share more? Thanks!

Contributor Author


First, function unrolling is semantically different from the inliner: the former does not inline functions recursively. Second, I tried the inliner initially, but the output ONNX file was invalid. You can try it yourself with the original model exported from dynamo.

Contributor


Re the first point: I guess you mean you want to do selective inlining (maybe just a single function)? (The inliner API does support that.) Re the second point: where can I find the original model? Thanks!

Contributor


The onnx inliner has a `function_ids` argument to specify which functions to inline by (domain, name). For the invalid output ONNX issue, did you try with the latest onnx release? @wangyems

Contributor Author


@BowenBao I tried `function_ids` at the very beginning a few weeks ago, but let me try again with the main branch.
@gramalingam you can get the original file by running convert_to_onnx.py without arguments: https://github.com/microsoft/onnxruntime/tree/wangye/phi2_doc/onnxruntime/python/tools/transformers/models/phi2#readme

@gh-yewang gh-yewang requested a review from tianleiwu February 2, 2024 21:07
Comment thread onnxruntime/python/tools/transformers/models/phi2/README.md
@gh-yewang gh-yewang merged commit aaf32fb into main Feb 5, 2024
@gh-yewang gh-yewang deleted the wangye/phi2_doc branch February 5, 2024 18:15
gh-yewang added a commit that referenced this pull request Feb 8, 2024
### Description

1. Add an option to export ONNX compatible with ort_vllm. This ensures that the ONNX model only leverages paged attention from vllm. It is intended for internal use, so it is not mentioned in the readme.
2. Add details on ORT installation (#19338 (comment)).

### Motivation and Context

---------

Co-authored-by: wejoncy <wejoncy@163.com>
skottmckay pushed a commit that referenced this pull request Feb 15, 2024
rohan11235813 pushed a commit to quadric-io/onnxruntime that referenced this pull request Aug 19, 2025
rohan11235813 pushed a commit to quadric-io/onnxruntime that referenced this pull request Sep 15, 2025
10 participants