Cherry Pick for ORT 1.7.0 by oliviajain · Pull Request #6812 · microsoft/onnxruntime

oliviajain · 2021-02-25T19:58:07Z

Cherry-picks

Fix longformer parity and perf regression (#6760)
Adding fp16 support for Einsum Cuda kernel (#6775)
Update DirectML 1.4.1 to 1.4.2 for ORT 1.7 (#6780)
Fix regression in constant folding optimizer (#6795)
Update transformers benchmark for transformers 4.3.* and ORT 1.7 (#6796)
Make keepdims to its default value when adding ReduceMin/ReduceMax for quantization calibration (#6788)
fix issues caused by quantize/calibrate changes (Update TensorRT INT8 resnet example, #6802)

#6735 and #6728 are already in the release branch.




Update transformers benchmark for transformers 4.3.* and ORT 1.7 (#6796)

* update benchmark for transformers 4.* and ORT 1.7
* Fix gpt2 onnx conversion for transformers 4.3.*. Add a check of transformer version >= 3.1.
* remove code related to openmp
* update pretrain model list: keep representative models only


tianleiwu and others added 7 commits February 25, 2021 11:21

Fix longformer parity and perf regression (#6760)

240e0d5

* add fast kernel back, update benchmark and conversion scripts

Adding fp16 support for Einsum Cuda kernel (#6775)

2ebf145

* checkin einsum fp16 support
* remove unnecessary code
* add tests
* add another test

Update DirectML 1.4.1 to 1.4.2 for ORT 1.7 (#6780)

7fbb9c1

Co-authored-by: Ori Levari <orlevari@microsoft.com>

Fix regression in constant folding optimizer (#6795)

b51d37e

Make keepdims to its default value when adding ReduceMin/ReduceMax for quantization calibration (#6788)

02e12cb

* Make keepdims to its default value when adding ReduceMin/ReduceMax
* Fix bug for adding ReduceMin/ReduceMax with keepdims=1
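
For readers unfamiliar with the keepdims attribute mentioned in this commit: in the ONNX operator spec, ReduceMin and ReduceMax default to keepdims=1, which keeps each reduced axis as a dimension of size 1 instead of dropping it. The following is only a pure-Python sketch of that shape behaviour on nested lists. The helper name reduce_min_all and the sample values are hypothetical and are not taken from the calibration code touched by this PR.

```python
def reduce_min_all(tensor, keepdims=1):
    """Minimum over all axes of a nested-list tensor (illustrative only)."""

    def flatten(x):
        # Yield every scalar element of an arbitrarily nested list.
        if isinstance(x, list):
            for item in x:
                yield from flatten(item)
        else:
            yield x

    def rank(x):
        # Number of nested list levels, i.e. the tensor rank.
        r = 0
        while isinstance(x, list):
            r += 1
            x = x[0]
        return r

    result = min(flatten(tensor))
    if keepdims:
        # keepdims=1: every reduced axis survives with size 1.
        for _ in range(rank(tensor)):
            result = [result]
    return result


# Hypothetical activation values for a 2x3 tensor.
activations = [[0.5, -1.25, 3.0], [2.0, 0.0, -0.75]]

kept = reduce_min_all(activations)                 # default keepdims=1, shape (1, 1)
dropped = reduce_min_all(activations, keepdims=0)  # scalar result

print(kept)
print(dropped)
```

Code that consumes the reduced output has to agree with whichever shape is produced, which is one reason the attribute value matters when these nodes are added to a model.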

fix issues caused by quantize/calibrate changes (#6802)

51d6e13

oliviajain added the release:1.7 label Feb 25, 2021

oliviajain requested review from faxu and pranavsharma February 25, 2021 19:58

oliviajain requested a review from a team as a code owner February 25, 2021 19:58

pranavsharma approved these changes Feb 25, 2021


RandySheriffH approved these changes Feb 25, 2021


oliviajain merged commit 4bbea91 into rel-1.7.0 Feb 25, 2021

oliviajain deleted the oljain/cherry_pick branch February 25, 2021 22:27
