GPT2 one step beam search update with configuration support#7425

Merged
tianleiwu merged 25 commits into microsoft:master from xi-liu-ds:xi-liu-ds/beam_search_update
Apr 29, 2021

Conversation

@xi-liu-ds
Member

@xi-liu-ds xi-liu-ds commented Apr 22, 2021

Description: This PR updates one-step beam search to support early stopping, temperature, repetition penalty, length penalty, excluded token ids, and sampling in the ONNX graph.

Motivation and Context

  • Customers would like configuration support in beam search. This PR fulfills that purpose by adding a GPT2LMHeadModel_BeamSearchStepConfiguration class that builds the following configuration options directly into the ONNX compute graph.
    • early stopping of finished beams
    • temperature
    • repetition penalty
    • length penalty
    • excluded token ids
    • sampling
    • ignore end of sentence token in model inference
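As a rough sketch (not the PR's actual graph implementation; the helper name and constants below are illustrative), options like temperature, repetition penalty, and excluded token ids are typically applied to the model's next-token logits before each beam-search step:

```python
import numpy as np

def shape_logits(logits, generated_ids, temperature=1.0,
                 repetition_penalty=1.0, excluded_token_ids=()):
    """Hypothetical helper showing how temperature, repetition penalty,
    and excluded token ids commonly reshape next-token logits."""
    logits = logits / temperature  # >1 flattens, <1 sharpens the distribution
    for tid in set(generated_ids):
        # CTRL-style repetition penalty: discourage already-generated tokens.
        if logits[tid] > 0:
            logits[tid] /= repetition_penalty
        else:
            logits[tid] *= repetition_penalty
    for tid in excluded_token_ids:
        logits[tid] = -1e4  # effectively ban excluded tokens
    return logits
```

In the PR these transformations are expressed as ONNX graph operators rather than Python, so they run inside the exported model.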

How to run
In bash/CMD/PowerShell, run:

python onnxruntime/python/tools/transformers/convert_to_onnx.py \
--model_name_or_path=[path/to/model/folder] --model_class=GPT2LMHeadModel_BeamSearchStepConfiguration \
--output=[path/to/output/onnx_file_name] -o --precision=int8

Optionally, add --input_test_file=[path/to/test/file] and set configuration flags (--ignore_eos, --repetition_penalty, --temperature, --excluded_token_ids, --length_penalty, --do_sample, --do_sample_top_p, --do_sample_top_k).

@xi-liu-ds xi-liu-ds requested a review from a team as a code owner April 22, 2021 21:37
@xi-liu-ds xi-liu-ds changed the title GPT2 one step beam search update with early stopping GPT2 one step beam search update with configuration support Apr 22, 2021
Comment thread onnxruntime/python/tools/transformers/convert_to_onnx.py Outdated
@tianleiwu
Contributor

Please add some tests. For example, in test_gpt2.py?

@microsoft microsoft deleted a comment from xi-liu-ds Apr 23, 2021
Comment thread onnxruntime/python/tools/transformers/convert_to_onnx.py Outdated
def top_k_top_p_filtering(log_probs, top_p=1.0, top_k=0):
'''Set tail event (out of top_p) to a big negative number'''
sorted_log_probs, sorted_indices = torch.sort(log_probs, descending=True)
cumulative_probs = torch.cumsum(sorted_log_probs.exp(), dim=-1)
Contributor

@tianleiwu tianleiwu Apr 23, 2021

CumSum for float16 needs ONNX opset version 14. Users will encounter issues with float16 models right now since PyTorch and ORT do not support opset 14 yet.

Member Author

@xi-liu-ds xi-liu-ds Apr 27, 2021

I have tested with PyTorch and ORT 1.7.0 and it works fine. Could you clarify? Thanks
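For reference, the top-k / top-p (nucleus) filtering quoted at the start of this thread can be sketched in plain numpy (an illustrative re-implementation; the PR itself uses torch, and this is not its exact code):

```python
import numpy as np

def top_k_top_p_filtering(log_probs, top_p=1.0, top_k=0, filter_value=-1e4):
    """Mask tokens outside the top-k set and outside the nucleus whose
    cumulative probability just reaches top_p (sketch, not the PR's code)."""
    log_probs = log_probs.copy()
    if top_k > 0:
        # Mask everything below the k-th highest log-probability.
        kth = np.sort(log_probs)[-top_k]
        log_probs[log_probs < kth] = filter_value
    if top_p < 1.0:
        order = np.argsort(log_probs)[::-1]      # descending sort indices
        cumulative = np.cumsum(np.exp(log_probs[order]))
        remove = cumulative > top_p              # tail of the nucleus
        remove[1:] = remove[:-1].copy()          # keep the first token crossing top_p
        remove[0] = False                        # always keep the best token
        log_probs[order[remove]] = filter_value
    return log_probs
```

The reviewer's concern applies to the cumulative-sum step: in the exported graph that torch.cumsum becomes an ONNX CumSum node, which only gained float16 support at opset 14.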

@xi-liu-ds
Member Author

Please add some tests. For example, in test_gpt2.py?

Done

Comment thread onnxruntime/python/tools/transformers/benchmark_gpt2.py Outdated