[gpt2pre 1.2] Start End Packer call method #7791

pforderique · 2023-06-28T23:29:27Z

Implements the full StartEndPacker class including it's call() method.
This is PR 2/2 of the full StartEndPacker implementation.

Depends on #7790.

NOTE:
The keras implementation includes a return_padding_mask parameter in the constructor. Setting this True makes the call method return the padded ragged tensor AND a mask of padded tokens as a tuple. This tuple return was not possible in the TypeScript implementation without changing the call type signature to [Tensor|Tensor[], Tensor|Tensor[]]. Therefore, I opted to create a separate method, callAndReturnPaddingMask() that returns just that. The call method simply takes the first result of this method call.

Room for optimization:
Currently, a mask will be calculated every time call is called, even when not needed. Decomposing the function further can help and can be done in the cleanup PR.

mattsoulanille

I have a few suggestions. Nice test coverage!

tfjs-layers/src/layers/nlp/preprocessing/start_end_packer.ts

pforderique · 2023-07-10T20:37:01Z

@mattsoulanille your comments have been addressed. Thanks for the review!

mattsoulanille

LGTM

Linchenn

LGTM!

pforderique added 8 commits June 28, 2023 01:30

Add start end packer class

619ba6f

Merge branch 'main' into start-end-packer

5b7a84a

Fix getConfig test case

1235f7b

Keep undefined values as undefined

ccd1b49

Keep undefined values as undefined

ca77ed2

Implement call.

0731c51

Remove retrunPaddingMask input to preserve call type signature.

5749fd8

Fix missing semicolon.

f2e35d6

pforderique requested review from mattsoulanille and Linchenn June 28, 2023 23:59

pforderique and others added 8 commits July 5, 2023 23:02

Explicitly type declare undefined

03e4bab

Merge branch 'master' into start-end-packer

b082047

merge in start-end-packer

4b02806

TS Style fix

9297013

Merge branch 'main' into start-end-packer

55c2070

Merge branch 'main' into start-end-packer-call

ec94506

Merge branch 'start-end-packer' into start-end-packer-call

318812c

Merge branch 'main' into start-end-packer-call

f899d8e

mattsoulanille requested changes Jul 8, 2023

View reviewed changes

pforderique and others added 4 commits July 10, 2023 11:39

Rename padEnd to ensureLength

61ee0c5

Change Tensor[] return type to Tensor2D

7417a65

Use arraySync()

5ec9a88

Wrap in tf.tidy

49ad7a1

pforderique and others added 3 commits July 10, 2023 13:38

Merge branch 'main' into start-end-packer-call

a0a9fab

Merge branch 'main' into start-end-packer-call

c27f8f5

fix lint issue

456d71d

pforderique mentioned this pull request Jul 10, 2023

[gpt2pre 4] GPT2Preprocessor Layer #7814

Merged

mattsoulanille approved these changes Jul 11, 2023

View reviewed changes

Linchenn approved these changes Jul 11, 2023

View reviewed changes

pforderique merged commit 261037e into tensorflow:master Jul 11, 2023
2 checks passed

pforderique deleted the start-end-packer-call branch July 11, 2023 17:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[gpt2pre 1.2] Start End Packer call method #7791

[gpt2pre 1.2] Start End Packer call method #7791

pforderique commented Jun 28, 2023

mattsoulanille left a comment

pforderique commented Jul 10, 2023

mattsoulanille left a comment

Linchenn left a comment

[gpt2pre 1.2] Start End Packer call method #7791

[gpt2pre 1.2] Start End Packer call method #7791

Conversation

pforderique commented Jun 28, 2023

mattsoulanille left a comment

Choose a reason for hiding this comment

pforderique commented Jul 10, 2023

mattsoulanille left a comment

Choose a reason for hiding this comment

Linchenn left a comment

Choose a reason for hiding this comment