Skip top_p computations when set to 1.0 #8905

odelalleau · 2024-04-12T16:48:42Z

What does this PR do ?

This avoid doing useless computations (that may even filter out some tokens due to numerical approximation) when top_p=1.0 (which is not supposed to have any effect).

Collection: nlp

Changelog

Skip top_p computations when set to 1.0

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation
Optimization

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com>

odelalleau · 2024-04-12T16:49:11Z

jenkins

This is because we otherwise do useless computations, at least until NVIDIA/NeMo#8905 is merged

This is because we otherwise do useless computations, at least until NVIDIA/NeMo#8905 is merged Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com>

yidong72

LGTM.

pablo-garay · 2024-04-13T01:37:09Z

jenkins

pablo-garay · 2024-04-13T22:30:04Z

jenkins

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Signed-off-by: Ao Tang <aot@nvidia.com>

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>

odelalleau requested a review from yidong72 April 12, 2024 16:48

github-actions bot added the NLP label Apr 12, 2024

Skip top_p computations when set to 1.0

22c6f0c

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com>

odelalleau force-pushed the od/top_p branch from 1961c61 to 22c6f0c Compare April 12, 2024 16:49

odelalleau added a commit to NVIDIA/NeMo-Aligner that referenced this pull request Apr 12, 2024

Use top_p=0.0 by default (instead of 1.0)

c66fba2

This is because we otherwise do useless computations, at least until NVIDIA/NeMo#8905 is merged

odelalleau mentioned this pull request Apr 12, 2024

Use top_p=0.0 by default (instead of 1.0) NVIDIA/NeMo-Aligner#152

Open

yidong72 approved these changes Apr 12, 2024

View reviewed changes

Merge branch 'main' into od/top_p

33d2b8d

Merge branch 'main' into od/top_p

d2477b4

odelalleau and others added 2 commits April 19, 2024 16:13

Merge branch 'main' into od/top_p

f16cd2a

Merge branch 'main' into od/top_p

6cd5afa

odelalleau merged commit e0b3fe5 into NVIDIA:main Apr 22, 2024
125 checks passed

odelalleau deleted the od/top_p branch April 22, 2024 20:55

xingyaoww pushed a commit to xingyaoww/NeMo that referenced this pull request Apr 23, 2024

Skip top_p computations when set to 1.0 (NVIDIA#8905)

593484b

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>

alxzhang-amazon pushed a commit to alxzhang-amazon/NeMo that referenced this pull request Apr 26, 2024

Skip top_p computations when set to 1.0 (NVIDIA#8905)

dc27a54

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>

galv pushed a commit to galv/NeMo that referenced this pull request Apr 29, 2024

Skip top_p computations when set to 1.0 (NVIDIA#8905)

34d8878

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>

suiyoubi pushed a commit that referenced this pull request May 2, 2024

Skip top_p computations when set to 1.0 (#8905)

7347e21

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Signed-off-by: Ao Tang <aot@nvidia.com>

rohitrango pushed a commit to rohitrango/NeMo that referenced this pull request Jun 25, 2024

Skip top_p computations when set to 1.0 (NVIDIA#8905)

fee5110

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Skip top_p computations when set to 1.0 #8905

Skip top_p computations when set to 1.0 #8905

odelalleau commented Apr 12, 2024

odelalleau commented Apr 12, 2024

yidong72 left a comment

pablo-garay commented Apr 13, 2024

pablo-garay commented Apr 13, 2024

Skip top_p computations when set to 1.0 #8905

Skip top_p computations when set to 1.0 #8905

Conversation

odelalleau commented Apr 12, 2024

What does this PR do ?

Changelog

Before your PR is "Ready for review"

Who can review?

odelalleau commented Apr 12, 2024

yidong72 left a comment

Choose a reason for hiding this comment

pablo-garay commented Apr 13, 2024

pablo-garay commented Apr 13, 2024