Canary greedy and temperature decoding #8885

pzelasko · 2024-04-11T14:10:22Z

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Collection: [Note which collection this PR will affect]

Changelog

Add specific line by line info of high level changes in this PR.

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Jenkins CI

To run Jenkins, a NeMo User with write access must comment jenkins on the PR.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

nemo/collections/asr/parts/submodules/multitask_decoding.py

@@ -103,30 +107,44 @@
        self.preserve_alignments = self.cfg.get('preserve_alignments', None)
        self.compute_langs = self.cfg.get('compute_langs', False)
        self.compute_hypothesis_token_set = self.cfg.get('compute_hypothesis_token_set', False)
+        self.transformer_decoder = transformer_decoder
+        self.log_softmax_module = log_softmax_module
+        self.tokenizer = tokenizer


nemo/collections/asr/parts/submodules/multitask_greedy_decoding.py

stevehuang52 · 2024-04-11T14:29:39Z

nemo/collections/asr/modules/transformer/transformer_generators.py

    ):
        super().__init__()
        self.embedding = embedding
        self.decoder = decoder
        self.log_softmax = log_softmax
+        self.log_softmax.mlp.log_softmax = False


Need make it work for any log_softmax modules, we can't assume that all those modules have log_softmax.mlp.log_softmax

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

nemo/collections/asr/parts/submodules/multitask_greedy_decoding.py

+        self.greedy_search = GreedySequenceGenerator(
+            embedding=transformer_decoder.embedding,
+            decoder=transformer_decoder.decoder,
+            log_softmax=log_softmax_module,
+            max_sequence_length=transformer_decoder.max_sequence_length,
+            bos=tokenizer.bos_id,
+            pad=tokenizer.pad_id,
+            eos=tokenizer.eos_id,
+            max_delta_length=max_generation_delta,
+            temperature=self.temperature,
+            n_samples=n_samples,
+        )


stevehuang52 · 2024-04-16T14:26:05Z

nemo/collections/asr/modules/transformer/transformer_generators.py

    ):
        super().__init__()
        self.embedding = embedding
        self.decoder = decoder
-        self.log_softmax = log_softmax
+        self.classifier = classifier.set_log_softmax_enabled(False)


Shall we add a check to see if the classifier has the set_log_softmax_enabled() function, and if not, we default to not using temperature sampling and print out a warning?

stevehuang52 · 2024-04-16T14:26:48Z

nemo/collections/asr/modules/transformer/transformer_generators.py

@@ -107,8 +112,8 @@ def _one_step_forward(
            decoder_mems_list = self.decoder.forward(
                decoder_hidden_states, decoder_input_mask, decoder_mems_list, return_mems=True
            )
-        log_probs = self.log_softmax.forward(hidden_states=decoder_mems_list[-1][:, -1:])
-        return log_probs, decoder_mems_list
+        logits = self.classifier.forward(hidden_states=decoder_mems_list[-1][:, -1:], temperature=self.temperature)


shall we add a check to see if the forward function has temperature arg?

stevehuang52 · 2024-04-16T14:34:35Z

nemo/collections/asr/modules/transformer/transformer_generators.py

+            samples = list(tgt.view(orig_batch_size, self.n_samples, -1))
+            tgt = tgt[:: self.n_samples]
+
+        return tgt, samples


since we're adding an additional output samples, do we also need to update the __call__ function where the _forward function is called?

github-actions · 2024-05-01T01:46:49Z

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions · 2024-05-10T01:46:13Z

This PR was closed because it has been inactive for 7 days since being marked as stale.

pzelasko added 4 commits April 10, 2024 12:14

Greedy and temperature sampling decoding for Canary/multi-task

60cafd9

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

Enable changing multitask decoding strategy

b3b23dd

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

fix various bugs and support temperature sampling n samples

e5a82a2

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

fix greedy non-temperature decoding

5ad0e69

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

pzelasko requested review from stevehuang52 and zhehuaichen April 11, 2024 14:10

github-actions bot added the ASR label Apr 11, 2024

github-advanced-security bot found potential problems Apr 11, 2024

View reviewed changes

stevehuang52 reviewed Apr 11, 2024

View reviewed changes

Refactor + unit tests

8e6b220

Signed-off-by: Piotr Żelasko <petezor@gmail.com>

github-actions bot added the common label Apr 12, 2024

github-advanced-security bot found potential problems Apr 12, 2024

View reviewed changes

stevehuang52 reviewed Apr 16, 2024

View reviewed changes

github-actions bot added the stale label May 1, 2024

github-actions bot closed this May 10, 2024

pzelasko mentioned this pull request Jul 31, 2024

Fix Canary not stripping prompt from reference + more test coverage #9987

Merged

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Canary greedy and temperature decoding #8885

Canary greedy and temperature decoding #8885

pzelasko commented Apr 11, 2024

stevehuang52 Apr 11, 2024

stevehuang52 Apr 16, 2024

stevehuang52 Apr 16, 2024

stevehuang52 Apr 16, 2024

github-actions bot commented May 1, 2024

github-actions bot commented May 10, 2024

Canary greedy and temperature decoding #8885

Canary greedy and temperature decoding #8885

Conversation

pzelasko commented Apr 11, 2024

What does this PR do ?

Changelog

Usage

Jenkins CI

Before your PR is "Ready for review"

Who can review?

Additional Information

stevehuang52 Apr 11, 2024

Choose a reason for hiding this comment

stevehuang52 Apr 16, 2024

Choose a reason for hiding this comment

stevehuang52 Apr 16, 2024

Choose a reason for hiding this comment

stevehuang52 Apr 16, 2024

Choose a reason for hiding this comment

github-actions bot commented May 1, 2024

github-actions bot commented May 10, 2024