In the notebook gpt2-sentiment-control.ipynb (Optimize model section):

`logits = [torch.tensor(output[1]["score"]) for output in sentiment_pipe(texts, **sentiment_pipe_kwargs)]`

Why do we store `output[1]["score"]` as the reward? I assumed this follows the statement "We will use the logits for the positive class as a reward signal for the language model," but does `sentiment_pipe` always place the positive class at index 1 of its output?
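For context, a more robust way to extract the reward would be to look the positive class up by label rather than by position. The sketch below is a hypothetical illustration (not the notebook's actual code), assuming each pipeline output is a list of `{"label": ..., "score": ...}` dicts, as a `transformers` text-classification pipeline returns when all class scores are requested; the notebook would then wrap each score in `torch.tensor` as before.

```python
# Hedged sketch: select the positive-class score by label instead of a
# fixed index. Assumes each per-text output is a list of dicts with
# "label" and "score" keys; positive_scores is a hypothetical helper.

def positive_scores(outputs, positive_label="POSITIVE"):
    """Return the positive-class score for each text, looked up by label."""
    return [
        next(d["score"] for d in per_text if d["label"] == positive_label)
        for per_text in outputs
    ]

# Mock output mimicking sentiment_pipe(texts, **sentiment_pipe_kwargs),
# with the class order deliberately varying between texts:
mock = [
    [{"label": "NEGATIVE", "score": 0.1}, {"label": "POSITIVE", "score": 0.9}],
    [{"label": "POSITIVE", "score": 0.7}, {"label": "NEGATIVE", "score": 0.3}],
]
print(positive_scores(mock))  # → [0.9, 0.7]
```

Selecting by label makes the reward extraction independent of the order in which the pipeline happens to return the class scores.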