
Add perplexity example to the logprobs user guide #1071

Merged
5 commits merged into main from ankur/logprobs-perplexity-update on Mar 9, 2024

Conversation

ankur-oai (Contributor)

Summary

Adds a section to demonstrate how logprobs can be used to assess model confidence in overall results. The changes introduce the concept of perplexity, add code to calculate it for two examples, and display the output.

Motivation

These changes build on a suggestion at the end of the original guide to use logprobs for different evaluation metrics, filling in a recommended extension.


For new content

When contributing new content, read through our contribution guidelines, and mark the following action items as completed:

  • I have added a new entry in registry.yaml (and, optionally, in authors.yaml) so that my content renders on the cookbook website.
  • I have conducted a self-review of my content based on the contribution guidelines:
    • Relevance: This content is related to building with OpenAI technologies and is useful to others.
    • Uniqueness: I have searched for related examples in the OpenAI Cookbook, and verified that my content offers new insights or unique information compared to existing documentation.
    • Spelling and Grammar: I have checked for spelling or grammatical mistakes.
    • Clarity: I have done a final read-through and verified that my submission is well-organized and easy to understand.
    • Correctness: The information I include is correct and all of my code executes successfully.
    • Completeness: I have explained everything fully, including all necessary references and citations.

We will rate each of these areas on a scale from 1 to 4, and will only accept contributions that score 3 or higher on all areas. Refer to our contribution guidelines for more details.

@jhills20 (Contributor) left a comment

Looks great! Thanks Ankur. Added some small nits. Generally, I think showing each token's probability would be helpful too, maybe just for one sentence or for both, since we could then see which tokens the model is more or less confident in, which might help people grasp this intuitively. Could also adjust the prompt to make the sentences briefer if showing logprobs for every token of the longer sentences is too much.

Generally good stuff tho! Thanks
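A minimal sketch of the per-token view suggested above, assuming the v1 OpenAI Python client with `logprobs=True`; the model name and prompt are illustrative placeholders, not taken from the PR:

```python
import math
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

response = client.chat.completions.create(
    model="gpt-3.5-turbo",  # placeholder model
    messages=[{"role": "user", "content": "In one sentence, why is the sky blue?"}],
    logprobs=True,
)

# Each output token comes back with its logprob; exponentiating gives the
# linear probability, showing which tokens the model was more or less sure of.
for token_info in response.choices[0].logprobs.content:
    print(f"{token_info.token!r}: {math.exp(token_info.logprob):.2%}")
```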

"* Users can easily create a token highlighter using the built in tokenization that comes with enabling `logprobs`. Additionally, the bytes parameter includes the ASCII encoding of each output character, which is particularly useful for reproducing emojis and special characters.\n",
"\n",
"4. Calculating perplexity\n",
"* `logprobs1 can be used to help us assess the model's overall confidence in a result and help us compare the confidence of results from different prompts."

replace 1 with closing `

"4. Token highlighting and outputting bytes\n",
"* Users can easily create a token highlighter using the built in tokenization that comes with enabling `logprobs`. Additionally, the bytes parameter includes the ASCII encoding of each output character, which is particularly useful for reproducing emojis and special characters.\n",
"\n",
"4. Calculating perplexity\n",

change to 5.

"## 5. Conclusion"
"## 5. Calculating perplexity\n",
"\n",
"When looking to assess the model's confidence in a result, it can be useful to calculate perplexity, which is a measure of the uncertainty. Perplexity can be calculated by exponentiating the negative of the average of the logprobs. Generally, a higher perplexity indicates a more uncertain result, and a lower perplexity indicates a more confident result. As such, perplexity can be used to both assess the result of an individual model run and also to helpfully compare the relative confidence of results between model runs. While a high confidence doesn't guarantee result accuracy, it can be a helpful signal that can be paired with other evaluation metrics to build a better understanding of your prompt's behavior.\n",

"is a measure of the model's uncertainty."
.."average of the output logprobs"
would remove "helpfully" before compare
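A minimal sketch of the calculation described in the hunk above (perplexity as the exponentiated negative mean of the output logprobs); the logprob values here are illustrative, not real model output:

```python
import math

# Illustrative output-token logprobs for one completion (not real model output).
logprobs = [-0.12, -0.03, -1.45, -0.27, -0.05]

# perplexity = exp(-mean(logprobs)); lower perplexity -> more confident result
perplexity = math.exp(-sum(logprobs) / len(logprobs))
print(f"perplexity: {perplexity:.3f}")  # ≈ 1.468
```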

@shyamal-anadkat (Collaborator) left a comment

lgtm

"* Users can easily create a token highlighter using the built in tokenization that comes with enabling `logprobs`. Additionally, the bytes parameter includes the ASCII encoding of each output character, which is particularly useful for reproducing emojis and special characters.\n",
"\n",
"4. Calculating perplexity\n",
"* `logprobs1 can be used to help us assess the model's overall confidence in a result and help us compare the confidence of results from different prompts."

nit - fix logprobs formatting

@ankur-oai merged commit ed6194e into main on Mar 9, 2024
@ankur-oai deleted the ankur/logprobs-perplexity-update branch on March 9, 2024 at 00:58