<a href="https://colab.research.google.com/github/royam0820/prompt-engineering/blob/main/l2_guidelines.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Principal Guidelines for Prompting
In this lesson, you'll practice two prompting principles and their related tactics in order to write effective prompts for large language models.




Here are the main guidelines for creating effective prompts, explained in French:

1. **Be specific**: The more specific your prompt, the more relevant the AI ​​response will be.

2. **Provide context**: Provide basic information to help the AI ​​understand the situation.

3. **Use clear language**: Avoid ambiguity and complex sentences.

4. **Structure your requests**: Organize your questions or instructions logically.

5. **Set the desired response format**: Indicate whether you want a list, paragraph, etc.

6. **Include examples**: If possible, provide examples of expected responses.

7. **Specify the tone and style**: Specify whether you want a formal, friendly, technical, etc. response.

8. **Use role instructions**: Ask the AI ​​to adopt a specific role if necessary.

9. **Ask follow-up questions**: Break complex tasks into several steps.

10. **Iterate and refine**: Don’t hesitate to adjust your prompts based on the results you get.

By following these guidelines, you can significantly improve the quality and relevance of AI-generated responses.

For this lesson, we are using the OPENAI api key.
To get it, here is the [url](https://platform.openai.com/api-keys)

## Setup
loading libraries and openai api key.

In [1]:
!pip install openai

Collecting openai
  Downloading openai-1.35.3-py3-none-any.whl (327 kB)
[2K     [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m327.4/327.4 kB[0m [31m5.8 MB/s[0m eta [36m0:00:00[0m
Collecting httpx<1,>=0.23.0 (from openai)
  Downloading httpx-0.27.0-py3-none-any.whl (75 kB)
[2K     [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m75.6/75.6 kB[0m [31m8.5 MB/s[0m eta [36m0:00:00[0m
Collecting httpcore==1.* (from httpx<1,>=0.23.0->openai)
  Downloading httpcore-1.0.5-py3-none-any.whl (77 kB)
[2K     [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m77.9/77.9 kB[0m [31m6.4 MB/s[0m eta [36m0:00:00[0m
[?25hCollecting h11<0.15,>=0.13 (from httpcore==1.*->httpx<1,>=0.23.0->openai)
  Downloading h11-0.14.0-py3-none-any.whl (58 kB)
[2K     [90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━[0m [32m58.3/58.3 kB[0m [31m7.9 MB/s[0m eta [36m0:00:00[0m
Installing collected packages: h11, httpcore, httpx, openai
Successfully installed h11-0.14.0 httpcore-1.0.5 ht

In [2]:
import openai
import os
from google.colab import userdata


In [3]:
from pprint import pprint


In [4]:
print(openai.__version__)

1.35.3


NB:  we will be this openai version for the whole course sessions on prompt engineering.


In [5]:
openai_api_key = userdata.get("OPENAI_API_KEY")
os.environ["OPENAI_API_KEY"] = openai_api_key

#### helper function
Throughout this course, we will use OpenAI's `gpt-3.5-turbo` model and the [chat completions endpoint](https://platform.openai.com/docs/guides/chat).

This helper function will make it easier to use prompts and look at the generated outputs.  
**Note**: In June 2023, OpenAI updated gpt-3.5-turbo. The results you see in the notebook may be slightly different than those in the video. Some of the prompts have also been slightly modified to product the desired results.

In [None]:
# older openai version == 0.27.0
# def get_completion(prompt, model="gpt-3.5-turbo"):
#     messages = [{"role": "user", "content": prompt}]
#     response = openai.ChatCompletion.create(
#         model=model,
#         messages=messages,
#         temperature=0, # this is the degree of randomness of the model's output
#     )
#     return response.choices[0].message["content"]

In [8]:
# the helper function

# initializing the OpenAI client
client = openai.OpenAI()

def get_completion(prompt, model="gpt-3.5-turbo"):
    messages = [{"role": "user", "content": prompt}]
    response = client.chat.completions.create(
        model=model,
        messages=messages,
        temperature=0
    )
    return response.choices[0].message.content

NB: `messages = [{"role": "user", "content": prompt}]` Each message is a dictionary with "role" and "content" keys. Here, it's setting up a single message with the role "user" and the content being the provided prompt.

## Prompting Delimiters

### Triple Quotes
Useful for defining long strings or multiline inputs.


```
prompt = """
This is an example of using triple quotes.
"""

```



### Triple Backticks
Often used to denote code blocks or formatted text.



```
prompt = '''

```



### Brackets
Curly braces {}, square brackets [], or angle brackets <> can be used to enclose specific parts of the prompt.


```
prompt = "This is an example of using {curly braces}."

```



### Parentheses
To encapsulate specific elements.



```
prompt = "Please (enter your response here)."

```



### Dashes or Hyphens
To separate sections or emphasize certain parts.


```
prompt = "Introduction: --- Body: --- Conclusion: ---"

```



### Colons and Semicolons
To introduce lists or separate statements.



```
prompt = "Items: Apple, Banana, Cherry;"

```



### Asterisks or Stars
For bullet points or emphasis.



```
prompt = "* This is a bullet point."
```




### Pipes
Vertical bars can be used for list-like structures.



```
prompt = "Option 1 | Option 2 | Option 3"

```



### Slashes
To separate alternatives or paths.


```
prompt = "Choose one: option1/option2/option3"

```



### Arrows
To indicate directions or flows.



```
prompt = "Start -> Step 1 -> Step 2 -> End"

```



## Prompting Principles
- **Principle 1: Write clear and specific instructions**
- **Principle 2: Give the model time to “think”**

### Tactics

#### Tactic 1: Use delimiters to clearly indicate distinct parts of the input
- Delimiters can be anything like: ```
, `"""`, `< >`, `<tag> </tag>`, `:`

In [None]:
# the text to summarize
text = f"""
You should express what you want a model to do by \
providing instructions that are as clear and \
specific as you can possibly make them. \
This will guide the model towards the desired output, \
and reduce the chances of receiving irrelevant \
or incorrect responses. Don't confuse writing a \
clear prompt with writing a short prompt. \
In many cases, longer prompts provide more clarity \
and context for the model, which can lead to \
more detailed and relevant outputs.
"""

# the prompt indicating the text above for summarization
prompt = f"""
Summarize the text delimited by triple backticks \
into a single sentence.
```{text}```
"""

# get the response
response = get_completion(prompt)
pprint(response)

('Providing clear and specific instructions to a model is essential for '
 'guiding it towards the desired output and reducing the chances of irrelevant '
 'or incorrect responses, even if longer prompts are necessary for clarity and '
 'context.')


NB: by formating the prompt this way, we avoid **prompt injection** (that is giving conflicting information to the model) as we specially specified, in another paragraph using `'''` that only the `text` paragraph should be summarized and not anything else.

#### Tactic 2: Ask for a structured output
- JSON, HTML

In [None]:
prompt = f"""
Generate a list of three made-up book titles along \
with their authors and genres.
Provide them in JSON format with the following keys:
book_id, title, author, genre.
"""
response = get_completion(prompt)
print(response)

[
    {
        "book_id": 1,
        "title": "The Midnight Garden",
        "author": "Elena Rivers",
        "genre": "Fantasy"
    },
    {
        "book_id": 2,
        "title": "Echoes of the Past",
        "author": "Nathan Black",
        "genre": "Mystery"
    },
    {
        "book_id": 3,
        "title": "Whispers in the Wind",
        "author": "Samantha Reed",
        "genre": "Romance"
    }
]


NB: we have three fictitious book titles formatted in a json structured as specified, the nice thing with this format is that you can read it in a Python's dictionary or a list. See example below.

In [None]:
import json

json_data = response

data = json.loads(json_data)
print(data)


[{'book_id': 1, 'title': 'The Midnight Garden', 'author': 'Elena Rivers', 'genre': 'Fantasy'}, {'book_id': 2, 'title': 'Echoes of the Past', 'author': 'Nathan Black', 'genre': 'Mystery'}, {'book_id': 3, 'title': 'Whispers in the Wind', 'author': 'Samantha Reed', 'genre': 'Romance'}]


#### Tactic 3: Ask the model to check whether conditions are satisfied

In [None]:
# example of writing specific instructions

# Text with instructions - making a cup of tea
text_1 = f"""
Making a cup of tea is easy! First, you need to get some \
water boiling. While that's happening, \
grab a cup and put a tea bag in it. Once the water is \
hot enough, just pour it over the tea bag. \
Let it sit for a bit so the tea can steep. After a \
few minutes, take out the tea bag. If you \
like, you can add some sugar or milk to taste. \
And that's it! You've got yourself a delicious \
cup of tea to enjoy.
"""
prompt = f"""
You will be provided with text delimited by triple quotes.
If it contains a sequence of instructions, \
re-write those instructions in the following format:

Step 1 - ...
Step 2 - …
…
Step N - …

If the text does not contain a sequence of instructions, \
then simply write \"No steps provided.\"

\"\"\"{text_1}\"\"\"
"""
response = get_completion(prompt)
print("Completion for Text 1:")
print(response)

Completion for Text 1:
Step 1 - Get some water boiling.
Step 2 - Grab a cup and put a tea bag in it.
Step 3 - Pour the hot water over the tea bag.
Step 4 - Let the tea steep for a few minutes.
Step 5 - Remove the tea bag.
Step 6 - Add sugar or milk to taste.
Step 7 - Enjoy your delicious cup of tea.


NB1: This prompt explains that the provided text (enclosed in triple quotes) (`"""`)should be checked to see if it contains a sequence of instructions. If it does, it should reformat them into a step-by-step list. If not, it should return "No steps provided."

NB2: Escaping Triple Quotes. `\"\"\"{text_1}\"\"\"` To include triple quotes inside another string, you need to escape each double-quote character with a backslash (`\"`). This is necessary to ensure that the triple quotes are treated as part of the string content and not as the end of the string definition.

In [None]:
# text without instructions
text_2 = f"""
The sun is shining brightly today, and the birds are \
singing. It's a beautiful day to go for a \
walk in the park. The flowers are blooming, and the \
trees are swaying gently in the breeze. People \
are out and about, enjoying the lovely weather. \
Some are having picnics, while others are playing \
games or simply relaxing on the grass. It's a \
perfect day to spend time outdoors and appreciate the \
beauty of nature.
"""
prompt = f"""
You will be provided with text delimited by triple quotes.
If it contains a sequence of instructions, \
re-write those instructions in the following format:

Step 1 - ...
Step 2 - …
…
Step N - …

If the text does not contain a sequence of instructions, \
then simply write \"No steps provided.\"

\"\"\"{text_2}\"\"\"
"""
response = get_completion(prompt)
print("Completion for Text 2:")
print(response)

Completion for Text 2:
No steps provided.


NB: the prompt was not able to find instructions from the prompt provided.

#### Tactic 4: "Few-shot" prompting

Few-shot prompting is a technique to guide the model in generating desired responses. This is done by providing the model with a few examples (*shots*) of the *input-output pairs* before asking it to generate an output for a new input. The goal is to give the model a sense of the pattern or format it should follow when producing its response. See example below :



```
Convert the following sentences into polite form:

Example 1:
Input: Close the door.
Output: Could you please close the door?

Example 2:
Input: Give me the report.
Output: Could you please give me the report?

Example 3:
Input: Stop talking.
Output: Could you please stop talking?

Now, convert the following sentence:

Input: Pass the salt.
Output:
```



#### Few-shot Prompting Applications

**Text Transformation**: Rewriting sentences in a specific tone, style, or format.

**Question Answering**: Providing examples of how questions should be answered.

**Summarization**: Showing examples of how to condense information into summaries.

**Translation**: Giving pairs of source and target language sentences to guide translations.

**Text Transformation**: Rewriting sentences in a specific tone, style, or format.


In [None]:
# Example of a conversation between a child and his/her grandparent
# the grandparent answer the child with some kind of methaphore
# so the prompt answer will apply this tone for the future answer from the child.
prompt = f"""
Your task is to answer in a consistent style.

<child>: Teach me about patience.

<grandparent>: The river that carves the deepest \
valley flows from a modest spring; the \
grandest symphony originates from a single note; \
the most intricate tapestry begins with a solitary thread.

<child>: Teach me about resilience.
"""
response = get_completion(prompt)
pprint(response)

('<grandparent>: Just as a tree bends but does not break in a storm, '
 'resilience is the ability to bounce back from adversity. It is the strength '
 'to persevere in the face of challenges and setbacks, knowing that every '
 'trial is an opportunity for growth.')


### Principle 2: Give the model time to “think”

#### Tactic 1: Specify the steps required to complete a task

In [9]:
# giving the model time to think
text = f"""
In a charming village, siblings Jack and Jill set out on \
a quest to fetch water from a hilltop \
well. As they climbed, singing joyfully, misfortune \
struck—Jack tripped on a stone and tumbled \
down the hill, with Jill following suit. \
Though slightly battered, the pair returned home to \
comforting embraces. Despite the mishap, \
their adventurous spirits remained undimmed, and they \
continued exploring with delight.
"""
# example 1
prompt_1 = f"""
Perform the following actions:
1 - Summarize the following text delimited by triple \
backticks with 1 sentence.
2 - Translate the summary into French.
3 - List each name in the French summary.
4 - Output a json object that contains the following \
keys: french_summary, num_names.

Separate your answers with line breaks.

Text:
```{text}```
"""
response = get_completion(prompt_1)
print("Completion for prompt 1:")
pprint(response)

Completion for prompt 1:
('1 - Jack and Jill go on a quest to fetch water from a well, but encounter '
 'misfortune on the way back home.\n'
 '\n'
 "2 - Jack et Jill partent en quête d'eau d'un puits, mais rencontrent un "
 'malheur sur le chemin du retour.\n'
 '\n'
 '3 - Jack, Jill\n'
 '\n'
 '4 - \n'
 '{\n'
 '  "french_summary": "Jack et Jill partent en quête d\'eau d\'un puits, mais '
 'rencontrent un malheur sur le chemin du retour.",\n'
 '  "num_names": 2\n'
 '}')


NB: the text ouput has line breaks (`\n`) as specified in the prompt.

#### Ask for output in a specified format

In [None]:
prompt_2 = f"""
Your task is to perform the following actions:
1 - Summarize the following text delimited by
  <> with 1 sentence.
2 - Translate the summary into French.
3 - List each name in the French summary.
4 - Output a json object that contains the
  following keys: french_summary, num_names.

Use the following format:
Text: <text to summarize>
Summary: <summary>
Translation: <summary translation>
Names: <list of names in summary>
Output JSON: <json with summary and num_names>

Text: <{text}>
"""
response = get_completion(prompt_2)
print("\nCompletion for prompt 2:")
print(response)


Completion for prompt 2:
Summary: Jack and Jill, two siblings, go on a quest to fetch water from a well on a hill, but encounter misfortune along the way.
Translation: Jack et Jill, deux frères et sœurs, partent en quête d'eau d'un puits sur une colline, mais rencontrent des malheurs en chemin.
Names: Jack, Jill
Output JSON: {"french_summary": "Jack et Jill, deux frères et sœurs, partent en quête d'eau d'un puits sur une colline, mais rencontrent des malheurs en chemin.", "num_names": 2}


NB: in the example above, we are using `< >` to make reference to our variable text instead of `{ }`.  You can use any delimiter you like that finally will make sense to the model.

#### Tactic 2: Instruct the model to work out its own solution before rushing to a conclusion

In [None]:
prompt = f"""
Determine if the student's solution is correct or not.

Question:
I'm building a solar power installation and I need \
 help working out the financials.
- Land costs $100 / square foot
- I can buy solar panels for $250 / square foot
- I negotiated a contract for maintenance that will cost \
me a flat $100k per year, and an additional $10 / square \
foot
What is the total cost for the first year of operations
as a function of the number of square feet.

Student's Solution:
Let x be the size of the installation in square feet.
Costs:
1. Land cost: 100x
2. Solar panel cost: 250x
3. Maintenance cost: 100,000 + 100x
Total cost: 100x + 250x + 100,000 + 100x = 450x + 100,000
"""
response = get_completion(prompt)
pprint(response)

("The student's solution is correct. The total cost for the first year of "
 'operations as a function of the number of square feet is indeed 450x + '
 '100,000.')


NB: Actually the student's solution is not correct !
We can fix this by instructing the model to work out its own solution first.

In [None]:
prompt = f"""
Your task is to determine if the student's solution \
is correct or not.
To solve the problem do the following:
- First, work out your own solution to the problem including the final total.
- Then compare your solution to the student's solution \
and evaluate if the student's solution is correct or not.
Don't decide if the student's solution is correct until
you have done the problem yourself.

Use the following format:
Question:
```
question here
```
Student's solution:
```
student's solution here
```
Actual solution:
```
steps to work out the solution and your solution here
```
Is the student's solution the same as actual solution \
just calculated:
```
yes or no
```
Student grade:
```
correct or incorrect
```

Question:
```
I'm building a solar power installation and I need help \
working out the financials.
- Land costs $100 / square foot
- I can buy solar panels for $250 / square foot
- I negotiated a contract for maintenance that will cost \
me a flat $100k per year, and an additional $10 / square \
foot
What is the total cost for the first year of operations \
as a function of the number of square feet.
```
Student's solution:
```
Let x be the size of the installation in square feet.
Costs:
1. Land cost: 100x
2. Solar panel cost: 250x
3. Maintenance cost: 100,000 + 100x
Total cost: 100x + 250x + 100,000 + 100x = 450x + 100,000
```
Actual solution:
"""
response = get_completion(prompt)
print(response)

The actual solution is correct.

Actual solution:
Total cost = Land cost + Solar panel cost + Maintenance cost
Total cost = $100x + $250x + $100,000 + $10x
Total cost = $360x + $100,000

Is the student's solution the same as actual solution just calculated:
```
No
```
Student grade:
```
Incorrect
```


## Model Limitations: Hallucinations
- Boie is a real company, the product name is not real.

In [None]:
prompt = f"""
Tell me about AeroGlide UltraSlim Smart Toothbrush by Boie
"""
response = get_completion(prompt)
pprint(response)

('The AeroGlide UltraSlim Smart Toothbrush by Boie is a high-tech toothbrush '
 'designed to provide a superior cleaning experience. It features a slim and '
 'sleek design that makes it easy to hold and maneuver in the mouth. The '
 'toothbrush is equipped with smart technology that tracks your brushing '
 'habits and provides real-time feedback to help you improve your oral hygiene '
 'routine.\n'
 '\n'
 'The AeroGlide UltraSlim Smart Toothbrush also has soft, tapered bristles '
 'that are gentle on the gums and teeth, making it suitable for those with '
 'sensitive mouths. The bristles are made from a durable and hygienic material '
 'that is resistant to bacteria growth, ensuring a clean and healthy brushing '
 'experience.\n'
 '\n'
 'Overall, the AeroGlide UltraSlim Smart Toothbrush by Boie is a cutting-edge '
 'toothbrush that combines advanced technology with high-quality materials to '
 'provide a superior cleaning experience for users.')


## Try experimenting on your own!

In [None]:
prompt = f"""
Donne moi des infos à propos de cette voiture supercar de Peugeot
"""
response = get_completion(prompt)
pprint(response)

('La supercar de Peugeot dont vous parlez est probablement la Peugeot 908 RC, '
 'un concept car dévoilé en 2006. Cette voiture de sport de luxe était équipée '
 "d'un moteur V12 diesel de 5,5 litres développant 700 chevaux, ce qui lui "
 "permettait d'atteindre une vitesse maximale de 300 km/h. La Peugeot 908 RC "
 "était également dotée d'un design élégant et futuriste, avec des portes "
 'papillon et un intérieur luxueux en cuir et en aluminium. Malheureusement, '
 "ce concept car n'a jamais été produit en série et reste un modèle unique.")


In [None]:
prompt = f"""
Qui a écrit le livre "Les Mondes d'Aldébaran" ?
"""
response = get_completion(prompt)
pprint(response)

('Le livre "Les Mondes d\'Aldébaran" a été écrit par le scénariste et '
 'dessinateur de bande dessinée français, Leo.')


#### Notes on using the OpenAI API outside of this classroom

To install the OpenAI Python library:
```
!pip install openai
```

The library needs to be configured with your account's secret key, which is available on the [website](https://platform.openai.com/account/api-keys).

You can either set it as the `OPENAI_API_KEY` environment variable before using the library:
 ```
 !export OPENAI_API_KEY='sk-...'
 ```

Or, set `openai.api_key` to its value:

```
import openai
openai.api_key = "sk-..."
```

#### A note about the backslash
- In the course, we are using a backslash `\` to make the text fit on the screen without inserting newline '\n' characters.
- GPT-3 isn't really affected whether you insert newline characters or not.  But when working with LLMs in general, you may consider whether newline characters in your prompt may affect the model's performance.

Here is a more detailed explanation:

Using backslashes () instead of line returns in prompts can be beneficial for several reasons:

- Readability in code: It allows you to write long prompts across multiple lines in your code without actually inserting line breaks into the prompt text itself. This makes your code more readable while keeping the prompt as a single continuous string.
- Consistency: Some AI models might interpret actual line breaks differently, potentially affecting the output. Using \ ensures the prompt is processed as one continuous piece of text.
- Easier string manipulation: When the entire prompt is on one logical line in your code, it's often easier to perform string operations or modifications if needed.
- Avoiding unintended formatting: In some contexts, actual line breaks might be interpreted as formatting (e.g., in Markdown), which could alter the meaning of your prompt.-