![image](https://raw.githubusercontent.com/IBM/watson-machine-learning-samples/master/cloud/notebooks/headers/watsonx-Prompt_Lab-Notebook.png)
# Prompt Notebook - Prompt Lab Notebook v1.1.0
This notebook contains steps and code to demonstrate inferencing of prompts
generated in Prompt Lab in watsonx.ai. It introduces Python API commands
for authentication with an API key and for prompt inferencing with the WML API.

**Note:** Notebook code generated using Prompt Lab will execute successfully.
If code is modified or reordered, there is no guarantee it will successfully execute.
For details, see: <a href="/docs/content/wsj/analyze-data/fm-prompt-save.html?context=wx" target="_blank">Saving your work in Prompt Lab as a notebook.</a>

Some familiarity with Python is helpful. This notebook uses Python 3.10.

## Notebook goals
The learning goals of this notebook are:

* Defining a Python function for obtaining credentials from the IBM Cloud personal API key
* Defining parameters of the Model object
* Using the Model object to generate a response from the defined model id, parameters, and prompt input

# Setup

## watsonx API connection
This cell defines the credentials required to work with the watsonx API for foundation
model inferencing.

**Action:** Provide the IBM Cloud personal API key. For details, see
<a href="https://cloud.ibm.com/docs/account?topic=account-userapikey&interface=ui" target="_blank">documentation</a>.


In [1]:
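import getpass

# Prompt for the key instead of storing it in the notebook, so it is not leaked when the notebook is shared
apikey = getpass.getpass("Please enter your IBM Cloud API key (hit enter): ")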

In [2]:
import os
import getpass

def get_credentials():
	return {
		"url" : "https://us-south.ml.cloud.ibm.com",
		"apikey" : apikey
	}
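
When the notebook runs unattended, an interactive prompt is not always practical. The following is a minimal sketch of one alternative, not part of the generated notebook: it reads the key from an environment variable and falls back to a prompt. The variable name `WATSONX_APIKEY` and both helper names are assumptions made for this illustration.

```python
import getpass
import os


def resolve_api_key(env=None, prompt_fn=getpass.getpass):
    """Return the API key from the environment, or prompt for it.

    WATSONX_APIKEY is a hypothetical variable name chosen for this sketch.
    """
    env = os.environ if env is None else env
    key = env.get("WATSONX_APIKEY", "").strip()
    if not key:
        # Nothing in the environment: ask interactively without echoing input.
        key = prompt_fn("Please enter your IBM Cloud API key (hit enter): ").strip()
    if not key:
        raise ValueError("No API key provided")
    return key


def build_credentials(api_key, url="https://us-south.ml.cloud.ibm.com"):
    """Build a dictionary with the same shape that get_credentials() returns."""
    return {"url": url, "apikey": api_key}
```

With these helpers, `get_credentials()` could return `build_credentials(resolve_api_key())` so that the key never appears in a saved cell.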

# Inferencing
This section demonstrates how to use the Model object, together with the credentials
defined above, the parameters, and an input string, to obtain
a response from the selected foundation model.

## Defining the model id
We need to specify the model id that will be used for inferencing:


In [3]:
model_id = "mistralai/mistral-large" # Change the model here

# Some models available on watsonx.ai
# meta-llama/llama-3-70b-instruct
# ibm/granite-13b-chat-v2 (this one is weak at Vietnamese)
# meta-llama/llama-2-70b-chat
# ibm/granite-20b-multilingual


## Defining the model parameters
We need to provide a set of model parameters that will influence the
result:

In [4]:
parameters = {
    "decoding_method": "greedy", # Switching to sampling makes the model more "creative"
    "max_new_tokens": 300, # Maximum output length, to guard against an overly long response
    "repetition_penalty": 1 # Range 1-2: 1 applies no penalty for repeated tokens, 2 applies the strongest penalty
}
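
To make the trade-off between greedy decoding and sampling concrete, here is a hedged sketch of a helper that returns either parameter set. The helper name and the specific sampling values (`temperature`, `top_k`, `top_p`) are illustrative assumptions, not values recommended by Prompt Lab; the keys follow the parameter names of the watsonx.ai text generation API.

```python
def make_parameters(creative=False, max_new_tokens=300, repetition_penalty=1.0):
    """Return a parameter dictionary for greedy or sampling decoding.

    The sampling values below are illustrative assumptions for this sketch.
    """
    if not 1.0 <= repetition_penalty <= 2.0:
        raise ValueError("repetition_penalty must be between 1 and 2")
    params = {
        # "greedy" always picks the most likely token; "sample" draws from the distribution.
        "decoding_method": "sample" if creative else "greedy",
        "max_new_tokens": max_new_tokens,
        "repetition_penalty": repetition_penalty,
    }
    if creative:
        # These settings only take effect when sampling.
        params.update({"temperature": 0.7, "top_k": 50, "top_p": 0.9})
    return params
```

Calling `make_parameters()` reproduces the greedy settings used in this notebook, while `make_parameters(creative=True)` switches to sampling.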

## Defining the project id or space id
The API requires a project id or space id that provides the context for the call. The commented-out
lines obtain the id from the environment of the project or space in which this notebook runs; here the project id is set explicitly:

In [5]:
# project_id = os.getenv("PROJECT_ID")
# space_id = os.getenv("SPACE_ID")

project_id = '54be1e83-1d30-46bf-9992-1fedd0038b05'
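
The two approaches in this cell can be combined. Below is a minimal sketch, assuming the `PROJECT_ID` and `SPACE_ID` environment variables from the commented-out lines are set when the notebook runs inside a project or space; the helper name `resolve_scope` and its fallback argument are hypothetical.

```python
import os


def resolve_scope(env=None, fallback_project_id=None):
    """Return {"project_id": ...} or {"space_id": ...} for the API call.

    Order of preference in this sketch: PROJECT_ID from the environment,
    then SPACE_ID from the environment, then an explicitly supplied project id.
    """
    env = os.environ if env is None else env
    if env.get("PROJECT_ID"):
        return {"project_id": env["PROJECT_ID"]}
    if env.get("SPACE_ID"):
        return {"space_id": env["SPACE_ID"]}
    if fallback_project_id:
        return {"project_id": fallback_project_id}
    raise ValueError("Neither a project id nor a space id is available")
```

The resulting dictionary can be unpacked into the `Model` constructor, which accepts either a `project_id` or a `space_id` keyword argument.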

## Defining the Model object
We need to define the Model object using the properties we defined so far:


In [6]:
from ibm_watsonx_ai.foundation_models import Model

model = Model(
	model_id = model_id,
	params = parameters,
	credentials = get_credentials(),
	project_id = project_id
)


## Defining the inferencing input
The foundation model inferencing API accepts a natural language input and uses it
to produce a natural language response. The API is sensitive to formatting: input
structure, the presence of training examples (one-shot, two-shot learning, etc.), and
phrasing all influence the final response. These concerns belong to the emerging discipline of
Prompt Engineering.

Let us provide the input we got from the Prompt Lab:


**Change the prompt here**

In [7]:
# This variable holds the document content to pass to the model as context
search_results = "Our company policy states that all vacations are limited to at most 3 days of paid leave. And each employee are entitled to a total of 7 paid vacation leave days per year."

**The first part of the prompt is the instruction for the model**

In [8]:
prompt_input = f"""You are a chatbot assistant designed to perform RAG use cases. You will be provided with some documents to use as context to answer the user's question. Always answer in an accurate and positive style. Do not provide answers that are hate, abuse, profanity or direct attacks at any groups of ethnicity or race. Make sure your answer is grounded in the given context. 
If the question cannot be answered using the given context, explain and state you cannot answer the question. Do not provide false or made-up information.

Context:
{search_results}

Question: I am going on a 5-day vacation. Can I still be paid?
Answer:"""
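
Since the prompt has a fixed layout (instruction, context, question, answer cue), it can be assembled by a small function when the documents or the question change. This is a sketch under that assumption only; the function name `build_rag_prompt` is hypothetical and the layout simply mirrors the prompt in this cell.

```python
def build_rag_prompt(instruction, documents, question):
    """Assemble a prompt in the layout: instruction, Context, Question, Answer cue."""
    # Drop empty documents and separate the remaining ones with a blank line.
    context = "\n\n".join(doc.strip() for doc in documents if doc.strip())
    return (
        f"{instruction.strip()}\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question.strip()}\n"
        "Answer:"
    )
```

For example, `build_rag_prompt(instruction, [search_results], "I am going on a 5-day vacation. Can I still be paid?")` produces a prompt with the same structure as `prompt_input`, given an `instruction` string holding the first paragraph of the prompt.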


In [9]:
#prompt_input

## Execution
Let us now use the defined Model object, pair it with the input, and
generate the response:


In [10]:
generated_response = model.generate_text(prompt=prompt_input, guardrails=True) # guardrails helps keep the model from producing profanity and similar harmful content
print(generated_response)

 Based on the company policy provided, you are entitled to a maximum of 3 days of paid leave for a single vacation. Therefore, for your 5-day vacation, only 3 days can be paid. The remaining 2 days would be considered unpaid leave. However, please consult with your HR department for the most accurate information as they have the most up-to-date and detailed policies.
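
The response above starts with a leading space, and the call is repeated every time the prompt changes. A small wrapper can tidy both up. The sketch below is an illustration only: `ask` is a hypothetical helper, and `StubModel` is a stand-in that lets the example run without credentials; it is not the real `Model` class.

```python
class StubModel:
    """Hypothetical offline stand-in exposing a generate_text method."""

    def generate_text(self, prompt, guardrails=False):
        # Record the arguments so the call can be inspected; return a canned answer.
        self.last_call = {"prompt": prompt, "guardrails": guardrails}
        return " This is a canned answer. "


def ask(model, prompt, guardrails=True):
    """Send a prompt to any object with generate_text and return trimmed text."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return model.generate_text(prompt=prompt, guardrails=guardrails).strip()
```

In this notebook the same helper would be called as `ask(model, prompt_input)`, with `model` being the Model object defined earlier.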


# Next steps
You successfully completed this notebook! You learned how to use
the watsonx.ai inferencing SDK to generate a response from the foundation model
based on the provided input, model id, and model parameters. Check out the
official watsonx.ai site for more samples, tutorials, documentation, how-tos, and blog posts.

<a id="copyrights"></a>
### Copyrights

Licensed Materials - Copyright © 2023 IBM. This notebook and its source code are released under the terms of the ILAN License.
Use, duplication disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

**Note:** The auto-generated notebooks are subject to the International License Agreement for Non-Warranted Programs (or equivalent) and License Information document for watsonx.ai Auto-generated Notebook (License Terms), such agreements located in the link below. Specifically, the Source Components and Sample Materials clause included in the License Information document for Watson Studio Auto-generated Notebook applies to the auto-generated notebooks.  

By downloading, copying, accessing, or otherwise using the materials, you agree to the <a href="https://www14.software.ibm.com/cgi-bin/weblap/lap.pl?li_formnum=L-AMCU-BYC7LF" target="_blank">License Terms</a>  