# Lab 1: Try Cloud-based LLM API Services

## You will learn:
- Run a program in the cloud for the first time
- Manage API keys
- Use different LLM APIs for the first time
- Use Jupyter Notebook in VSCode (if you haven't used it before)

## 0 Preparations

### 0.1 Dependencies

In [17]:
# requirements.txt lists the basic packages this project needs.
# The default image already has all dependencies installed, so you do not need to install them again if you use it.
#!pip install -r requirements.txt

### 0.2 Saving your API token in a .env file

In [18]:

# Instead of hardcoding the OpenAI API key, use the dotenv package to load it securely from environment variables.
# 
# Instructions:
# 1. Install the dotenv package, if you haven't already, by running: `pip install python-dotenv`
# 2. Create a new file named .env in the root directory of your project. (Never commit it to Git!)
# 3. Store the content of this file as key-value pairs. The .env file is simply a text file with one key-value pair per line, like:
# 
#     # Comment 1
#     KEY1=value1
#     # Comment 2
#     KEY2=value2
# 
# 4. Load the environment variables in your Python code using the dotenv package:
# 
#     from dotenv import load_dotenv
#     import os
#     load_dotenv()
#     openai_api_key = os.environ.get("INFINI_API_KEY")
#     openai_base_url = os.environ.get("INFINI_BASE_URL")
# 
# For more information, see:
# 
# https://pythonjishu.com/ifggzibrpkgavow/ 
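A missing or misspelled key in `.env` otherwise shows up only later, as an authentication error from the API. A minimal sketch of a fail-fast lookup, assuming it is called after `load_dotenv()`; the helper name `require_env` is hypothetical and not part of python-dotenv:

```python
import os


def require_env(name: str) -> str:
    """Return the value of environment variable `name`, or raise if it is unset or empty.

    Hypothetical helper (not part of python-dotenv). Call load_dotenv() first
    so that the entries in .env are visible through os.environ.
    """
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(f"{name} is not set; check the .env file in the project root")
    return value
```

For example, `openai_api_key = require_env("INFINI_API_KEY")` could stand in for the plain `os.environ.get` call.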

## 1 Using the OpenAI API

### 1.1 Get response from a public API server

In [19]:
# This code loads the OpenAI API key and base URL from environment variables using the dotenv package.
# It ensures that sensitive information is not hardcoded in the script, enhancing security.

from dotenv import load_dotenv
import os
load_dotenv()
openai_api_key = os.environ.get("INFINI_API_KEY")
openai_base_url = os.environ.get("INFINI_BASE_URL")

print(openai_base_url)  

In [20]:
from openai import OpenAI
client = OpenAI(api_key=openai_api_key, base_url=openai_base_url)

# You can choose a model from the following list
# Or you can log into your Infini-AI or SiliconFlow account, and find an available model you want to use.
# model = "Qwen/QVQ-72B-Preview"
# model="llama-3.3-70b-instruct"
model="deepseek-r1-distill-qwen-32b"

response = client.chat.completions.create(
  model=model,
  messages=[
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who won the world series in 2020?"},
    {"role": "assistant", "content": "The Los Angeles Dodgers won the World Series in 2020."},
    {"role": "user", "content": "Where was it played?"}
  ]
)
print(response)
print(response.choices[0].message.content)

In [21]:
# Render the response as formatted Markdown
import IPython
IPython.display.Markdown(response.choices[0].message.content)

In [22]:
from openai import OpenAI
client = OpenAI(api_key=openai_api_key, base_url=openai_base_url)

response = client.chat.completions.create(
  model=model,
  messages=[
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who won the world series in 2020?"},
    {"role": "assistant", "content": "The Los Angeles Dodgers won the World Series in 2020."},
    {"role": "user", "content": "Where was it played?"}
  ]
)
IPython.display.Markdown(response.choices[0].message.content)

In [23]:
#### YOUR TASK ####
# You can explore what information the response object contains by printing it and examining it


You can learn more about the OpenAI API from https://platform.openai.com/docs/overview

### 1.2 Your Task: Try to find a question that Llama-3.3 cannot answer

Now that we know how to call a model through the OpenAI API, find a question that llama-3.3-70b-instruct cannot answer, or whose answer obviously needs improvement.

In [24]:
#### YOUR TASK ####
# Find the question


In [25]:
#### YOUR TASK ####
# Using the llama-3.3-70b-instruct model, create a chat response to the prompt above


In [26]:
#### YOUR TASK ####
### TODO: Can you make llama-3.3-70b-instruct answer the question by editing the prompt, for example by adding more examples?

In [27]:
#### YOUR TASK ####
### TODO: Repeat the query with a Qwen variant such as qwen2.5-7b-instruct. Can it answer the question? If not, can you edit the prompt again to improve the answer?
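One way to add examples to a prompt is to pass them as earlier user/assistant turns in `messages`, the same pattern the World Series example in section 1.1 uses. A minimal sketch; the helper name `build_few_shot_messages` and the sample question/answer pairs are illustrative assumptions, not part of the OpenAI SDK:

```python
def build_few_shot_messages(system_prompt, examples, question):
    """Build a chat `messages` list from (question, answer) example pairs.

    Each example becomes a user turn followed by an assistant turn, and the
    real question is appended as the final user turn.
    """
    messages = [{"role": "system", "content": system_prompt}]
    for example_question, example_answer in examples:
        messages.append({"role": "user", "content": example_question})
        messages.append({"role": "assistant", "content": example_answer})
    messages.append({"role": "user", "content": question})
    return messages


# Illustrative examples only; replace them with ones that fit your question.
messages = build_few_shot_messages(
    "You are a helpful assistant.",
    [("How many letters are in 'cat'?", "3"), ("How many letters are in 'apple'?", "5")],
    "How many letters are in 'strawberry'?",
)
```

The resulting list can be passed as the `messages` argument of `client.chat.completions.create`.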

### 1.3 Create a Caesar shift cipher robot

The prompt is already provided. Pay attention to both its instruction and its demonstrations.

In [28]:
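def encode(s):
    """Print s with every character except spaces and , . ! ? shifted forward by one code point."""
    for c in s:
        if c not in ' ,.!?':
            c = chr(ord(c) + 1)  # no wrap-around: 'z' becomes '{'
        print(c, end='')

def decode(s):
    """Print s with every character except spaces and , . ! ? shifted back by one code point."""
    for c in s:
        if c not in ' ,.!?':
            c = chr(ord(c) - 1)
        print(c, end='')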

In [29]:
encode('What is the capital of France?')
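The `encode()` and `decode()` functions above print their result and do not wrap around, so 'z' maps to '{'. A minimal sketch of a variant that returns the string and wraps within the alphabet; the name `caesar_shift` and its `shift` parameter are assumptions for illustration, not part of the lab code:

```python
def caesar_shift(text: str, shift: int = 1) -> str:
    """Return `text` with ASCII letters shifted by `shift`, wrapping within a-z and A-Z.

    Non-letter characters (spaces, digits, punctuation) are left unchanged.
    """
    shifted = []
    for c in text:
        if "a" <= c <= "z":
            base = ord("a")
        elif "A" <= c <= "Z":
            base = ord("A")
        else:
            shifted.append(c)
            continue
        shifted.append(chr(base + (ord(c) - base + shift) % 26))
    return "".join(shifted)


cipher_text = caesar_shift("What is the capital of France?", 1)
plain_text = caesar_shift(cipher_text, -1)  # a negative shift decodes
```

Because the function returns a value, the ciphertext can be stored in a variable and inserted into a prompt, which the printing versions do not allow.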

In [30]:
prompt = """
You are an expert on Caesar Cipher. We will communicate in Caesar. Do not be a translator.

The Caesar Cipher, recognized as one of the pioneer cryptographic methods which ciphertext is to translate each letter of the original text backward by two, and z is translated directly to b. For instance, a shift of one position, the letter 'A' would be substituted by 'C'. you should answer my question in Caesar.

Examples:

User: ipx up nblf b cpnc ?
Assistant: Up nblf b cpnc, zpv gjstu offe up 

User: Xip jt uif qsftjefou pg Dijob ? 
Assistant: Wh Ihmfohmf.

User: Dbo zpv ufmm nf xifsf jt uif dbqjubm pg Dijob ?
Assistant: Cfjkjoh.

User: Dbo zpv ufmm nf xifsf jt uif dbqjubm pg Bnfsjdbo ?
Assistant: Xbtijohupo.

User: Xibu jt uif dbqjubm pg Gsbodf ?"""

In [31]:
#### YOUR TASK ####
# using the llama-3.3-70b-instruct model, create a chat response to the prompt above


In [32]:
#### YOUR TASK ####
### TODO: Print out the ciphertext here
### TODO: Print out the plaintext here using the decode() function


In [33]:
#### YOUR TASK ####
### TODO: Print out the response object and explore its full structure. Print out how many tokens are used in the input and in the output.


In [34]:
#### YOUR TASK ####
### TODO: Repeat the query with another model 'qwen-2.5'.  Do you still get the same response?

In [35]:
#### YOUR TASK ####
### TODO: (optional) Can you get 'qwen-2.5' to print the same response by adding more examples to the prompt?
### Consider using a script to generate a much longer prompt with more examples
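A longer prompt can be generated rather than typed by hand. A minimal sketch that encodes plaintext question/answer pairs with the same one-position shift as `encode()` and formats them in the `User:` / `Assistant:` layout of the prompt above; `shift_one`, `build_cipher_prompt`, the instruction string, and the sample pairs are hypothetical names and data chosen for illustration:

```python
def shift_one(text: str) -> str:
    """String-returning equivalent of the notebook's encode(): shift by one code point."""
    return "".join(c if c in " ,.!?" else chr(ord(c) + 1) for c in text)


def build_cipher_prompt(instruction: str, pairs, question: str) -> str:
    """Format encoded (question, answer) pairs as few-shot examples, then the encoded question."""
    lines = [instruction, "", "Examples:", ""]
    for plain_question, plain_answer in pairs:
        lines.append(f"User: {shift_one(plain_question)}")
        lines.append(f"Assistant: {shift_one(plain_answer)}")
        lines.append("")
    lines.append(f"User: {shift_one(question)}")
    return "\n".join(lines)


# Sample pairs are placeholders; extend this list to lengthen the prompt.
sample_pairs = [
    ("What is the capital of Japan ?", "Tokyo."),
    ("What is the capital of Italy ?", "Rome."),
]
long_prompt = build_cipher_prompt(
    "We will communicate in Caesar cipher.",
    sample_pairs,
    "What is the capital of France ?",
)
```

Generating the examples from plaintext also keeps every demonstration consistent with the shift that `decode()` expects.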

## 2 Try another cloud-based API service: SiliconFlow

In [36]:
#### YOUR TASK ####
### TODO: Try another cloud-based API service, SiliconFlow. 
### Apply for a free API key from SiliconFlow.
### Set up another .env file for the SiliconFlow API key and base URL.


### 2.1 Your task: Use a model of your choice on SiliconFlow to generate two long texts

You can choose any questions, but one should make the LLM generate over 300 words in English, while the other should make it generate at least 300 Chinese characters.
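To check whether a reply meets the length target, English text can be counted in whitespace-separated words and Chinese text in characters. A minimal sketch; the helper names are hypothetical, and the character count covers only the CJK Unified Ideographs block (U+4E00 to U+9FFF), an assumption that ignores punctuation and the rarer extension blocks:

```python
def count_english_words(text: str) -> int:
    """Count whitespace-separated tokens as a rough English word count."""
    return len(text.split())


def count_chinese_characters(text: str) -> int:
    """Count characters in the CJK Unified Ideographs block (U+4E00 to U+9FFF)."""
    return sum(1 for c in text if "\u4e00" <= c <= "\u9fff")
```

Either function can be applied to `response.choices[0].message.content` once a reply has been received.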

In [37]:
#### YOUR TASK ####
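# Write a prompt, for example:
# prompt = '帮我写一篇文章来介绍天安门的背景历史，从古代说到现代，包含很多跟天安门有关系的故事。越长越好，不可以少于1000个字。'
# (English: "Write me an article introducing the background and history of Tiananmen, from ancient to modern times, including many stories related to Tiananmen. The longer the better; it must not be shorter than 1000 characters.")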


In [40]:
#### YOUR TASK ####
# Prepare and call the service using a Chinese prompt
