In [1]:
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline, BitsAndBytesConfig



# Load local model, tokenizer, and build generator

In [2]:
model_path = "/media/anna/Samsung_T5/Llama_models/Llama-3.2-1B-Instruct"
bnb_config = BitsAndBytesConfig(load_in_8bit=True)  # quantize to 8-bit to reduce memory use for the demo

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, quantization_config=bnb_config, device_map="auto")
generator = pipeline("text-generation", model=model, tokenizer=tokenizer)

Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.


# Format prompts

In [3]:
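def build_prompt(system_msg: str, user_msg: str) -> str:
    """Assemble a Llama 3 style prompt string from a system message and a user message."""
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n"
        f"{system_msg}\n"
        "<|start_header_id|>user<|end_header_id|>\n"
        f"{user_msg}\n"
        # Leave the assistant turn open so generation continues from here.
        "<|start_header_id|>assistant<|end_header_id|>\n"
    )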

system_prompt_1 = "You are a helpful assistant that is teaching a 5th grade class."
system_prompt_2 = "You are a helpful assistant that is teaching a university level class."
user_input = "Why is the sky blue?"
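
The published Llama 3 instruct layout closes each turn with `<|eot_id|>` and puts a blank line after each header, which `build_prompt` above leaves out. A minimal sketch of that fuller layout follows. `build_prompt_with_eot` is a hypothetical name that is not part of the original notebook, and the outputs recorded in this notebook were produced with the simpler `build_prompt`.

```python
def build_prompt_with_eot(system_msg: str, user_msg: str) -> str:
    """Hypothetical variant of build_prompt that ends each turn with <|eot_id|>."""
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system_msg}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_msg}<|eot_id|>"
        # Leave the assistant turn open so the model writes the reply.
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

example_prompt = build_prompt_with_eot("You are a helpful assistant.", "Why is the sky blue?")
print(example_prompt)
```

If the tokenizer ships a chat template, `tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)` is another way to obtain such a string without hand-writing the special tokens.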


# Compare outputs for different prompts

In [4]:
full_prompt_1 = build_prompt(system_prompt_1, user_input)
response = generator(full_prompt_1, max_new_tokens=256, do_sample=True, temperature=0.7)
print(response[0]["generated_text"])


Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.


<|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are a helpful assistant that is teaching a 5th grade class.
<|start_header_id|>user<|end_header_id|>
Why is the sky blue?
<|start_header_id|>assistant<|end_header_id|>
That's a great question, class!

So, you know how we're always outside on a sunny day and a bright day, right? Well, the sky is actually blue because of a thing called light. It's the light that comes from the sun, and when it hits the Earth, it makes the sky look blue.

But why is the sky blue? Not because of the Earth, or the air or anything like that. The Earth is actually transparent, which means it lets light through. But the air is not. The air is white, because it scatters light. When light hits it, it makes it look white.

Imagine you have a mirror and it and you put a big ball on it. When you put it on it, it looks like it's reflecting light. But if you put it on a dark side, it looks like it's not reflecting. That's because light is scattered. When

In [5]:
full_prompt_2 = build_prompt(system_prompt_2, user_input)
response = generator(full_prompt_2, max_new_tokens=256, do_sample=True, temperature=0.7)
print(response[0]["generated_text"])

Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.


<|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are a helpful assistant that is teaching a university level class.
<|start_header_id|>user<|end_header_id|>
Why is the sky blue?
<|start_header_id|>assistant<|end_header_id|>
The color of the sky is a fascinating topic that has puzzled humans for centuries. There are several theories on why the sky appears blue, and some of them are supported by scientific evidence. Here are some of the main reasons why the sky is blue:

1. **Atmospheric gases**: The atmosphere is composed of various gases such as nitrogen, oxygen, oxygen, and carbon dioxide. These gases scatter shorter wavelengths of light more efficiently than longer wavelengths. This phenomenon is known as Rayleigh scattering. The blue light is scattered more than the red light because it is closer to the wavelength of the gas molecules in the atmosphere.
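
Both printed results repeat the full prompt before the reply, because the pipeline returns the prompt plus the continuation by default. A small sketch of separating the two follows. `split_reply` is a hypothetical helper, and the sample strings are shortened stand-ins for the real prompt and output.

```python
def split_reply(generated_text: str, prompt: str) -> str:
    """Return only the text the model added after the prompt."""
    if generated_text.startswith(prompt):
        return generated_text[len(prompt):].strip()
    # Fall back to the text after the last assistant header if the prompt
    # was not echoed verbatim.
    marker = "<|start_header_id|>assistant<|end_header_id|>"
    return generated_text.rpartition(marker)[2].strip()

demo_prompt = "<|start_header_id|>assistant<|end_header_id|>\n"
demo_output = demo_prompt + "That's a great question, class!"
print(split_reply(demo_output, demo_prompt))
```

Passing `return_full_text=False` to the `generator(...)` call should have the same effect without any post-processing.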


# More complex prompt

In [6]:
system_prompt_3 = '''You are a helpful assistant that is teaching a university level class. 
Please format your output as a json object with fields for relevant metadata.'''
user_input = "Why is the sky blue?"

full_prompt_3 = build_prompt(system_prompt_3, user_input)

In [7]:
# Run generation
response = generator(full_prompt_3, max_new_tokens=256, do_sample=True, temperature=0.7)
print(response[0]["generated_text"])

Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.


<|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are a helpful assistant that is teaching a university level class. 
Please format your output as a json object with fields for relevant metadata.
<|start_header_id|>user<|end_header_id|>
Why is the sky blue?
<|start_header_id|>assistant<|end_header_id|>
{
    "metadata": {
        "subject": "Question 1: Why is the sky blue?",
        "level": "University Level",
        "subjectType": "Question",
        "date": "2023-03-03-10T12",
        "Time": 00:00,
        "instructor": "Your Name",
        "":": "Your Name",
        "email": "assistant.",
        "format": "json
    },
    "answer": "The sky appears blue because of a phenomenon called Rayleigh scattering, in which is a scattering of light by small particles of air molecules in the atmosphere.",
    "context": "Science and Environment",
    "type":Question,
    "answerType": "Why"

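The output above looks like JSON but does not parse: `"Time": 00:00` is not a valid value, the `"format": "json` string is never closed, and generation stops before the final brace. A minimal sketch of checking this with the standard `json` module follows. `try_parse_json` is a hypothetical helper name, and the sample inputs are made up for illustration.

```python
import json
from typing import Optional

def try_parse_json(text: str) -> Optional[dict]:
    """Parse the span from the first '{' to the last '}'; return None if it is not a JSON object."""
    start = text.find("{")
    end = text.rfind("}")
    if start == -1 or end <= start:
        return None
    try:
        parsed = json.loads(text[start:end + 1])
    except json.JSONDecodeError:
        return None
    return parsed if isinstance(parsed, dict) else None

# An invalid fragment modeled on the recorded output, then a valid reply.
print(try_parse_json('{"Time": 00:00, "type": Question}'))
print(try_parse_json('Here you go: {"answer": "Rayleigh scattering"}'))
```

A check like this makes it easy to retry generation, or raise `max_new_tokens`, when the reply is cut off or malformed.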

In [8]:
system_prompt_4 = '''You are a helpful assistant that is teaching a university level class. 
Please format your output as a json object with fields for title, response, timestamp, and keywords.'''
user_input = "Why is the sky blue?"

full_prompt_4 = build_prompt(system_prompt_4, user_input)

In [None]:
# Run generation
response = generator(full_prompt_4, max_new_tokens=256, do_sample=True, temperature=0.7)
print(response[0]["generated_text"])

Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.
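
The system prompt in this last cell names four fields: title, response, timestamp, and keywords. A sketch of checking a parsed reply for them follows. `missing_fields` and the sample dictionary are hypothetical, since no output was recorded for this cell.

```python
REQUIRED_FIELDS = ("title", "response", "timestamp", "keywords")

def missing_fields(reply: dict, required=REQUIRED_FIELDS) -> list:
    """Return the required field names that are absent from reply, in order."""
    return [name for name in required if name not in reply]

# Made-up reply that omits the timestamp field.
sample_reply = {
    "title": "Why is the sky blue?",
    "response": "Shorter wavelengths of sunlight are scattered more strongly by air molecules.",
    "keywords": ["Rayleigh scattering", "atmosphere"],
}
print(missing_fields(sample_reply))
```

Naming the fields explicitly in the prompt gives a fixed schema to test against, which the open-ended "relevant metadata" prompt in the earlier cell does not.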
