## Welcome to Lab 3 for Week 1 Day 4

Today we're going to build something with immediate value!

In the folder `me` I've put a single file `linkedin.pdf` - it's a PDF download of my LinkedIn profile.

Please replace it with yours!

I've also made a file called `summary.txt` in the same folder.

We're not going to use tools just yet; we'll add them tomorrow.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/tools.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Looking up packages</h2>
            <span style="color:#00bfff;">In this lab, we're going to use the wonderful Gradio package for building quick UIs,
            and we're also going to use the popular PyPDF PDF reader. You can get guides to these packages by asking
            ChatGPT or Claude, and you can find open-source Python packages on the PyPI repository at <a href="https://pypi.org">https://pypi.org</a>.
            </span>
        </td>
    </tr>
</table>

In [56]:
# If you don't know what any of these packages do - you can always ask ChatGPT for a guide!

from dotenv import load_dotenv
from openai import AzureOpenAI
from pypdf import PdfReader
import gradio as gr
import os

# Helper function to create Azure OpenAI client from .env - allows openai = OpenAI() to work
def OpenAI():
    return AzureOpenAI(
        azure_endpoint=os.getenv("AZURE_OPENAI_ENDPOINT"),
        api_key=os.getenv("AZURE_OPENAI_API_KEY"),
        api_version=os.getenv("AZURE_OPENAI_API_VERSION")
    )

In [81]:
load_dotenv(override=True)
openai = OpenAI()

In [58]:
reader = PdfReader("me/linkedin.pdf")
linkedin = ""
for page in reader.pages:
    text = page.extract_text()
    if text:
        linkedin += text

In [59]:
print(linkedin)

   
Contact
daniel1957000@gmail.com
www.linkedin.com/in/ddavid37
(LinkedIn)
Top Skills
PyTorch
Federated Learning
Project Management
Languages
Hebrew (Native or Bilingual)
English (Full Professional)
Certifications
Preventing Workplace Harassment -
Fundamentals Office 2025
Honors-Awards
PTK Honor Society
Renaissance Scholars Honors
Program Scholarship
Montgomery College Semester
Dean’s List - Spring 2023, Fall 2023,
Spring 2024
Daniel David
Columbia University | Rhino Federated Computing
New York, New York, United States
Summary
Enthusiastic software engineer with a strong passion for data
science and machine learning applications across various domains. 
Currently a Machine Learning Engineer in the Customer Success
team at Rhino Federated Computing and a senior student majoring
in Computer Science at Columbia University.
Experience
Columbia University Department of Computer Science
Teaching Assistant
January 2026 - Present (2 months)
COMSW4995_008 - Machine Learning Security
Rhino Fed

In [60]:
with open("me/summary.txt", "r", encoding="utf-8") as f:
    summary = f.read()

In [61]:
name = "Daniel David"

In [62]:
system_prompt = f"You are acting as {name}. You are answering questions on {name}'s website, \
particularly questions related to {name}'s career, background, skills and experience. \
Your responsibility is to represent {name} for interactions on the website as faithfully as possible. \
You are given a summary of {name}'s background and LinkedIn profile which you can use to answer questions. \
Be professional and engaging, as if talking to a potential client or future employer who came across the website. \
If you don't know the answer, say so."

system_prompt += f"\n\n## Summary:\n{summary}\n\n## LinkedIn Profile:\n{linkedin}\n\n"
system_prompt += f"With this context, please chat with the user, always staying in character as {name}."


In [63]:
system_prompt

"You are acting as Daniel David. You are answering questions on Daniel David's website, particularly questions related to Daniel David's career, background, skills and experience. Your responsibility is to represent Daniel David for interactions on the website as faithfully as possible. You are given a summary of Daniel David's background and LinkedIn profile which you can use to answer questions. Be professional and engaging, as if talking to a potential client or future employer who came across the website. If you don't know the answer, say so.\n\n## Summary:\nEnthusiastic software engineer with a strong passion for data science and machine learning applications across various domains. \n\nCurrently a Machine Learning Engineer in the Customer Success team at Rhino Federated Computing and a senior student majoring in Computer Science at Columbia University.\n\n## LinkedIn Profile:\n\xa0 \xa0\nContact\ndaniel1957000@gmail.com\nwww.linkedin.com/in/ddavid37\n(LinkedIn)\nTop Skills\nPyTor

In [64]:
def chat(message, history):
    messages = [{"role": "system", "content": system_prompt}] + history + [{"role": "user", "content": message}]
    response = openai.chat.completions.create(model=os.getenv("AZURE_OPENAI_DEPLOYMENT_NAME"), messages=messages)
    return response.choices[0].message.content

## Special note for people not using OpenAI

Some providers, like Groq, might give an error when you send your second message in the chat.

This is because Gradio shoves some extra fields into the history object. OpenAI doesn't mind; but some other models complain.

If this happens, the solution is to add the following as the first line of the `chat()` function above. It strips the history down to the `role` and `content` fields:

```python
history = [{"role": h["role"], "content": h["content"]} for h in history]
```

You may need to add this in other chat() callback functions in the future, too.
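
As a minimal sketch of what that line does, the snippet below wraps the same comprehension in a helper and runs it on a made-up history. The name `clean_history` and the `metadata` and `options` fields are illustrative assumptions; the extra fields Gradio actually sends may differ by version.

```python
def clean_history(history):
    """Keep only the 'role' and 'content' keys of each history entry."""
    return [{"role": h["role"], "content": h["content"]} for h in history]


# Hypothetical history with extra fields, for illustration only
sample_history = [
    {"role": "user", "content": "Hi", "metadata": None, "options": None},
    {"role": "assistant", "content": "Hello!", "metadata": {"title": None}, "options": None},
]

cleaned = clean_history(sample_history)
print(cleaned)
```

Inside `chat()`, the equivalent would be to reassign `history` before building `messages`.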

In [42]:
gr.ChatInterface(chat, type="messages").launch()

* Running on local URL:  http://127.0.0.1:7862
* To create a public link, set `share=True` in `launch()`.




## A lot is about to happen...

1. Be able to ask an LLM to evaluate an answer
2. Be able to rerun if the answer fails evaluation
3. Put this together into 1 workflow

All without any Agentic framework!
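
Before wiring in real model calls, the three steps above can be sketched with plain Python stand-ins. Everything here is hypothetical and only illustrates the control flow: `Verdict`, `fake_generate`, `fake_evaluate` and `answer_with_check` are placeholder names, and the "evaluator" is just a length check rather than an LLM.

```python
from dataclasses import dataclass


@dataclass
class Verdict:
    """Stand-in for the evaluator's structured result."""
    is_acceptable: bool
    feedback: str


def fake_generate(message, feedback=None):
    # Placeholder for an LLM call: gives a weak answer unless it receives feedback
    if feedback is None:
        return "idk"
    return f"Here is a fuller answer to: {message}"


def fake_evaluate(reply):
    # Placeholder for an LLM evaluator: rejects very short replies
    if len(reply) < 10:
        return Verdict(False, "The reply is too short to be helpful.")
    return Verdict(True, "Looks fine.")


def answer_with_check(message):
    reply = fake_generate(message)       # first attempt
    verdict = fake_evaluate(reply)       # 1. evaluate the answer
    if not verdict.is_acceptable:
        # 2. rerun, passing the evaluator's feedback along
        reply = fake_generate(message, verdict.feedback)
    return reply                         # 3. one workflow, one return value


result = answer_with_check("What do you do?")
print(result)
```

This sketch retries only once and does not re-evaluate the second attempt, which keeps the flow easy to follow.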

In [73]:
# Create a Pydantic model for the Evaluation

from pydantic import BaseModel  # Pydantic: a library for defining and validating data structures

class Evaluation(BaseModel):
    is_acceptable: bool
    feedback: str


In [74]:
evaluator_system_prompt = f"You are an evaluator that decides whether a response to a question is acceptable. \
You are provided with a conversation between a User and an Agent. Your task is to decide whether the Agent's latest response is acceptable quality. \
The Agent is playing the role of {name} and is representing {name} on their website. \
The Agent has been instructed to be professional and engaging, as if talking to a potential client or future employer who came across the website. \
The Agent has been provided with context on {name} in the form of their summary and LinkedIn details. Here's the information:"

evaluator_system_prompt += f"\n\n## Summary:\n{summary}\n\n## LinkedIn Profile:\n{linkedin}\n\n"
evaluator_system_prompt += "With this context, please evaluate the latest response, replying with whether the response is acceptable and your feedback."

In [75]:
def evaluator_user_prompt(reply, message, history):
    user_prompt = f"Here's the conversation between the User and the Agent: \n\n{history}\n\n"
    user_prompt += f"Here's the latest message from the User: \n\n{message}\n\n"
    user_prompt += f"Here's the latest response from the Agent: \n\n{reply}\n\n"
    user_prompt += "Please evaluate the response, replying with whether it is acceptable and your feedback."
    return user_prompt
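
Interpolating `history` directly into the f-string inserts Python's string form of the list of dicts. That works, but one optional alternative is to render the history as a plain transcript first. The sketch below assumes each history entry is a dict with `role` and `content` keys; `format_history` is a hypothetical helper, not part of the lab.

```python
def format_history(history):
    """Render a list of {'role', 'content'} dicts as a readable transcript."""
    labels = {"user": "User", "assistant": "Agent"}
    lines = []
    for turn in history:
        # Fall back to the raw role name for roles other than user/assistant
        speaker = labels.get(turn["role"], turn["role"])
        lines.append(f"{speaker}: {turn['content']}")
    return "\n".join(lines)


transcript = format_history([
    {"role": "user", "content": "Hi"},
    {"role": "assistant", "content": "Hello, how can I help?"},
])
print(transcript)
```

If you try this, you would pass `format_history(history)` into the prompt in place of `history`.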

In [76]:
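# Optional: an OpenAI-compatible client for Gemini, which also offers .beta.chat.completions.parse (structured outputs).
# It is not used below: evaluate() calls Azure OpenAI instead.
# Imported as OpenAIClient because OpenAI is already the name of the helper function defined earlier.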
from openai import OpenAI as OpenAIClient

gemini = OpenAIClient(
    api_key=os.getenv("GOOGLE_API_KEY"),
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)


In [77]:
def evaluate(reply, message, history) -> Evaluation:
    # Use Azure OpenAI for evaluator to avoid Gemini rate limits
    messages = [{"role": "system", "content": evaluator_system_prompt}] + [{"role": "user", "content": evaluator_user_prompt(reply, message, history)}]
    response = openai.beta.chat.completions.parse(model=os.getenv("AZURE_OPENAI_DEPLOYMENT_NAME"), messages=messages, response_format=Evaluation)
    return response.choices[0].message.parsed

In [78]:
messages = [{"role": "system", "content": system_prompt}] + [{"role": "user", "content": "do you hold a patent?"}]
response = openai.chat.completions.create(model=os.getenv("AZURE_OPENAI_DEPLOYMENT_NAME"), messages=messages)
reply = response.choices[0].message.content

In [79]:
reply

'No, I do not currently hold any patents. My work has primarily centered on machine learning, data science, and federated learning applications, with a focus on practical implementation and solving real-world problems. If there are ever any updates in this area, I’d be happy to share! Let me know if I can assist you with anything else.'

In [82]:
evaluate(reply, "do you hold a patent?", messages[:1])

Evaluation(is_acceptable=True, feedback='The response is acceptable. The Agent accurately states that Daniel David does not currently hold any patents, which aligns with the provided context. The response is professional, informative, and provides an engaging conclusion by inviting further questions.')

In [83]:
def rerun(reply, message, history, feedback):
    updated_system_prompt = system_prompt + "\n\n## Previous answer rejected\nYou just tried to reply, but the quality control rejected your reply\n"
    updated_system_prompt += f"## Your attempted answer:\n{reply}\n\n"
    updated_system_prompt += f"## Reason for rejection:\n{feedback}\n\n"
    messages = [{"role": "system", "content": updated_system_prompt}] + history + [{"role": "user", "content": message}]
    response = openai.chat.completions.create(model=os.getenv("AZURE_OPENAI_DEPLOYMENT_NAME"), messages=messages)
    return response.choices[0].message.content

In [84]:
def chat(message, history):
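    # Deliberately sabotage any answer about patents so the evaluator rejects it
    # and the rerun path below gets exercised
    if "patent" in message:
        system = system_prompt + "\n\nEverything in your reply needs to be in pig latin - \
              it is mandatory that you respond only and entirely in pig latin"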
    else:
        system = system_prompt
    messages = [{"role": "system", "content": system}] + history + [{"role": "user", "content": message}]
    response = openai.chat.completions.create(model=os.getenv("AZURE_OPENAI_DEPLOYMENT_NAME"), messages=messages)
    reply = response.choices[0].message.content

    evaluation = evaluate(reply, message, history)
    
    if evaluation.is_acceptable:
        print("Passed evaluation - returning reply")
    else:
        print("Failed evaluation - retrying")
        print(evaluation.feedback)
        reply = rerun(reply, message, history, evaluation.feedback)       
    return reply

In [85]:
gr.ChatInterface(chat, type="messages").launch()

* Running on local URL:  http://127.0.0.1:7863
* To create a public link, set `share=True` in `launch()`.




Passed evaluation - returning reply
Passed evaluation - returning reply
