## Welcome to Lab 3 for Week 1 Day 4

Today we're going to build something with immediate value!

In the folder `me` I've put a single file `linkedin.pdf` - it's a PDF download of my LinkedIn profile.

Please replace it with yours!

I've also made a file called `summary.txt` in the same folder, containing a short personal summary.

We're not going to use tools just yet; we'll add them tomorrow.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/tools.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Looking up packages</h2>
            <span style="color:#00bfff;">In this lab, we're going to use the wonderful Gradio package for building quick UIs, 
            and we're also going to use the popular pypdf PDF reader (the successor to PyPDF2). You can get guides to these packages by asking 
            ChatGPT or Claude, and you can find most open-source Python packages on the repository <a href="https://pypi.org">https://pypi.org</a>.
            </span>
        </td>
    </tr>
</table>

In [1]:
# If you don't know what any of these packages do - you can always ask ChatGPT for a guide!

from dotenv import load_dotenv
from openai import OpenAI
from mistralai import Mistral
from pypdf import PdfReader
import gradio as gr
import os

In [2]:
load_dotenv(override=True)

#openai = OpenAI()
mistralai_api_key = os.getenv('MISTRAL_API_KEY')
mistralai = Mistral(api_key=mistralai_api_key)

In [3]:
reader = PdfReader("me/Mehdi, EL HAYLALI-CV.pdf")
linkedin = ""
for page in reader.pages:
    text = page.extract_text()
    if text:
        linkedin += text

In [4]:
print(linkedin)

 
Mehdi EL HAYLALI 
  Data Scientist  
 
  
Phone 
 
+212 708581004 
Email 
 
mehdi.el.haylali@gmail.com 
Location 
 
Morocco 
 
LinkedIn 
 
www.linkedin.com/in/mehdi-el-haylali 
 
 Profile 
Senior data scientist with 4 years of consulting experience in the Analytics & AI field. My proficiency in 
Machine Learning, MLOps and Data engineering enables me to develop and deploy scalable solutions that 
meet business needs. I efficiently collaborate with cross-functional teams to integrate data-driven insights 
into business decision-making. Example projects include developing a recommendation system for a retail 
business, predicting customers upsell as well as predicting RFPs ’ outcomes. In addition to my traditional 
ML expertise, I earned practical skills in developing LLM -based solutions using langchain framework and 
frontier models through hands-on trainings. 
 Key Skills 
o Data science, MLOps, Advanced Analytics, Data Engineering, Generative AI (OpenAI, MistralAI, Prompt 
Engineer

In [5]:
with open("me/summary.txt", "r", encoding="utf-8") as f:
    summary = f.read()

In [6]:
summary

'My name is Mehdi EL HAYLALI. I was born in Fes, Morocco. I love coding and reading arabic books. I am currently living and working in Casablanca.'

In [7]:
name = "Mehdi EL HAYLALI"

In [8]:
system_prompt = f"You are acting as {name}. You are answering questions on {name}'s website, \
particularly questions related to {name}'s career, background, skills and experience. \
Your responsibility is to represent {name} for interactions on the website as faithfully as possible. \
You are given a summary of {name}'s background and LinkedIn profile which you can use to answer questions. \
Be professional and engaging, as if talking to a potential client or future employer who came across the website. \
If you don't know the answer, say so."

system_prompt += f"\n\n## Summary:\n{summary}\n\n## LinkedIn Profile:\n{linkedin}\n\n"
system_prompt += f"With this context, please chat with the user, always staying in character as {name}."


In [9]:
print(system_prompt)

You are acting as Mehdi EL HAYLALI. You are answering questions on Mehdi EL HAYLALI's website, particularly questions related to Mehdi EL HAYLALI's career, background, skills and experience. Your responsibility is to represent Mehdi EL HAYLALI for interactions on the website as faithfully as possible. You are given a summary of Mehdi EL HAYLALI's background and LinkedIn profile which you can use to answer questions. Be professional and engaging, as if talking to a potential client or future employer who came across the website. If you don't know the answer, say so.

## Summary:
My name is Mehdi EL HAYLALI. I was born in Fes, Morocco. I love coding and reading arabic books. I am currently living and working in Casablanca.

## LinkedIn Profile:
 
Mehdi EL HAYLALI 
  Data Scientist  
 
  
Phone 
 
+212 708581004 
Email 
 
mehdi.el.haylali@gmail.com 
Location 
 
Morocco 
 
LinkedIn 
 
www.linkedin.com/in/mehdi-el-haylali 
 
 Profile 
Senior data scientist with 4 years of consulting experie

In [10]:
model = "mistral-large-latest"

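def chat(message, history):
    # System prompt first, then the prior turns from Gradio, then the new user message
    messages = [{"role": "system", "content": system_prompt}] + history + [{"role": "user", "content": message}]
    # Use the `model` variable defined above rather than repeating the model name
    response = mistralai.chat.complete(model=model, messages=messages)
    return response.choices[0].message.content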
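
To make the shape of the `messages` list concrete, here is a small sketch that runs without an API key. The helper name `build_messages` and the sample history are made up for illustration. It assumes that Gradio's `type="messages"` mode passes `history` as a list of dicts with `role` and `content` keys, possibly alongside extra keys that some providers may reject, so the sketch keeps only those two.

```python
def build_messages(system_prompt, history, message):
    """Assemble the list sent to the chat model (hypothetical helper)."""
    # Keep only the keys a chat completion API expects; this assumes each
    # history entry is a dict with at least "role" and "content".
    cleaned = [{"role": h["role"], "content": h["content"]} for h in history]
    return (
        [{"role": "system", "content": system_prompt}]
        + cleaned
        + [{"role": "user", "content": message}]
    )


# Made-up sample data, only to show the resulting structure
sample_history = [
    {"role": "user", "content": "Hi!", "metadata": None},
    {"role": "assistant", "content": "Hello, how can I help?", "metadata": None},
]
sample_messages = build_messages(
    "You are acting as Mehdi.", sample_history, "Where do you live?"
)
for m in sample_messages:
    print(m["role"], "->", m["content"])
```

The order matters: the system message comes first, the earlier turns follow in sequence, and the new user message goes last.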

In [11]:
gr.ChatInterface(chat, type="messages").launch()

* Running on local URL:  http://127.0.0.1:7860
* To create a public link, set `share=True` in `launch()`.




## A lot is about to happen...

1. Be able to ask an LLM to evaluate an answer
2. Be able to rerun if the answer fails evaluation
3. Put this together into one workflow

All without any Agentic framework!
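
Before wiring in real models, the three steps can be sketched as plain Python with stub functions standing in for the LLM calls. Everything here (`Verdict`, `answer_with_check`, the `fake_*` functions) is hypothetical and only illustrates the control flow: generate, evaluate, and retry once on rejection.

```python
from dataclasses import dataclass


@dataclass
class Verdict:
    """Stand-in for the evaluator's structured answer (hypothetical)."""
    is_acceptable: bool
    feedback: str


def answer_with_check(question, generate_fn, evaluate_fn, regenerate_fn):
    """Generate a reply, evaluate it, and retry once if it is rejected."""
    reply = generate_fn(question)
    verdict = evaluate_fn(reply, question)
    if verdict.is_acceptable:
        return reply
    # Feed the rejected reply and the feedback into a second attempt
    return regenerate_fn(reply, question, verdict.feedback)


# Stub functions standing in for the LLM calls
def fake_generate(question):
    return "i dunno"


def fake_evaluate(reply, question):
    if len(reply) < 10:
        return Verdict(False, "Too short and unprofessional.")
    return Verdict(True, "Fine.")


def fake_regenerate(reply, question, feedback):
    return f"I don't have that information. (retried after feedback: {feedback})"


final_reply = answer_with_check(
    "Do you hold a patent?", fake_generate, fake_evaluate, fake_regenerate
)
print(final_reply)
```

Passing the three steps in as functions keeps the workflow independent of any particular provider, which is the point of doing this without a framework.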

In [12]:
# Create a Pydantic model for the Evaluation

from pydantic import BaseModel

class Evaluation(BaseModel):
    is_acceptable: bool
    feedback: str


In [13]:
evaluator_system_prompt = f"You are an evaluator that decides whether a response to a question is acceptable. \
You are provided with a conversation between a User and an Agent. Your task is to decide whether the Agent's latest response is acceptable quality. \
The Agent is playing the role of {name} and is representing {name} on their website. \
The Agent has been instructed to be professional and engaging, as if talking to a potential client or future employer who came across the website. \
The Agent has been provided with context on {name} in the form of their summary and LinkedIn details. Here's the information:"

evaluator_system_prompt += f"\n\n## Summary:\n{summary}\n\n## LinkedIn Profile:\n{linkedin}\n\n"
evaluator_system_prompt += f"With this context, please evaluate the latest response, replying with whether the response is acceptable and your feedback."

In [14]:
def evaluator_user_prompt(reply, message, history):
    user_prompt = f"Here's the conversation between the User and the Agent: \n\n{history}\n\n"
    user_prompt += f"Here's the latest message from the User: \n\n{message}\n\n"
    user_prompt += f"Here's the latest response from the Agent: \n\n{reply}\n\n"
    user_prompt += f"Please evaluate the response, replying with whether it is acceptable and your feedback."
    return user_prompt
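
Interpolating `history` directly into an f-string inserts the Python repr of the list of dicts. That works, but a plain transcript may be easier for the evaluator to read. One possible variant is sketched below, where `format_history` is a hypothetical helper and the history entries are assumed to be dicts with `role` and `content` keys.

```python
def format_history(history):
    """Render chat history as 'Role: content' lines (hypothetical helper)."""
    lines = []
    for turn in history:
        # Assumes each turn is a dict with "role" and "content" keys
        lines.append(f"{turn['role'].capitalize()}: {turn['content']}")
    return "\n".join(lines)


# Made-up sample conversation
sample_history = [
    {"role": "user", "content": "Where are you based?"},
    {"role": "assistant", "content": "I live and work in Casablanca."},
]
transcript = format_history(sample_history)
print(transcript)
```

The result could be dropped into the prompt in place of `{history}`.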

In [25]:
# Gemini through its OpenAI-compatible endpoint (`os` is already imported in the first cell)
gemini = OpenAI(
    api_key=os.getenv("GOOGLE_API_KEY"), 
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

In [25]:
groq_api_key = os.getenv('GROQ_API_KEY')
groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")



In [32]:
def evaluate(reply, message, history) -> Evaluation:
    # One system message with the evaluator instructions, one user message with the conversation to judge
    messages = [
        {"role": "system", "content": evaluator_system_prompt},
        {"role": "user", "content": evaluator_user_prompt(reply, message, history)},
    ]
    # parse() needs a model that supports the `json_schema` response format; otherwise the API returns a 400 error
    #response = gemini.beta.chat.completions.parse(model="gemini-2.0-flash", messages=messages, response_format=Evaluation)
    response = groq.beta.chat.completions.parse(model="llama-3.3-70b-versatile", messages=messages, response_format=Evaluation)
    return response.choices[0].message.parsed

In [33]:
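# `reply` and `messages` come from the cell labelled In [18] below, which was executed before this one
evaluate(reply, "do you hold a patent?", messages[:1])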

BadRequestError: Error code: 400 - {'error': {'message': 'This model does not support response format `json_schema`', 'type': 'invalid_request_error'}}
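
The 400 error indicates that this model rejects the `json_schema` response format that `parse(..., response_format=Evaluation)` relies on. One possible workaround, sketched here under assumptions, is to ask the model for a plain JSON object in the prompt and validate the reply yourself. The helper `parse_evaluation` and the sample string are hypothetical, and the sketch uses only the standard library. With Pydantic available, `Evaluation.model_validate_json(raw)` would play the same role.

```python
import json


def parse_evaluation(raw):
    """Validate a JSON string shaped like {"is_acceptable": bool, "feedback": str}.

    Hypothetical helper: raises ValueError if the reply does not match.
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"Evaluator did not return valid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("Evaluator reply must be a JSON object")
    if not isinstance(data.get("is_acceptable"), bool):
        raise ValueError("'is_acceptable' must be true or false")
    if not isinstance(data.get("feedback"), str):
        raise ValueError("'feedback' must be a string")
    return {"is_acceptable": data["is_acceptable"], "feedback": data["feedback"]}


# Made-up example of what a model might return when asked for JSON
raw_reply = '{"is_acceptable": false, "feedback": "The reply is in pig latin."}'
result = parse_evaluation(raw_reply)
print(result["is_acceptable"], "-", result["feedback"])
```

Many OpenAI-compatible endpoints accept `response_format={"type": "json_object"}` for this kind of request; whether a given model does is worth checking in the provider's documentation. Switching the evaluator to a model that supports structured output is the other option.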

In [18]:
messages = [{"role": "system", "content": system_prompt}] + [{"role": "user", "content": "do you hold a patent?"}]
response = mistralai.chat.complete(model=model, messages=messages)
reply = response.choices[0].message.content

In [19]:
reply

'As of my last update, I do not hold any patents. My focus has been on developing and implementing data science solutions, machine learning models, and advanced analytics projects for various clients. If you have any specific questions about my work or expertise, feel free to ask!'

In [30]:
def rerun(reply, message, history, feedback):
    # Extend the system prompt with the rejected answer and the evaluator's feedback
    updated_system_prompt = system_prompt + "\n\n## Previous answer rejected\nYou just tried to reply, but the quality control rejected your reply\n"
    updated_system_prompt += f"## Your attempted answer:\n{reply}\n\n"
    updated_system_prompt += f"## Reason for rejection:\n{feedback}\n\n"
    messages = [{"role": "system", "content": updated_system_prompt}] + history + [{"role": "user", "content": message}]
    # `openai` is never instantiated in this notebook, so call the Mistral client created earlier
    response = mistralai.chat.complete(model=model, messages=messages)
    return response.choices[0].message.content

In [35]:
def chat(message, history):
    # Test hook: force a pig latin answer for "patent" questions so the evaluator has something to reject
    if "patent" in message:
        system = system_prompt + "\n\nEverything in your reply needs to be in pig latin - \
              it is mandatory that you respond only and entirely in pig latin"
    else:
        system = system_prompt
    messages = [{"role": "system", "content": system}] + history + [{"role": "user", "content": message}]
    # `openai` is never instantiated in this notebook, so call the Mistral client created earlier
    response = mistralai.chat.complete(model=model, messages=messages)
    reply = response.choices[0].message.content

    evaluation = evaluate(reply, message, history)

    if evaluation.is_acceptable:
        print("Passed evaluation - returning reply")
    else:
        print("Failed evaluation - retrying")
        print(evaluation.feedback)
        reply = rerun(reply, message, history, evaluation.feedback)
    return reply

In [None]:
gr.ChatInterface(chat, type="messages").launch()