# Lightning Lesson: How to Build a Resume Optimizer with AI

Code authored by: Shaw Talebi

[Recording link](https://youtu.be/R5WXaxmb6m4)

### Imports

In [1]:
from IPython.display import display, Markdown
from openai import OpenAI
from top_secret import my_sk

from markdown import markdown
from weasyprint import HTML
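
The notebook imports `my_sk` from a local `top_secret` module that is not shown here. As a minimal alternative sketch, assuming the key is stored in the `OPENAI_API_KEY` environment variable, the key could be loaded as follows. `load_api_key` is a hypothetical helper, not part of the original code.

```python
import os


def load_api_key(env_var: str = "OPENAI_API_KEY") -> str:
    """Read the API key from an environment variable (assumed to be set beforehand)."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable before running the notebook.")
    return key
```

With this in place, `my_sk = load_api_key()` would stand in for the `top_secret` import.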

### 1) Import Resume & Job Description

In [2]:
# Open and read the Markdown file
with open("resumes/resume.md", "r", encoding="utf-8") as file:
    resume_string = file.read()

# display resume
display(Markdown(resume_string))

# Shaw Talebi  
**Email**: [shawhintalebi@gmail.com](mailto:shawhintalebi@gmail.com)  
**Homepage**: [shawhintalebi.com](https://shawhintalebi.com/) | **LinkedIn**: [shawhintalebi](https://www.linkedin.com/in/shawhintalebi/)  

---

## Education  
**The University of Texas at Dallas**  
- PhD, Physics - *May 2022*  
- M.S., Physics - *December 2019*  
- B.S., Physics - *May 2017*  

## Technical Skills  
- **Tools**: Python, SQL, GitHub, AWS (SageMaker), Snowflake, Julia  
- **Certifications**:  AWS Cloud Practitioner Essentials (AWS), Data Structure & Algorithms (Udemy), Tableau (Udemy)  

## Work Experience

### **Founder**  
**The Data Entrepreneurs** | Plano, Texas *(January 2023 - Present)*  
- Founded a community of 900+ data & AI entrepreneurs. Hosting monthly workshops and podcast episodes.

### **Content Creator**  
**YouTube** | Plano, Texas *(September 2020 - Present)*  
- Making videos on Data Science + Entrepreneurship. 1M+ Views. 35k+ subs.

### **Writer**  
**Towards Data Science** | Plano, Texas *(November 2020 - Present)*  
- Writing articles on Data Science + AI. 25k Monthly Reads. 16k Followers.

### **Data Science Consultant**  
**Shawhin Talebi Ventures LLC** | Plano, Texas *(December 2020 - Present)*  
- Implemented full data pipeline for novel study evaluating the impact of over 300 biometrics variables on human performance in live-fire training scenarios which uncovered link between EEG activity and performance.  
- Applied unsupervised learning approaches to longitudinal ICU data to discover sepsis sub-phenotypes.  

### **Data Scientist**  
**Toyota Financial Services** | Plano, Texas *(June 2022 - July 2023)*  
- Uncovered and corrected issue in a pre-existing production credit risk model that impacted over 70% accounts and wrote model monitoring scripts to help avoid future failures.  
- Redeveloped loan originations model for independent dealers which resulted in a 50% model performance improvement and provided $2.5 million in realized value to business partner.  

### **Research Assistant**  
**The University of Texas at Dallas (Department of Physics)** | Richardson, Texas *(December 2018 - May 2022)*  
- Published open-source methodology to discover optimal EEG bands which led to better characterization of the underlying power spectrum than the more commonly used band boundaries by a factor of 2.  
- Trained over 100 machine learning models to estimate particulate matter (PM) concentrations based on a suite of over 300 biometric variables, achieving high fidelity (r² = 0.91).  
- Led 5 teams focused on deployment of a real-time Python-based biometrics application, unveiling immediate insights that were previously inaccessible.  

## Awards and Honors  
- **2021 Friends of BrainHealth Visionary New Scientist Award** — Finalist *(September 2021)*  
- **2nd Annual Weeks of Welcome Poster Competition** — 3rd Place Winner *(August 2019)*  
- **Outstanding Undergraduate Student** — Nominee *(April 2017)*  
- **Student Leader of the Year** — Nominee *(April 2017)*

---

## Publications  

1. Talebi S., Lary D.J., Wijeratne L. OH., and Lary, T. *Modeling Autonomic Pupillary Responses from External Stimuli Using Machine Learning* (2019). DOI: [10.26717/BJSTR.2019.20.003446](https://doi.org/10.26717/BJSTR.2019.20.003446).  

2. Wijeratne, L.O.; Kiv, D.R.; Aker, A.R.; Talebi, S.; Lary, D.J. *Using Machine Learning for the Calibration of Airborne Particulate Sensors*. Sensors 2020, 20, 99.  

3. Lary, D.J. et al. *Autonomous Learning of New Environments with a Robotic Team Employing Hyper-Spectral Remote Sensing, Comprehensive In-Situ Sensing and Machine Learning*. Sensors 2021, 21, 2240. [DOI](https://doi.org/10.3390/s21062240).  

4. Zhang, Y.; Wijeratne, L.O.H.; Talebi, S.; Lary, D.J. *Machine Learning for Light Sensor Calibration*. Sensors 2021, 21, 6259. [DOI](https://doi.org/10.3390/s21186259).  

5. Talebi, S. et al. *Data-Driven EEG Band Discovery with Decision Trees*. Sensors 2022, Vol. 22, Page 3048, 22(8), 3048. [DOI](https://doi.org/10.3390/S22083048).  

6. Fernando, B.A. et al. *Unsupervised Blink Detection Using Eye Aspect Ratio Values*. Preprints 2022, [DOI](https://doi.org/10.20944/preprints202203.0200.v1).  

7. Talebi, S. et al. *Decoding Physical and Cognitive Impacts of PM Concentrations at Ultra-fine Scales*. [Research Square DOI](https://doi.org/10.21203/rs.3.rs-1499191/v1).  

8. Lary, D.J. et al. *Machine Learning, Big Data, and Spatial Tools: A Combination to Reveal Complex Facts That Impact Environmental Health*. Springer, Cham. [DOI](https://doi.org/10.1007/978-3-030-71377-5_12).  

9. Wijerante, L.O.H. et al. *Advancement in Airborne Particulate Estimation Using Machine Learning*. Springer, Cham. [DOI](https://doi.org/10.1007/978-3-030-71377-5_13).  

10. Waczak, J. et al. *Characterizing Water Composition with an Autonomous Robotic Team Employing Comprehensive In Situ Sensing, Hyperspectral Imaging, Machine Learning, and Conformal Prediction*. Remote Sens. 2024, 16, 996. [DOI](https://doi.org/10.3390/rs16060996).  

11. Ruwali, S. et al. *Quantifying Inhaled Concentrations of Particulate Matter, Carbon Dioxide, Nitrogen Dioxide, and Nitric Oxide Using Observed Biometric Responses with Machine Learning*. Preprints 2024. [DOI](https://doi.org/10.20944/preprints202402.0697.v1).  

---

**References and additional information available upon request.**


In [3]:
# input job description
jd_string = input()

 Job Description  Great Expectations' mission is to revolutionize the speed and integrity of data collaboration. Our SaaS data quality platform, GX Cloud, provides end-to-end support for building trust in your data. It's powered by GX Core, our open source Python library and one of the most widely-used data quality solutions in the world. GX Core leverages extensive data quality experience from a large and vibrant community with over 11,000 members and hundreds of open source contributors. Data teams all over the world rely on Great Expectations to instrument and understand their data. GX Cloud builds on that trust and community to succeed in our mission.  We are looking for someone to:  Create and uphold a high standard of developer experience within our documentation Work with Product, Engineering, and Developer Relations, to ensure timely, thorough, and accurate documentation of both new and existing product features. Help drive a use-case-based information-architecture approach to 
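
`input()` returns a single line of text, which may be why the pasted description above shows up with its line breaks collapsed. A minimal sketch of an alternative, assuming the job description has been saved to a text file first. The helper `read_job_description` and the path in the usage note are hypothetical.

```python
from pathlib import Path


def read_job_description(path: str) -> str:
    """Read a job description from a text file, keeping its line breaks."""
    text = Path(path).read_text(encoding="utf-8")
    # drop trailing whitespace on each line, then trim blank lines at both ends
    lines = [line.rstrip() for line in text.splitlines()]
    return "\n".join(lines).strip()
```

Usage would look like `jd_string = read_job_description("job_descriptions/jd.txt")`, where the file path is an assumption.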

### 2) Construct Prompt Template

In [4]:
def prompt_template(resume_string, jd_string):
    return f"""
You are a professional resume optimization expert specializing in tailoring resumes to specific job descriptions. Your goal is to optimize my resume and provide actionable suggestions for improvement to align with the target role.

### Guidelines:
1. **Relevance**:  
   - Prioritize experiences, skills, and achievements **most relevant to the job description**.  
   - Remove or de-emphasize irrelevant details to ensure a **concise** and **targeted** resume.
   - Limit work experience section to 2-3 most relevant roles
   - Limit bullet points under each role to 2-3 most relevant impacts

2. **Action-Driven Results**:  
   - Use **strong action verbs** and **quantifiable results** (e.g., percentages, revenue, efficiency improvements) to highlight impact.  

3. **Keyword Optimization**:  
   - Integrate **keywords** and phrases from the job description naturally to optimize for ATS (Applicant Tracking Systems).  

4. **Additional Suggestions** *(If Gaps Exist)*:  
   - If the resume does not fully align with the job description, suggest:  
     1. **Additional technical or soft skills** that I could add to make my profile stronger.  
     2. **Certifications or courses** I could pursue to bridge the gap.  
     3. **Project ideas or experiences** that would better align with the role.  

5. **Formatting**:  
   - Output the tailored resume in **clean Markdown format**.  
   - Include an **"Additional Suggestions"** section at the end with actionable improvement recommendations.  

---

### Input:
- **My resume**:  
{resume_string}

- **The job description**:  
{jd_string}

---

### Output:  
1. **Tailored Resume**:  
   - A resume in **Markdown format** that emphasizes relevant experience, skills, and achievements.  
   - Incorporates job description **keywords** to optimize for ATS.  
   - Uses strong language and is no longer than **one page**.

2. **Additional Suggestions** *(if applicable)*:  
   - List **skills** that could strengthen alignment with the role.  
   - Recommend **certifications or courses** to pursue.  
   - Suggest **specific projects or experiences** to develop.
"""

In [5]:
prompt = prompt_template(resume_string, jd_string)
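
Because the job description comes from `input()`, it can be an empty string if nothing was pasted. A small sanity check before calling the API, sketched under the assumption that both inputs should appear verbatim in the prompt; `validate_prompt` is a hypothetical helper.

```python
def validate_prompt(prompt: str, resume_string: str, jd_string: str) -> None:
    """Raise ValueError if either input is empty or missing from the prompt."""
    for label, value in (("resume", resume_string), ("job description", jd_string)):
        if not value.strip():
            raise ValueError(f"The {label} is empty.")
        if value not in prompt:
            raise ValueError(f"The {label} was not inserted into the prompt.")
```

Calling `validate_prompt(prompt, resume_string, jd_string)` returns nothing when both inputs are present and raises otherwise.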

### 3) Generate Resume with GPT-4o-mini

In [12]:
# setup api client
client = OpenAI(api_key=my_sk)

# make api call
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "Expert resume writer"},
        {"role": "user", "content": prompt}
    ],
    temperature=0.7,
)

# extract response
response_string = response.choices[0].message.content
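
The same call can be wrapped in a function so the model and temperature are easy to change between runs. This is a sketch, not the author's code: `generate_text` is a hypothetical name, and `client` is assumed to be the `OpenAI` client created above.

```python
def generate_text(client, prompt, model="gpt-4o-mini", temperature=0.7):
    """Send one prompt through the chat completions endpoint and return the reply text."""
    response = client.chat.completions.create(
        model=model,
        messages=[
            {"role": "system", "content": "Expert resume writer"},
            {"role": "user", "content": prompt},
        ],
        temperature=temperature,
    )
    content = response.choices[0].message.content
    # guard against an empty reply before any downstream parsing
    if not content:
        raise ValueError("The model returned an empty response.")
    return content
```

With this helper, the extraction step becomes `response_string = generate_text(client, prompt)`.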

### 4) Display Results

In [13]:
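# separate new resume from improvement suggestions
# (partition returns an empty second part instead of failing when the heading is missing)
resume_part, _, suggestions_part = response_string.partition("## Additional Suggestions")
response_list = [resume_part, suggestions_part]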

In [14]:
display(Markdown(response_list[0]))
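
In this run the model wrapped its answer in a `markdown` code fence, so the fence markers end up in the displayed resume and in the saved files. A minimal sketch of one way to unwrap the fence before splitting, assuming the response contains at most one fenced block of interest. `extract_fenced_markdown` and `split_response` are hypothetical helpers, not part of the original notebook.

```python
import re


def extract_fenced_markdown(text: str) -> str:
    """Return the body of the first fenced block, or the text unchanged if there is none."""
    match = re.search(r"```[a-zA-Z]*\n(.*?)\n```", text, flags=re.DOTALL)
    return match.group(1) if match else text


def split_response(text: str, heading: str = "## Additional Suggestions"):
    """Split the model output into (resume, suggestions) at the first occurrence of heading."""
    body = extract_fenced_markdown(text)
    resume_md, _, suggestions_md = body.partition(heading)
    return resume_md.strip(), suggestions_md.strip()
```

For example, `resume_md, suggestions_md = split_response(response_string)` would give two strings without fence markers; `suggestions_md` is empty when the heading is absent. Any text the model adds after the closing fence is discarded by this approach.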

```markdown
# Shaw Talebi  
**Email**: [shawhintalebi@gmail.com](mailto:shawhintalebi@gmail.com)  
**Homepage**: [shawhintalebi.com](https://shawhintalebi.com/) | **LinkedIn**: [shawhintalebi](https://www.linkedin.com/in/shawhintalebi/)

---

## Education  
**The University of Texas at Dallas**  
- PhD, Physics - *May 2022*  
- M.S., Physics - *December 2019*  
- B.S., Physics - *May 2017*  

## Technical Skills  
- **Tools**: Python, SQL, GitHub, AWS (SageMaker), Snowflake, Julia  
- **Certifications**: AWS Cloud Practitioner Essentials, Data Structure & Algorithms, Tableau  

## Relevant Experience  

### **Data Science Consultant**  
**Shawhin Talebi Ventures LLC** | Plano, Texas *(December 2020 - Present)*  
- Implemented full data pipeline to evaluate 300+ biometrics variables, uncovering key insights in human performance.  
- Applied unsupervised learning on ICU data to discover sepsis sub-phenotypes, enhancing data integrity and understanding.

### **Data Scientist**  
**Toyota Financial Services** | Plano, Texas *(June 2022 - July 2023)*  
- Redeveloped loan origination models, improving performance by 50% and yielding $2.5 million in value.  
- Corrected production credit risk model, impacting over 70% of accounts, and developed monitoring scripts to prevent future issues.

### **Research Assistant**  
**The University of Texas at Dallas** | Richardson, Texas *(December 2018 - May 2022)*  
- Published methodology for EEG band discovery, enhancing characterization precision by 100%.  
- Led deployment of Python-based biometrics application providing real-time insights, improving research outcomes.

## Publications  
1. Talebi S. et al. *Modeling Autonomic Pupillary Responses Using Machine Learning* (2019). [DOI](https://doi.org/10.26717/BJSTR.2019.20.003446).  
2. Wijeratne, L.O. et al. *Using Machine Learning for Calibration of Airborne Particulate Sensors*. Sensors 2020.  
3. Lary, D.J. et al. *Autonomous Learning with Robotic Teams*. Sensors 2021.  
4. Zhang, Y. et al. *Machine Learning for Light Sensor Calibration*. Sensors 2021.  
5. Talebi, S. et al. *Data-Driven EEG Band Discovery with Decision Trees*. Sensors 2022.  

## Awards and Honors  
- **Friends of BrainHealth Visionary New Scientist Award** — Finalist *(September 2021)*  
- **Weeks of Welcome Poster Competition** — 3rd Place *(August 2019)*  

---

**References and additional information available upon request.**

---



#### save new resume

In [15]:
# save as PDF
output_pdf_file = "resumes/resume_new.pdf"

# Convert Markdown to HTML
html_content = markdown(response_list[0])

# Convert HTML to PDF and save
HTML(string=html_content).write_pdf(output_pdf_file, stylesheets=['resumes/style.css'])

In [16]:
# save as markdown
output_file = "resumes/resume_new.md"

with open(output_file, "w", encoding="utf-8") as file:
    file.write(response_list[0])

#### display suggestions for improvement

In [17]:
display(Markdown(response_list[1]))



1. **Skills to Enhance**:  
   - Explore "Docs-as-Code" methodologies to align with technical writing requirements.
   - Gain experience with data engineering tools like Spark to broaden expertise.

2. **Certifications or Courses**:  
   - Consider enrolling in courses focused on technical documentation or UX writing.
   - Pursue additional training in advanced Python libraries relevant to data engineering.

3. **Project Ideas**:  
   - Develop a project documenting a hypothetical SaaS product, emphasizing open-source components.
   - Create a series of Python tutorials or integration guides to demonstrate technical writing capabilities.
```

### Explanation of Tailoring:

1. **Relevance**: Focused on roles and experiences most aligned with technical writing, data collaboration, and Python expertise as required by the job description.
2. **Action-Driven Results**: Emphasized quantifiable impacts and contributions in relevant roles.
3. **Keyword Optimization**: Integrated terms like "data pipeline," "unsupervised learning," and "EEG bands" to align with the focus on data quality and technical documentation.
4. **Formatting**: Structured the resume in a clean Markdown format, ensuring conciseness and clarity.
5. **Additional Suggestions**: Provided actionable recommendations to enhance alignment with the target role.

This optimized resume showcases your suitability for the Great Expectations role by highlighting your technical experience, relevant skills, and achievements in data science and documentation.