Sarah Sair sarahsair25

Sarah Sair

AI / GenAI Engineer & Data Scientist

Data Analytics • SQL • Python • RAG • LLM Apps • Machine Learning

Hi, I’m Sarah 👋

I build AI and data systems that are structured, testable, and production-aware.

With 13+ years of technical systems experience, I approach AI, ML, and analytics with a strong engineering mindset — focusing on reliability, evaluation, performance, and clean data modeling.

My work spans:

Generative AI & RAG systems
Machine Learning pipelines
SQL-based analytics platforms
Data cleaning & performance modeling

I specialize in turning raw data and human intent into scalable, measurable solutions.

🤖 GenAI & LLM Engineering

I treat prompt engineering as a system design discipline — not trial and error.

Core Focus:

Prompt engineering (Zero-Shot, Few-Shot, Chain-of-Thought, ReAct)
Retrieval-Augmented Generation (RAG)
LLM evaluation & benchmarking
Guardrails & fallback logic
Structured prompt validation pipelines
API-based LLM integrations

📄 Featured Work: Prompt Engineering Case Study

A structured case study demonstrating how evaluation loops and guardrails improved LLM reliability and reduced hallucinations by 30–40%.

Tech: Python · OpenAI API · JSON · Regex · Evaluation Frameworks
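
The guardrail-and-fallback idea can be sketched roughly as follows. This is a minimal illustration, not the case study's actual code: the response contract (`answer`, `confidence`), the `FALLBACK` payload, and the `call_model` callable are all hypothetical stand-ins, and no real LLM API is called.

```python
import json
import re
from typing import Callable

# Hypothetical response contract: a JSON object with these keys.
REQUIRED_KEYS = {"answer", "confidence"}
FALLBACK = {"answer": "I don't know.", "confidence": 0.0}


def extract_json(text: str):
    """Parse the span from the first '{' to the last '}' as JSON, or return None."""
    match = re.search(r"\{.*\}", text, flags=re.DOTALL)
    if match is None:
        return None
    try:
        return json.loads(match.group(0))
    except json.JSONDecodeError:
        return None


def is_valid(payload) -> bool:
    """Guardrail: required keys present and confidence is a number in [0, 1]."""
    if not isinstance(payload, dict) or not REQUIRED_KEYS.issubset(payload):
        return False
    confidence = payload["confidence"]
    # bool is a subclass of int, so reject it explicitly.
    if isinstance(confidence, bool) or not isinstance(confidence, (int, float)):
        return False
    return 0.0 <= confidence <= 1.0


def guarded_call(call_model: Callable[[str], str], prompt: str, max_attempts: int = 2) -> dict:
    """Retry until the output passes validation, then return the fallback."""
    for _ in range(max_attempts):
        payload = extract_json(call_model(prompt))
        if is_valid(payload):
            return payload
    return dict(FALLBACK)
```

Passing the model call in as a plain callable keeps the validation logic testable without network access; an evaluation loop could count how often the fallback path is taken.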

📊 Data Analytics & SQL Projects

🧠 SQL Mentor — User Performance Analysis (PostgreSQL)

Built an end-to-end analytics pipeline to model user performance and engagement patterns.

Designed staging → clean schema
Developed leaderboard, streak, and rolling 7-day metrics
Modeled question difficulty (avg / median / negative-rate)
Optimized queries using indexing

Tech: PostgreSQL · Advanced SQL · Window Functions · KPI Modeling
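
A rolling 7-day metric of this kind is typically a window function over daily rows. The sketch below is only an illustration: it uses Python's built-in `sqlite3` as a stand-in for PostgreSQL, the `daily_activity` table and its columns are hypothetical, and the `ROWS` frame equals a true 7-day window only if every user has exactly one row per calendar day.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema: one row per user per day.
conn.execute("CREATE TABLE daily_activity (user_id INTEGER, day TEXT, solved INTEGER)")
rows = [(1, f"2024-01-{d:02d}", d) for d in range(1, 10)]  # d problems solved on day d
conn.executemany("INSERT INTO daily_activity VALUES (?, ?, ?)", rows)

query = """
SELECT user_id,
       day,
       SUM(solved) OVER (
           PARTITION BY user_id
           ORDER BY day
           ROWS BETWEEN 6 PRECEDING AND CURRENT ROW
       ) AS solved_7d
FROM daily_activity
ORDER BY user_id, day
"""
rolling = conn.execute(query).fetchall()

for user_id, day, solved_7d in rolling:
    print(user_id, day, solved_7d)
```

The same `SUM(...) OVER (...)` clause is valid PostgreSQL; with gaps in the daily series, a calendar-based frame or a generated date spine would be needed instead.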

🛒 SQL-Only E-Commerce Analytics Platform

Designed a complete SQL-based analytics system for transactional e-commerce data.

Built revenue, AOV, LTV, and retention metrics
Implemented cohort and trend analysis
Created reusable reporting views
Modeled business performance KPIs

Tech: PostgreSQL · SQL · Aggregations · Window Functions

📁 Sales Data Cleaning (SQL Server / SSMS)

Developed a structured data-cleaning pipeline to transform raw CSV sales data into analytics-ready datasets.

Implemented staging → clean workflow
Standardized inconsistent categorical fields
Recomputed missing/mismatched totals
Applied deduplication and validation logic

Tech: SQL Server · T-SQL · Data Cleaning · ETL Concepts
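
The cleaning steps listed above can be illustrated in a few lines. This is a sketch in Python rather than the project's T-SQL, and the column names (`order_id`, `category`, `quantity`, `unit_price`) and the `CATEGORY_MAP` values are hypothetical.

```python
from decimal import Decimal

# Hypothetical mapping of raw category spellings to canonical labels.
CATEGORY_MAP = {"elec": "Electronics", "electronics": "Electronics", "home": "Home"}


def clean_rows(raw_rows):
    """Standardize categories, recompute totals, and drop duplicate order IDs."""
    seen = set()
    cleaned = []
    for row in raw_rows:
        order_id = row["order_id"].strip()
        if not order_id or order_id in seen:
            continue  # validation and deduplication: keep the first occurrence only
        seen.add(order_id)
        key = row["category"].strip().lower()
        quantity = int(row["quantity"])
        unit_price = Decimal(row["unit_price"])
        cleaned.append({
            "order_id": order_id,
            "category": CATEGORY_MAP.get(key, "Unknown"),
            "quantity": quantity,
            "unit_price": unit_price,
            # Recompute instead of trusting the raw total, which may be missing or wrong.
            "total": quantity * unit_price,
        })
    return cleaned
```

`Decimal` is used for prices so that recomputed totals do not pick up binary floating-point rounding errors.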

🧠 Machine Learning Projects

📊 Customer Churn Prediction

Built an end-to-end ML pipeline for churn prediction using telecom data.

Data cleaning & feature engineering
Model training & evaluation
Business-driven retention insights

Tech: Python · Pandas · scikit-learn · Classification

💳 Credit Card Fraud Detection

Developed a fraud detection model addressing severe class imbalance.

Precision/Recall optimization
Confusion matrix & F1-score evaluation
False positive minimization strategy

Tech: Python · Pandas · scikit-learn
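
With severe class imbalance, accuracy says little, so evaluation usually centers on precision, recall, and F1 at a chosen decision threshold. The sketch below computes them from scratch on made-up labels and scores; it is an illustration of the metrics, not the project's model or data.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels where 1 means fraud."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def evaluate(y_true, scores, threshold):
    """Threshold the scores, then compute precision, recall, and F1."""
    y_pred = [1 if s >= threshold else 0 for s in scores]
    tp, fp, fn, _tn = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "false_positives": fp}


# Made-up example: 3 fraud cases among 10 transactions.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.8, 0.3, 0.2, 0.1, 0.1, 0.05, 0.05]

for threshold in (0.5, 0.85):
    print(threshold, evaluate(y_true, scores, threshold))
```

On this toy data, raising the threshold from 0.5 to 0.85 removes the one false positive but drops recall from 2/3 to 1/3, which is the trade-off a false-positive minimization strategy has to weigh.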

🛠 Application Development

🤖 NLP Chatbot (Flask App)

Built a rule-based chatbot with TF-IDF & cosine similarity
Integrated REST API backend
Implemented preprocessing & lemmatization

Tech: Python · Flask · NLTK · HTML · CSS
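
TF-IDF matching with cosine similarity can be shown in plain Python. This is a simplified sketch under stated assumptions: the `FAQ` entries, the `min_score` cutoff, and the smoothed IDF formula are illustrative choices, and the NLTK lemmatization and Flask layer from the project are omitted.

```python
import math
import re
from collections import Counter

# Hypothetical knowledge base: known question -> canned answer.
FAQ = {
    "what are your opening hours": "We are open 9am to 5pm, Monday to Friday.",
    "how do i reset my password": "Use the 'Forgot password' link on the login page.",
    "where is my order": "You can track your order from the Orders page.",
}


def tokenize(text):
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_idf(docs):
    """Smoothed inverse document frequency for every token in the corpus."""
    n = len(docs)
    df = Counter(tok for doc in docs for tok in set(doc))
    return {tok: math.log((1 + n) / (1 + count)) + 1.0 for tok, count in df.items()}


def vectorize(tokens, idf):
    """Sparse TF-IDF vector; tokens unseen in the corpus are ignored."""
    counts = Counter(tok for tok in tokens if tok in idf)
    return {tok: tf * idf[tok] for tok, tf in counts.items()}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(tok, 0.0) for tok, w in a.items())
    norm = math.sqrt(sum(w * w for w in a.values())) * math.sqrt(sum(w * w for w in b.values()))
    return dot / norm if norm else 0.0


QUESTIONS = list(FAQ)
TOKENIZED = [tokenize(q) for q in QUESTIONS]
IDF = build_idf(TOKENIZED)
VECTORS = [vectorize(tokens, IDF) for tokens in TOKENIZED]


def reply(message, min_score=0.2):
    """Answer with the closest known question, or a fallback if nothing is close."""
    query = vectorize(tokenize(message), IDF)
    scores = [cosine(query, vec) for vec in VECTORS]
    best = max(range(len(scores)), key=scores.__getitem__)
    if scores[best] < min_score:
        return "Sorry, I don't have an answer for that."
    return FAQ[QUESTIONS[best]]


print(reply("I forgot my password, how can I reset it?"))
```

In a Flask app, a function like `reply` would sit behind a single POST endpoint that takes the user message and returns the answer as JSON.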

🔧 Technical Stack

Languages

Python · SQL

Databases

PostgreSQL · SQL Server

AI / GenAI

LLMs · Prompt Engineering · RAG · OpenAI API · Evaluation Frameworks

Machine Learning

scikit-learn · Classification · Clustering · Model Evaluation

Data Analytics

Data Modeling · Window Functions · KPI Design · Cohort Analysis · ETL

Tools

Git · GitHub · Flask · REST APIs · Debugging

📈 Currently Building

Advanced RAG pipelines with evaluation scoring
ML monitoring & drift detection
SQL performance optimization techniques
Production-ready AI system design

I’m continuously learning, building, and refining practical AI and data systems.

Let’s connect if you're interested in structured AI engineering, clean data modeling, or analytics-driven system design.

🤝 Let's Connect

🧠 GitHub: https://github.com/sarahsair25

LinkedIn: https://www.linkedin.com/in/sarahsair

⭐ If you find my projects interesting, feel free to explore, fork, or star them!
