nancyboukamel-ds/README.md

Hi, I'm Nancy Bou Kamel 👋

🎙️ Speech & Audio AI Engineer | ASR · TTS · Voice AI · LLM · Production ML

Passionate about building AI systems that understand, generate, and reason about sound. My engineering practice lives at the intersection of speech recognition, acoustic modelling, and real-time voice AI, from embedded hardware to cloud-native conversational agents.

I completed my M.Sc. in Data Science with a thesis GPA of 1.0. My research focused on lightweight, noise-robust ASR for battery-powered embedded devices: I compressed a 1 GB model to fewer than 500K parameters using Knowledge Distillation, achieving a 75% latency reduction and a Macro F1 above 0.8 across all real-world noise conditions.
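For readers unfamiliar with Knowledge Distillation, the core idea can be sketched in a few lines. This is a generic illustration of the standard temperature-scaled distillation loss, not the thesis implementation; the function names, the example logits, and the temperature of 2.0 are assumptions chosen for the demo.

```python
import math


def log_softmax(logits, temperature):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in scaled))
    return [x - log_norm for x in scaled]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    The T^2 factor is the usual rescaling that keeps gradient magnitudes
    comparable across temperatures. Hypothetical helper, for illustration.
    """
    log_p = log_softmax(teacher_logits, temperature)
    log_q = log_softmax(student_logits, temperature)
    kl = sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    return kl * temperature ** 2


# Made-up logits for a three-class example.
teacher = [4.0, 1.0, 0.5]
student_close = [3.5, 1.2, 0.4]
student_far = [0.5, 1.0, 4.0]

print(distillation_loss(teacher, student_close))
print(distillation_loss(teacher, student_far))
```

A student whose outputs track the teacher's softened distribution gets a small loss; in practice this term is typically combined with the ordinary task loss on ground-truth labels.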


💻 Technical Toolkit

| Category | Skills & Frameworks |
| --- | --- |
| ASR & Acoustic Modelling | RNN-Transducer · CTC · Whisper (LoRA) · WFST Decoding · MFCC · Mel-Filterbanks |
| TTS & Voice Synthesis | Cartesia TTS · Real-Time Voice Serving · Multi-Speaker Pipelines |
| Real-Time Voice Stack | LiveKit Agents · Deepgram STT · Silero VAD · Multilingual Turn Detection · BVC Noise Cancellation |
| Model Compression | Knowledge Distillation · LoRA Fine-Tuning · ONNX · Quantisation · Latency Optimisation |
| LLM & Agentic AI | LangChain · LangGraph · ReAct · RAG Pipelines · ChromaDB · Claude · OpenAI · Gemini |
| Deep Learning | PyTorch · PyTorch Lightning · HuggingFace Transformers · Scikit-learn · Optuna |
| MLOps & Cloud | Python (Advanced) · Docker · CI/CD · Git · AWS (Lambda · S3 · EC2 · Bedrock · CloudWatch) |

🔬 Highlighted Projects

🎙️ Noise-Robust ASR (Master Thesis @ Cardo Systems): Lightweight RNN-T/CTC ASR for embedded devices · Knowledge Distillation · 98% compression · 75% latency reduction · Macro F1 >0.8 across motorcycle / skiing / cycling noise profiles

🤖 LiveKit Multi-Agent Voice AI: Real-time conversational agent · Deepgram STT + Cartesia TTS + Silero VAD + GPT-4o-mini · 3-agent architecture with tool orchestration and handoff logic · Deployed in production

🧠 RAG Agentic System: LangGraph ReAct agent · ChromaDB vector search · Anthropic Claude · Async FastAPI · Docker

📝 GPT from Scratch: Transformer LLM built from first principles · Attention · Tokenisation · Training pipeline · Perplexity evaluation
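As a small illustration of the perplexity evaluation mentioned for the GPT from Scratch project, perplexity is the exponential of the average negative log-likelihood the model assigns to the reference tokens. The sketch below is a generic definition, not code from the project; the function name and the example probabilities are assumptions.

```python
import math


def perplexity(token_probs):
    """Perplexity from the probabilities a model assigned to each true token.

    Hypothetical helper: expects a non-empty list of probabilities in (0, 1].
    """
    if not token_probs:
        raise ValueError("need at least one token probability")
    avg_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(avg_nll)


# A model that spreads probability uniformly over 4 choices at every step
# has a perplexity of about 4; a perfectly confident model has perplexity 1.
uniform_guess = perplexity([0.25, 0.25, 0.25, 0.25])
perfect_guess = perplexity([1.0, 1.0, 1.0])
print(uniform_guess, perfect_guess)
```

Lower values mean the model is, on average, less surprised by the held-out text.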


🏆 1st Place, Facebook Women Hackathon 2021

🌍 Berlin, Germany · English C1 · German B2 · Arabic Native

📩 nancyboukamel12@gmail.com · LinkedIn · GitHub

Pinned

  1. Vector_db_RAG (Public)

    A scalable RAG system built with Python, Flask, and FAISS that uses a retrieval model and a generative model to provide fact-based answers, optimized with GPU acceleration for efficient vector sear…

    Jupyter Notebook 1

  2. Berlin-Traffic-Jam-Detection (Public)

    This project detects traffic jams in Berlin using real-time sensor data. It employs anomaly detection techniques like autoencoders and IQR to identify traffic congestion across segments and time in…

    Jupyter Notebook 1

  3. mall_customer_segmentation (Public)

    This project utilizes clustering algorithms such as K-Means, DBSCAN, and Affinity Propagation to identify customer segments in a mall based on factors like annual income, age, and spending score.

    Jupyter Notebook

  4. stock_price_prediction (Public)

    Stock Price Prediction And Forecasting Using Stacked LSTM

    Jupyter Notebook

  5. ChocoDelight (Public)

    The ChocoDelight Factory project uses machine learning to predict production issues, identify key problem factors, and estimate rework savings. An interactive dashboard displays predicted vs. actua…

    Jupyter Notebook

  6. TennisEye (Public)

    An advanced analytical platform that utilizes computer vision to study ball dynamics and player interactions, providing deep insights for enhanced game performance.

    Jupyter Notebook 1
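
The Berlin-Traffic-Jam-Detection description above mentions IQR-based anomaly detection. A minimal sketch of that general technique, assuming per-segment speed readings and the conventional 1.5 × IQR fences, might look like the following. It is not taken from the repository; the function name, the threshold, and the sample readings are illustrative assumptions.

```python
import statistics


def iqr_outlier_indices(values, k=1.5):
    """Return indices of values outside [Q1 - k*IQR, Q3 + k*IQR].

    Hypothetical helper: uses the default (exclusive) quartile method
    of statistics.quantiles.
    """
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values) if v < low or v > high]


# Made-up speed readings (km/h) for one road segment; the sharp drop
# stands in for a possible traffic jam.
speeds = [52, 50, 49, 51, 53, 48, 50, 12, 51, 49]
flagged = iqr_outlier_indices(speeds)
print(flagged)
```

The IQR rule needs no training and is easy to explain, which makes it a common baseline next to learned detectors such as autoencoders.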