Skip to content
View sparkup's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report sparkup

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sparkup/README.md

Hey, I’m David.

I’m a senior software and data engineer with over 15 years of experience across full-stack development, data engineering, and technical leadership. My work has consistently focused on designing, industrialising, and operating production-grade platforms and systems in data-intensive environments.

I specialise in building reliable, scalable data platforms and backend systems that support complex business workflows, APIs, and decision-critical applications. Alongside this foundation, I design and integrate applied AI solutions — including LLM-based systems, RAG architectures, and MLOps pipelines — with a strong emphasis on robustness, observability, and long-term production viability.

Through both professional and personal projects, I work end-to-end across the system lifecycle: architecture and data modelling, pipelines, distributed systems, CI/CD, deployment, monitoring, and operational reliability. My focus is always on turning data and AI into durable, maintainable systems that deliver real business value.

Outside of engineering, I’m into trail running and ultra-endurance. If you’re ever around Lyon and feel like chatting about systems, data, or AI — or going for a run — feel free to reach out.


Focus areas

Data & AI Platform Engineering · Production AI Systems · Data Engineering · LLM-based Systems · RAG Architectures · MLOps · DevOps

Popular repositories Loading

  1. medical-llm-finetuning-alignment medical-llm-finetuning-alignment Public

    Medical LLM fine-tuning and preference alignment using SFT and DPO, with evaluation and deployment examples.

    Jupyter Notebook 1

  2. rag-assisted-chess-agent rag-assisted-chess-agent Public

    An agent-assisted chess analysis system combining Stockfish, RAG, and a modular web architecture, designed as an extensible foundation for future LLM-driven autonomy.

    JavaScript 1

  3. multimodal-data-pipeline-etl multimodal-data-pipeline-etl Public

    Multimodal ETL pipeline for extracting, transforming, and storing web content (text, images, audio, video). Orchestrated with Apache Airflow and designed as a foundation for analytics, machine lear…

    Jupyter Notebook 1

  4. autonomous-spacecraft-rl autonomous-spacecraft-rl Public

    Autonomous spacecraft reinforcement learning project focused on mission control tasks (navigation, control, and decision‑making) using modern RL algorithms. Includes training notebooks, reproducibl…

    Jupyter Notebook 1

  5. semi-supervised-medical-image semi-supervised-medical-image Public

    Semi-supervised medical image classification pipeline comparing unsupervised clustering and weak-label methods (label propagation, pseudo-labeling) on CNN embeddings. Includes notebooks, reusable f…

    Jupyter Notebook 1

  6. cultural-events-rag-assistant cultural-events-rag-assistant Public

    Cultural events RAG assistant using OpenAgenda data, FAISS, and Mistral. Ingests public events, builds a vector index, and serves a FastAPI /ask endpoint for natural‑language queries. Dockerized fo…

    Python 1