I'm a lead research engineer at Dynamo AI (YC 22) based in Amsterdam (NL) / Gdynia (PL). I work on (synthetic) data flywheels, evaluations, and training, all aimed at building efficient and aligned custom guardrailing and judge models. The complexity—and fun—lies in tackling subjective, under-specified objectives through iterative human-model alignment.
Before joining Dynamo I worked in RL for Combinatorial Optimization and Code Generation teams at Qualcomm AI Research in Amsterdam. I studied Artifical Intelligence at the Univeristy of Amsterdam, specializing in Reinforcement Learning where I did a 9 month intership at Amsterdam Machine Learning lab with prof. Herke van Hoof.
- 🌟 Built Dynamo's output guardrail offering and team from scratch into a mature, high-demand product. I touched every part of the stack—from defining evaluation sets with PMs, setting up annotation and feedback loops, synthetic data generation, to training and implementing post-training interventions for efficient inference. The product now safeguards AI deployments at several Fortune 500 companies (1, 2, 3).
- 🚀 With my team at Qualcomm, we achieved SOTA on The Abstraction and Reasoning Challenge (ARC) using a ~220M parameter language model by combining hindsight relabeling and prioritized hindsight replay (ICML '24 paper). Also proud of exploring MCTS as a neurally-guided decoding strategy for zero-human-data regimes, despite it turning into a valuable learning experience (ICML '24 workshop paper).
- ⚡ Demonstrated that hierarchical RL can alleviate congestion in power grids up to 6x more effectively than physics-based simulators, confirming the advantage of hierarchical policies. We shared these insights in a paper.
Outside work, I’m passionate about endurance sports 🏊🚴♂️🏃♂️ and the science behind peak human performance. My favorite is Middle Distance Triathlon (70.3 Ironman), and I’ve got a sub-10 Ironman race under my belt, still chasing that sub-9 dream. While I don't get much time for other sports, I remain enthusiastic and determined—surfing may still happen one day! 🌊
Contact: Want to chat about AI, go for a bike ride, or grab coffee? Send me a DM on X / LinkedIn / Strava.