Stealth Startup / Engineer
1 JUN 2023 - PRESENT, LONDON
doc.ai / Data & AI Engineer
16 JUN 2021 - 20 OCT 2022, AI MODULES, REMOTE
- Digital Biomarker AI for Myasthenia Gravis:
- Build audio feature extraction pipeline using signal processing
- Standardise ML experiment tracking and evaluation using MLFlow and Sklearn
- Train random forest, SVM and CNN models
- Build data annotation pipelines with AWS
- Co-author paper for Digital Biomarkers (1)
- PDF Data Extraction Pipeline:
- Extract entities from PDFs into CSVs using image processing, OCR, NLP
- Transform between CSV and PDF annotations for human-in-the-loop learning
- Medication Recognition REST API:
- Design information retrieval algorithm to match OCR, NLP results to a graphically structured database with paper accepted at 2022 ICML Workshop on Interpretable Machine Learning in Healthcare (2)
- Build user feedback mechanism for REST API with Golang, SQL and yoyo
Health Data Insight / ML Intern
6 JUL - 28 AUG 2020, SYNTHETIC DATA SERVICE, REMOTE
- Evaluate quality of synthetic, irregular time series
- Apply differential privacy to GANs
- Design project logo
London Mathematical Society / Writer
1 JUL - 30 AUG 2019, MATHEMATICAL SUCCESS STORIES PROJECT, REMOTE
- Write articles on historical mathematicians for a public audience
- Contact academics and professionals and collect their career stories
Rolls Royce / Work Experience Engineer
25-31 JUL 2016, ADVANCED BLADE CASTING FACILITY, ROTHERHAM
- Analyse defect data using Minitab
Network Rail / Engineering Education Scheme Student
NOV 2015 - APR 2016, YORK
- Design and prototype monitoring system for user-worked crossings with Arduino
- Attend electronical engineering lab at Newcastle University
- Present work at event held at Sheffield University
Data analysis: Python, Pandas, Scikit-Learn, Matplotlib, Seaborn, Numpy, SQL, Excel
Data annotation: AWS Sagemaker, VGG Image Annotator, Label Studio
Machine learning: Python, Tensorflow, Keras, Golang, Docker, Kubernetes
Version control: GitHub, MLFlow
Cloud computing: GCP (Compute, Storage, Healthcare), AWS (S3, Medical, Sagemaker, Lambda)
Prototyping: Gradio, FastAPI, Balsamiq, Langchain, LlamaIndex, Javascript, Bootstrap
University of Cambridge / BA (Hons) Mathematics
OCT 2017 - SEPT 2021 , MURRAY EDWARDS COLLEGE, CAMBRIDGE
Manhattan Urban Forest / Python + Pandas
DEC 2022, GITHUB PAGES
- Awarded 5th among over 1k participants in data science competition
- Exploring and drawing recommendations from New York tree census data
Flocking / Javascript + P5
JUL 2021, GITHUB PAGES
- Interactive predator-prey boids simulation which runs in the browser
- Based on this video by The Coding Train with my addition of:
- Predator and food objects and interactions
- Boid-mouse attraction
- Click to add new food, prey, predator
Curve on a Sphere / Blender
JAN 2021, YOUTUBE
- Animated curve being painted on a glass sphere using procedual textures
- Recorded sound effects
- Inspired by an example sheet question from Part 1A Vector Calculus