Bridging the gap between cutting-edge AI research and user-centric web applications. Specializing in Computer Vision, Vision Language Models (VLMs), and Intelligent Document Processing (IDP). I enjoy building scalable systems that bring AI models to real-world production.
- AI & Deep Learning: Computer Vision, Natural Language Processing, Document Layout Analysis (DLA)
- Models & Architectures: ViT, Swin Transformer, MobileNetV3, YOLOX, Qwen VLMs
- Engineering: Seamlessly integrating heavy ML models into responsive web platforms (Toss, Polaris Office)
- DocLayout & Scanner: Developing advanced document layout analysis and scanning solutions.
- Polaris Office & Image Tools: Contributing to AI-driven features for robust office utility platforms.
- Quizly: A dynamic platform for interactive personality tests and quizzes.
- Pocketface: A web application utilizing image classification to find look-alikes.
- Apps in Toss: Experience in building and deploying user-facing applications within the Toss ecosystem.
LibreOffice Core Contributor > Actively contributing to the LibreOffice core engine. Experienced in navigating and optimizing large-scale
C++codebases, participating in global code reviews, and collaborating with a worldwide community of developers.

