I'm a Data Scientist specializing in Document Intelligence and Large Language Models. My work focuses on building robust document processing pipelines and developing efficient information retrieval systems. I'm particularly passionate about:
- π Document AI and multimodal transformers
- π Information retrieval and indexing systems
- π€ LLM reasoning and Chain of Thought approaches
- π± Exploring new architectures in RAG design patterns
- π§ Experimenting with reasoning models (DeepSeek-R1)
- π Improving document indexing and retrieval systems
- π Chain of Thought implementations
- πΌοΈ Vision transformer architectures for document understanding
- π Python
- π SQL
- π€ RAG Pipelines
- π§ LLMs (Open source and Closed source)
- π‘ Prompt Engineering
- π― Fine-tuning (PEFT, LoRA)
- π RLHF
- π₯ PyTorch
- β‘ TensorFlow
- π Scikit-learn
- π LangChain
- π LlamaIndex
- π Qdrant
- π― Vespa
- π FAISS
- πΎ PostgreSQL
- π MongoDB
- βοΈ AWS
- π· Azure AI Studio
- β‘ Azure Functions
- π³ Docker
- π Git
- π Document Layout Analysis
- π OCR
- π― Object Detection
- ποΈ Vision Transformers
- π§ CNN
- π Statistics
- π¬ Scientific Research
- π€ Supervised Learning
- π― Unsupervised Learning
- π₯ Active volunteer in community development initiatives
- π± Interested in technology for social impact