I'm Sourab Mangrulkar, an Applied Scientist and Machine Learning Engineer from India 🇮🇳.
- 🔭 I'm currently working as an Applied Scientist at Amazon.
- 🌱 Exploring Natural Language Processing, Computer Vision, and Distributed Training at Scale. Always up for meaningful collaboration.
- 😄 Pronouns: He/Him/His.
- ⚡ Painting 🎨, sketching ✏️, and poetry 📝 are my favourite hobbies. Recently, I've started reading up on stocks and financial markets.
- 📫 How to reach me:
- arXiv 2023 (BigCode Project): SantaCoder: don't reach for the stars!
- SIGIR 2022 (eCommerce Workshop): HISS: A Novel Hybrid Inference Architecture in Embedding Based Product Sourcing using Knowledge Distillation
- KDD 2022 (ADS Track): BE3R: BERT-based early-exit using expert routing
- WWW 2022 (Industry Track): Multilingual Semantic Sourcing using Product Images for Cross-lingual Alignment
- SIGDial 2018: A Context-aware Convolutional Natural Language Generation model for Dialogue Systems
- March 2024: You can now train a 70b language model at home - In collaboration with Answer.AI
- February 2024: 🤗 PEFT welcomes new merging methods
- January 2024: Finetune LLMs on your own consumer hardware using tools from PyTorch and the Hugging Face ecosystem
- December 2023: Mixture of Experts Explained
- October 2023: Personal Copilot: Train Your Own Coding Assistant
- October 2023: Falcon 180B Finetuning using 🤗 PEFT and DeepSpeed
- September 2023: Fine-tuning Llama 2 70B using PyTorch FSDP
- August 2023: Large Scale Training of Hugging Face Transformers on TPUs With PyTorch/XLA FSDP
- June 2023: The Falcon has landed in the Hugging Face ecosystem
- May 2023: Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA
- February 2023: 🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware
- June 2022: Accelerate Large Model Training using DeepSpeed
- May 2022: Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel
- December 2023: Hands-on session on Generative AI @ ACM Winter School India Chapter 2023
- December 2023: Generative AI for All. 🤗 PEFT: Finetuning made simple, efficient and extendable @ NeurIPS Conference 2023 - Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
- October 2023: Training a LLaMA in your Backyard: Fine-tuning Very Large Models on Consumer Hardware @ PyTorch Conference 2023
- August 2023: Unleashing LLMs: Training, Finetuning, and Evaluating @ DataHack Summit 2023 (Analytics Vidhya)
- August 2023: Parameter-Efficient Fine-Tuning: Doing more with less @ DataHack Summit 2023 (Analytics Vidhya)