My name is
- β‘ Previously I worked as Machine Learning Engineer at Lincoln for a couple of weeks before moving to the US for my Master in Business & Data Science at Texas Tech University, Rawls College of Business. Before that, I was Data Scientist for 2 years at Axionable, first Sustainable AI startup in France and Canada. Also I spent 2 years and 6 months at IBM as Machine Learning Consultant.
- β€οΈ I love Data Science, Natural Language Processing, Cloud Computing & MLOps
- π©Ί What keeps me in shape
- When I was in France, I had Taekwondo classes π₯ on Tuesday, Thursday, Friday & Saturday at Mudo Club Argenteuil
- Daily morning runner ππΎ
- Occasional football player β½οΈ with friends
- AttiΓ©kΓ©, Yassa, MafΓ©, Thieb, etc. π
- π± Iβm addicted to continuous learning, which makes me grow on a regular basis
- π I'm sharing my knowledge through my blog in order to make good impact on others life
- π« How to find me
- π ZoumDataScience
- πΉ YouTube Channel
- π‘ Medium Articles
- π’ LinkedIn
This is the collection of all the resources I have created, organized by topics.
Subscribe to:
- My YouTube channel for videos related to Python and data sience
- My Medium newsletter for updates of my blogs in your mailbox
- ZoumDataScience for Pandas and Python tips and trics in your mailbox
- Data Science
- Machine Learning
- MLOps
- Natural Language Processing
- Large Language Models
- Retrieval Augmented Generation
- Python
- Pandas & Python Tricks
- Computer Vision
Title | Article Link | Video |
---|---|---|
A simple way to understand Association Rule from the Customer Basket Analysis Use Case | π | |
Different Metrics to Evaluate Binary Classification Models and Some Strategies to Choose the Right One | π | |
Introduction to Mito: Spreadsheet for Data Scientists That Also Generates Python Codes | π | |
When R Meets SQL to Query Dataframes | π | |
5 Essential Tools to Start a Career in Data Science and Data Analytics | π | |
4 Types of SQL JOIN Every Data Scientist Should Know: Visual Representation | π | |
Data Preprocessing Using Pipeline in Pandas | π | π |
The guide to choosing the right database for my project: MongoDB vs. MySQL | π | |
How to Run SQL Queries On Your Pandas DataFrames With Python | π | π |
Algorithmic Bias in Healthcare and Some Strategies for Mitigating It | π | |
Which One of These 2 Open-Source Libraries Is Better for Processing Gigabytes of Data? | π | π |
ChatGPT for Data Scientists, Data Analysts, and Programmers | π | π |
Tableau Data Blending TutorialβββA Step-By-Step Guide For Beginners | π | |
Fundamentals of Statistics All Data Scientists & Analysts Should KnowβββWith CodeβββPart 1 | π | π |
Everything You Need to Know About Heatmap β Tutorial With PowerBI | π | |
Top Techniques to Handle Missing Values Every Data Scientist Should Know | π | |
An Introduction to Hierarchical Clustering in Python | π | |
Multiple Linear Regression in R: Tutorial With Examples | π | |
NoSQL Databases: What Every Data Scientist Needs to Know | π |
Title | Article Link | Video |
---|---|---|
Transfer Learning: Understand the Big Picture & Make the Right Choices for Your Use Case | π | |
Overview Of 4 Model Validation Approaches to Mitigate Overfitting Problem | π | |
eXplainable AI (XAI): LIME & SHAP, Two Great Candidates to Help You Explain Your Machine Learning Models | π | |
Using Gradio To Create Apps For Your Machine Learning Models | π | π |
How to Perform KMeans Clustering Using Python | π | π |
Classification in Machine Learning: An Introduction | π |
Title | Article Link | Video |
---|---|---|
Create An Awesome Streamlit App & Deploy it With Docker | π | |
Machine Learning models monitoring made easy with Mlfow, a concrete use case with Python API | π | |
When Your Machine Learning model teams up with Django REST API, A successful deployment into production | π | |
NLP MLops Project With DagsHub β Multi-Language Sentiment Classification Using Transformers β Part 1 | π | |
NLP MLops Project With DagsHub β Deploy Your Streamlit App On AWS EC2 Instance β Part 2 | π | |
Step-by-step Approach to Build Your Machine Learning API Using Fast API | π | |
Data And Model Versioning With DVC And Azure Blob Storage | π | |
GitHub Actions for Machine Learning: Train, Test and Deploy Your ML Model on AWS EC2. | π | |
CI/CD for Machine Learning Model Training with GitHub Actions | π | |
Speed Up Your Model Training with DagsHub Direct Data Access on AWS | π | |
Git Reset and Revert Tutorial for Beginners | π |
Title | Article Link | Video |
---|---|---|
Do You Want To Cluster Unlabeled Text Data? Try Out Topic Modeling | π | |
Financial Text Classification With Deep Learning Using FinBERT | π | |
Named Entity Recognition with Spacy and the Mighty roBERTa | π | π |
Scientific Documents Similarity Search With Deep Learning Using Transformers (SciBERT) | π | |
Meet BERTopicβ BERTβs Cousin For Advanced Topic Modeling | π | π |
Unsupervised Multilingual Text Classification With Zero-Shot Approach | π | |
Semantic Keywords And Keyphrases Extraction With KeyBERT | π | |
4 NLP Libraries for Automatic Language Identification of Text Data In Python | π | |
Data Augmentation in NLP Using Back Translation With MarianMT | π | π |
Social Media Sentiment Analysis In Python With VADER β No Training Required! | π | π |
Stemming, Lemmatizationβ Which One is Worth Going For? | π | |
VADER Vs. TextBlob β Which One Is Better For Social Media Sentiment Analysis? | π | |
Most Common Text Processing Tasks In Natural Language Processing | π | π |
How to Perform Speech-to-Text and Translate Any Speech to English With OpenAIβs Whisper | π | π |
Plagiarism Detection Using Transformers | π | π |
Text-to-Image and Image-to-image search Using CLIP | π | |
A Step-by-step Guide to Solving 4 Real-life Problems With Transformers and Hugging Face | π | π |
Text data representation with one-hot encoding, Tf-Idf, Count Vectors, Co-occurrence Vectors and Word2Vec | π | |
Fine-Tuning GPT-3 Using the OpenAI API and Python | π |
Title | Article Link | Video |
---|---|---|
Multimodal Retrieval Augmented Generation Applied To Real World Case β With Code | π | π |
A Framework For Efficiently Serving Your Large Language Models | π | π |
How To Scrape a Web Page With ChatGPT β No Coding Required! | π | π |
How to Chat With Any PDFs and Image Files Using Large Language Models β With Code | π | π |
Multimodal Retrieval Augmented Generation Applied To Real World Case β With Code | π | π |
Document Parsing Using Large Language Models β With Code | π | π |
How to Build Anything With AI Agents - With Code | π |
Title | Article Link | Video |
---|---|---|
How I Built A Video Recommendation System Using Large Language Models and Vector Database | π | |
How to Build RAG based Chatbot: 5 Steps with Amazon Bedrock | π |
Title | Article Link | Video |
---|---|---|
5 Python open-source tools to extract text and tabular data from PDF Files | π | |
When Should You Consider Using Datatable Instead of Pandas to Process Large Data? | π | |
Convert Any Type of Document to Text With Apache Tika Using Python API | π | |
Collect Data From Reddit and Twitterβ 600+ Million Monthly Active Users Platforms | π | |
Knockknock β Probably The Best Python Library For Notifications | π | |
Extract Text Written in Different Languages from Images with Python | π | |
Introduction to Twint: Say Goodbye to Twitter Rate Limitations β Also No Need for A Twitter API! | π | |
Avoid Using βpip freezeβ β Use βpipreqsβ instead | π | |
Extract Tweets Without Limitations in a Few Lines of Code Using Python | π | π |
Collect Data from Twitter: A Step-by-Step Implementation Using Tweepy | π | |
How to Create a Virtual Environment and Use it on Jupyter Notebook | π | π |
Title | Article Link | Video |
---|---|---|
Pandas and Python Tips and Tricks for Data Science and Data Analysis | π | π |
Pandas & Python Tricks for Data Science & Data Analysis β Part 2 | π | π |
Title | Article Link | Video |
---|---|---|
Five Simple Image Data Augmentation Techniques to Mitigate Overfitting In Computer Vision | π | |
YOLO Object Detection Explained | π | |
How to Measure Model Performance in Computer Vision: A Comprehensive Guide | π |