Skip to content

My Exercises from Kaggle's Data Science and Machine Learning Courses.

Notifications You must be signed in to change notification settings

federicobrancasi/Kaggle-Courses

Repository files navigation

Kaggle Courses

Welcome to this repository! Here you will find a comprehensive collection of exercises done during various courses on Kaggle, designed to help you develop the skills needed for independent data science projects.

🚀 Learn and Test Yourself:

Each exercise in this repository reflects my commitment to mastering the fundamental skills needed to tackle real-world challenges in data science. Courses on Kaggle have provided me with a solid foundation, and these exercises represent my learning journey from basic concepts to advanced techniques.

📂 Repository Courses:

Each folder contains detailed exercises, solutions and annotations created during the learning journey. I hope this collection will be inspiring and helpful to anyone who wants to dive into the fascinating world of data science and machine learning.

🌐 Explore, Share, Contribute:

Feel free to explore, clone the repository and use the exercises for your own learning. If you have suggestions, corrections or new exercises to share, they will be more than welcome! This space is also designed for collaboration and knowledge exchange.

Happy coding! 🚀📊✨

Intro to Machine Learning

  1. How Models Work
    The first step if you're new to machine learning.

  2. Basic Data Exploration
    Load and understand your data.

  3. Your First Machine Learning Model
    Building your first model. Hurray!

  4. Model Validation
    Measure the performance of your model, so you can test and compare alternatives.

  5. Underfitting and Overfitting
    Fine-tune your model for better performance.

  6. Random Forests
    Using a more sophisticated machine learning algorithm.

  7. Machine Learning Competitions
    Enter the world of machine learning competitions to keep improving and see your progress.

Certificate

Illustration
Intro to Machine Learning - Certificate

Pandas

  1. Creating, Reading and Writing
    You can't work with data if you can't read it. Get started here.

  2. Indexing, Selecting & Assigning
    Pro data scientists do this dozens of times a day. You can, too!

  3. Renaming and Combining
    Data comes in from many sources. Help it all make sense together.

  4. Summary Functions and Maps
    Extract insights from your data.

  5. Grouping and Sorting
    Scale up your level of insight. The more complex the dataset, the more this matters.

  6. Data Types and Missing Values
    Deal with the most common progress-blocking problems.

Certificate

Illustration
Pandas - Certificate

Intermediate Machine Learning

  1. Introduction
    Review what you need for this Micro-Course.

  2. Missing Values
    Missing values happen. Be prepared for this common challenge in real datasets.

  3. Categorical Variables
    There's a lot of non-numeric data out there. Here's how to use it for machine learning.

  4. Pipelines
    A critical skill for deploying (and even testing) complex models with pre-processing.

  5. Cross-Validation
    A better way to test your models.

  6. XGBoost
    The most accurate modeling technique for structured data.

  7. Data Leakage
    Find and fix this problem that ruins your model in subtle ways.

Certificate

Illustration
Intermediate Machine Learning - Certificate

Data Visualization

  1. Hello, Seaborn
    Your first introduction to coding for data visualization.

  2. Line Charts
    Visualize trends over time.

  3. Bar Charts and Heatmaps
    Use color or length to compare categories in a dataset.

  4. Scatter Plots
    Leverage the coordinate plane to explore relationships between variables.

  5. Distributions
    Create histograms and density plots.

  6. Choosing Plot Types and Custom Styles
    Customize your charts and make them look snazzy.

  7. Final Project
    Practice for real-world application.

Certificate

Illustration
Data Visualization - Certificate

Feature Engineering

  1. What Is Feature Engineering
    Learn the steps and principles of creating better features

  2. Mutual Information
    Locate features with the most potential.

  3. Creating Features
    Transform features with Pandas to suit your model.

  4. Clustering With K-Means
    Untangle complex spatial relationships with cluster labels.

  5. Principal Component Analysis
    Discover new features by analyzing variation.

  6. Target Encoding
    Boost any categorical feature with this powerful technique.

Certificate

Illustration
Feature Engineering - Certificate

Intro to SQL

  1. Getting Started With SQL and BigQuery
    Learn the workflow for handling big datasets with BigQuery and SQL.

  2. Select, From & Where
    The foundational compontents for all SQL queries.

  3. Group By, Having & Count
    Get more interesting insights directly from your SQL queries.

  4. Order By
    Order your results to focus on the most important data for your use case.

  5. As & With
    Organize your query for better readability. This becomes especially important for complex queries.

  6. Joining Data
    Combine data sources. Critical for almost all real-world data problems.

Certificate

Illustration
Intro to SQL - Certificate

Advanced SQL

  1. JOINs and UNIONs
    Combine information from multiple tables.

  2. Analytic Functions
    Perform complex calculations on groups of rows.

  3. Nested and Repeated Data
    Learn to query complex datatypes in BigQuery.

  4. Writing Efficient Queries
    Write queries to run faster and use less data.

Certificate

Advanced SQL - Certificate

Intro to Deep Learning

  1. A Single Neuron
    Learn about linear units, the building blocks of deep learning.

  2. Deep Neural Networks
    Add hidden layers to your network to uncover complex relationships.

  3. Stochastic Gradient Descent
    Use Keras and Tensorflow to train your first neural network.

  4. Overfitting and Underfitting
    Improve performance with extra capacity or early stopping.

  5. Dropout and Batch Normalization
    Add these special layers to prevent overfitting and stabilize training.

  6. Binary Classification
    Apply deep learning to another common task.

Certificate

Intro to Deep Learning - Certificate

Computer Vision

  1. The Convolutional Classifier
    Create your first computer vision model with Keras.

  2. Convolution and ReLU
    Discover how convnets create features with convolutional layers.

  3. Maximum Pooling
    Learn more about feature extraction with maximum pooling.

  4. The Sliding Window
    Explore two important parameters: stride and padding.

  5. Custom Convnets
    Design your own convnet.

  6. Data Augmentation
    Boost performance by creating extra training data.

Certificate

Computer Vision - Certificate

Time Series

  1. Linear Regression With Time Series
    Use two features unique to time series: lags and time steps.

  2. Trend
    Model long-term changes with moving averages and the time dummy.

  3. Seasonality
    Create indicators and Fourier features to capture periodic change.

  4. Time Series as Features
    Predict the future from the past with a lag embedding.

  5. Hybrid Models
    Combine the strengths of two forecasters with this powerful technique.

  6. Forecasting With Machine Learning
    Apply ML to any forecasting task with these four strategies.

Certificate

Time Series - Certificate

Data Cleaning

  1. Handling Missing Values
    Drop missing values, or fill them in with an automated workflow.

  2. Scaling and Normalization
    Transform numeric variables to have helpful properties.

  3. Parsing Dates
    Help Python recognize dates as composed of day, month, and year.

  4. Character Encodings
    Avoid UnicodeDecodeErrors when loading CSV files.

  5. Inconsistent Data Entry
    Efficiently fix typos in your data.

Certificate

Data Cleaning - Certificate

Intro to AI Ethics

  1. Introduction to AI Ethics
    Learn what to expect from the course.

  2. Human-Centered Design for AI
    Design systems that serve people’s needs. Navigate issues in several real-world scenarios.

  3. Identifying Bias in AI
    Bias can creep in at any stage in the pipeline. Investigate a simple model that identifies toxic text.

  4. AI Fairness
    Learn about four different types of fairness. Assess a toy model trained to judge credit card applications.

  5. Model Cards
    Increase transparency by communicating key information about machine learning models.

Certificate

Intro to AI Ethics - Certificate

Geospatial Analysis

  1. Your First Map
    Get started with plotting in GeoPandas.

  2. Coordinate Reference Systems
    It's pretty amazing that we can represent the Earth's surface in 2 dimensions!

  3. Interactive Maps
    Learn how to make interactive heatmaps, choropleth maps, and more!

  4. Manipulating Geospatial Data
    Find locations with just the name of a place. And, learn how to join data based on spatial relationships.

  5. Proximity Analysis
    Measure distance, and explore neighboring points on a map.

Certificate

Geospatial Analysis - Certificate

Machine Learning Explainability

  1. Use Cases for Model Insights
    Why and when do you need insights?

  2. Permutation Importance
    What features does your model think are important?

  3. Partial Plots
    How does each feature affect your predictions?

  4. SHAP Values
    Understand individual predictions.

  5. Advanced Uses of SHAP Values
    Aggregate SHAP values for even more detailed model insights.

Certificate

Machine Learning Explainability - Certificate

Intro to Game AI and Reinforcement Learning

  1. Play the Game
    Write your first game-playing agent.

  2. One-Step Lookahead
    Make your agent smarter with a few simple changes.

  3. N-Step Lookahead
    Use the minimax algorithm to dramatically improve your agent.

  4. Deep Reinforcement Learning
    Explore advanced techniques for creating intelligent agents.

Certificate

Intro to Game AI and Reinforcement Learning - Certificate