This repository contains the materials and source code for the tutorial “Quick Start Guide of Kaggle: Machine Learning Competitions with Python” (Pythonで機械学習コンペティション「Kaggle」をはじめよう) at SciPy Japan 2020, held on October 30.
On Kaggle, a high-profile machine learning competition platform, data scientists from all over the world use Python to build machine learning models. In this hands-on tutorial, you'll learn the basics of machine learning and Kaggle by running notebook-style source code. The objective is to help participants learn how to compete in, and learn from, Kaggle competitions using Python. The speaker has won first place in a Kaggle competition, hosted a Kaggle Days Tokyo competition, and published a technical book for beginners.
What is machine learning & Kaggle?
- Participation in a competition
- How to use the Python environment on Kaggle
- Loading packages
- Loading datasets
- Feature engineering
- Training and prediction of machine learning algorithms
- Submission to the leaderboard
- Exploratory data analysis
- Adding hypothesis-based features
- Switching machine learning algorithms
- Hyperparameter tuning
- The importance of validation
- Ensembling
Wrap up & future resources
- Start: https://www.kaggle.com/sishihara/python-kaggle-start-book-ch02-01
- Benchmark: https://www.kaggle.com/sishihara/python-kaggle-start-book-ch02-02
- Exploratory data analysis: https://www.kaggle.com/sishihara/python-kaggle-start-book-ch02-03
- Adding hypothesis-based features: https://www.kaggle.com/sishihara/python-kaggle-start-book-ch02-04
- Switching machine learning algorithms: https://www.kaggle.com/sishihara/python-kaggle-start-book-ch02-05
- Hyperparameter tuning: https://www.kaggle.com/sishihara/python-kaggle-start-book-ch02-06
- The importance of validation: https://www.kaggle.com/sishihara/python-kaggle-start-book-ch02-07
- Ensembling: https://www.kaggle.com/sishihara/python-kaggle-start-book-ch02-08
All are from: https://github.com/upura/python-kaggle-start-book
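The validation and ensembling topics listed above can be illustrated without any third-party packages. The following is a minimal sketch, not the code from the notebooks: the helper names `k_fold_indices`, `accuracy`, and `majority_vote`, and the toy labels and predictions, are all hypothetical and chosen only for illustration. In practice you would more likely rely on an established library for these steps.

```python
import random
from statistics import mean


def k_fold_indices(n_samples, n_splits=5, seed=0):
    """Yield (train_idx, valid_idx) pairs; each sample is validated exactly once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into n_splits folds of near-equal size.
    folds = [indices[i::n_splits] for i in range(n_splits)]
    for k in range(n_splits):
        valid_idx = folds[k]
        train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
        yield train_idx, valid_idx


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return mean(1.0 if t == p else 0.0 for t, p in zip(y_true, y_pred))


def majority_vote(predictions):
    """Combine 0/1 predictions from several models by strict majority vote."""
    n_models = len(predictions)
    return [1 if sum(votes) * 2 > n_models else 0 for votes in zip(*predictions)]


# Toy data (hypothetical): true labels and the predictions of three models.
y_true = [1, 0, 1, 1, 0, 0]
model_a = [1, 0, 0, 1, 0, 0]
model_b = [1, 1, 1, 1, 0, 0]
model_c = [0, 0, 1, 1, 0, 1]

ensemble = majority_vote([model_a, model_b, model_c])
folds = list(k_fold_indices(10, n_splits=5))
```

In this toy case each model makes at least one mistake, but the mistakes fall on different samples, so the majority vote recovers every label. Scoring a model only on the `valid_idx` samples it did not train on gives an estimate of leaderboard performance without spending a submission.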
You can see the slides here. If you want to click the URLs in the slides, please download the PDF.
Log in to the Kaggle website. If you don’t have an account, please create one. Be careful: your Kaggle ID is separate from your display name and, unlike the display name, can’t be changed after registration. For example, my Kaggle ID is ‘sishihara’, which can’t be changed, and my display name is ‘u++’, which can be changed on the profile page.