
Kaggle_M5_Forecasting_Accuracy_top4%_Pro

This is my solution to the M5 Forecasting - Accuracy competition on Kaggle. For details, see https://www.kaggle.com/c/m5-forecasting-accuracy/overview. This was my first Kaggle competition; after two months of hard work I finished 172nd (top 4%) and won a silver medal.

Directory structure

dataset: directory where the dataset is stored.
features: directory where the generated feature .pkl files are stored.
models: model files produced during training are saved here.
sub: the generated submission CSV files are placed here.
utils.py: helper functions used throughout the project.
fe.py: run python fe.py to generate the feature files into the features directory (a sketch of the kind of features it builds follows this list).
train_state.py: trains on the dataset partitioned by state and writes state.csv to the sub directory.
train_store.py: trains on the dataset partitioned by store and writes store.csv to the sub directory.
fusion.py: blends state.csv and store.csv with a weighted fusion.
pictures: saved figures.
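
As a rough illustration of what fe.py produces, here is a minimal sketch of lag, price, and holiday features built with pandas. The column names (id, d, demand, sell_price, event_name_1) follow the M5 dataset, but the specific lags and rolling windows are assumptions for illustration, not the exact ones used in this repo.

```python
import pandas as pd

def make_features(df: pd.DataFrame) -> pd.DataFrame:
    """Sketch of the kind of features fe.py generates (not the exact set).

    Expects the long-format M5 frame with columns:
    id, d (integer day index), demand, sell_price, event_name_1.
    """
    df = df.sort_values(["id", "d"]).copy()
    grp = df.groupby("id")["demand"]

    # Lag features: sales shifted by at least 28 days (the forecast horizon).
    for lag in [28, 35, 42]:
        df[f"lag_{lag}"] = grp.shift(lag)

    # Rolling statistics computed on the 28-day-lagged series.
    for win in [7, 28]:
        df[f"rmean_28_{win}"] = grp.transform(
            lambda s: s.shift(28).rolling(win).mean()
        )

    # Price feature: relative change of the selling price.
    df["price_change"] = df.groupby("id")["sell_price"].pct_change()

    # Holiday/event feature: flag days carrying a calendar event.
    df["is_event"] = df["event_name_1"].notna().astype("int8")
    return df
```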

How to run

1. python fe.py
2. python train_state.py
3. python train_store.py
4. python fusion.py

You can also wrap these four steps in a single script, as shown below.
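
For example, a minimal Python runner that executes the four stages in order (a convenience sketch; only the script names are taken from this repo):

```python
import subprocess
import sys

# Run the four pipeline stages in order; stop on the first failure.
for script in ["fe.py", "train_state.py", "train_store.py", "fusion.py"]:
    print(f"=== running {script} ===")
    subprocess.run([sys.executable, script], check=True)
```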

Overall framework

  1. First, fe.py generates the features, including holiday features, price features, lag features, etc.
  2. The dataset covers three states: CA, TX, and WI. We read and train on the data of only two of them, CA and WI, using LGB (LightGBM), because we found that training on CA and WI gives better results than including TX.
  3. We also read the data of all 10 stores, train them separately to obtain one model per store, and forecast each store's sales over the 28-day horizon.
  4. We split the state-level predictions of CA and WI back into per-store predictions (CA_x and WI_x) and fuse them, with weights, with the corresponding store-level predictions to obtain the final result. The overall flow is shown below (see the training and fusion sketches after the diagram):
    [models_picture: diagram of the overall pipeline]
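
A minimal sketch of the per-store LightGBM training described in steps 2 and 3. The feature list, hyperparameters, and validation split are assumptions for illustration, not the exact settings of train_store.py.

```python
import lightgbm as lgb
import pandas as pd

# Hypothetical feature set and parameters; the real ones live in train_store.py.
FEATURES = ["lag_28", "rmean_28_7", "rmean_28_28", "price_change", "is_event"]
PARAMS = {"objective": "tweedie", "metric": "rmse", "learning_rate": 0.05}

def train_one_store(df: pd.DataFrame, store_id: str) -> lgb.Booster:
    """Train one model per store, holding out the last 28 days for validation.

    Assumes `d` is an integer day index and `demand` is the target.
    """
    data = df[df["store_id"] == store_id]
    cutoff = data["d"].max() - 28
    train, valid = data[data["d"] <= cutoff], data[data["d"] > cutoff]
    dtrain = lgb.Dataset(train[FEATURES], label=train["demand"])
    dvalid = lgb.Dataset(valid[FEATURES], label=valid["demand"])
    return lgb.train(PARAMS, dtrain, num_boost_round=1000,
                     valid_sets=[dvalid],
                     callbacks=[lgb.early_stopping(50)])
```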
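
And a sketch of the weighted fusion in step 4: the two submissions are blended row by row. The 0.5/0.5 split here is a placeholder; the actual weights are tuned in fusion.py.

```python
import pandas as pd

w = 0.5  # placeholder weight for the state-level model; tuned in fusion.py
state = pd.read_csv("sub/state.csv").set_index("id")
store = pd.read_csv("sub/store.csv").set_index("id")

# Weighted average of the F1..F28 prediction columns, aligned on id.
fused = w * state + (1 - w) * store
fused.reset_index().to_csv("sub/submission.csv", index=False)
```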
