Skip to content

leandroroser/meli_data_challenge_2021

Repository files navigation

Meli Data Challenge 2021

https://ml-challenge.mercadolibre.com/

Goal of the project

The task is to predict how long it will take for the inventory of a certain item to be sold completely.

For evaluation, given the item target stock, a prediction will be provided for the number of days it will take an item to run out of stock. Possible values range from 1 to 30. Rather than giving a point estimate, it is expected to provide a probability value for each of the possible outcomes.

Exploratory data analysis

Check out the preprocessing Notebook or this kernel : https://www.kaggle.com/leangab/processing-meli-aug-05

Strategy

Data was pre-processed with Pyspark and modeled via XGBoost and a softmax objective function. Hyperparameters were optimized with Ray Tune.

It was computed the running total for a given SKU during the first and the last 30 days. The first portion of the data was used as training set and the second as validation set.

Execution

  • Install the dependencies with pip via the requirements.txt file located in the main directory

pip install -r requirements.txt

  • Then run main.py:

python main.py

Results

A csv with the outcome of the analysis will be generated in the DATA folder.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published