https://ml-challenge.mercadolibre.com/
The task is to predict how long it will take for the inventory of a certain item to be sold completely.
For evaluation, given the item target stock, a prediction will be provided for the number of days it will take an item to run out of stock. Possible values range from 1 to 30. Rather than giving a point estimate, it is expected to provide a probability value for each of the possible outcomes.
Check out the preprocessing Notebook or this kernel : https://www.kaggle.com/leangab/processing-meli-aug-05
Data was pre-processed with Pyspark and modeled via XGBoost and a softmax objective function. Hyperparameters were optimized with Ray Tune.
It was computed the running total for a given SKU during the first and the last 30 days. The first portion of the data was used as training set and the second as validation set.
- Download the input files from https://ml-challenge.mercadolibre.com/rules, and place them into the DATA folder
- Install the dependencies with pip via the requirements.txt file located in the main directory
pip install -r requirements.txt
- Then run main.py:
python main.py
A csv with the outcome of the analysis will be generated in the DATA folder.