👽 Galicia Datathon 👽

This project was developed for the Galicia Datathon, hosted on Kaggle.

The competition aimed to predict positive conversions among Galicia's clients in the first quarter of 2019. To that end, a dataset with users' behavior on Galicia's website and actual conversions in 2018 was provided. The dataset can be downloaded from here.

The notebook galicia1.ipynb contains the trained models, optimised where appropriate. The implemented models are: light gradient boosting (kernel), Decision Tree, AdaBoost, RandomForestClassifier, CatBoost*, XGBoost*, Support Vector Machine, and Clustering. Finally, a stacking solution was submitted to the Kaggle competition. Feature selection was done through PCA and SelectKBest(). Models were trained and validated with different kinds of splits (KFold or a simple train/test split). It's worth noting that, although I finished in the top third of competitors, taking part was mostly an excuse to implement different models.
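As a rough illustration of how these pieces can fit together, here is a minimal sketch that chains SelectKBest with a stacking ensemble and validates it with KFold. It is not the notebook's actual code: the synthetic data, the choice of base models, the `k=10` setting, the logistic-regression meta-learner, and the ROC AUC metric are all assumptions made for the example. Only scikit-learn is used, so the LightGBM, XGBoost, and CatBoost models are left out.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier, StackingClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeClassifier

# Synthetic, imbalanced stand-in for the competition data (NOT the real dataset).
X, y = make_classification(
    n_samples=600, n_features=30, n_informative=8,
    weights=[0.9, 0.1], random_state=0,
)

# Base learners: an assumed subset of the models listed above.
base_models = [
    ("tree", DecisionTreeClassifier(max_depth=4, random_state=0)),
    ("forest", RandomForestClassifier(n_estimators=50, random_state=0)),
    ("ada", AdaBoostClassifier(n_estimators=50, random_state=0)),
]

# The meta-learner combines out-of-fold predictions of the base learners.
# LogisticRegression is an assumption; the notebook may use something else.
stack = StackingClassifier(
    estimators=base_models,
    final_estimator=LogisticRegression(max_iter=1000),
    cv=3,
)

# Feature selection sits inside the pipeline so it is refit on each training fold.
model = make_pipeline(SelectKBest(score_func=f_classif, k=10), stack)

# KFold validation; ROC AUC is an assumed metric, not necessarily the competition's.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(model, X, y, cv=cv, scoring="roc_auc")
print("ROC AUC per fold:", np.round(scores, 3))
print("Mean ROC AUC:", round(float(scores.mean()), 3))
```

Keeping SelectKBest inside the pipeline avoids selecting features on data that is later used for validation. Swapping `SelectKBest(...)` for `sklearn.decomposition.PCA(n_components=10)` in the same position would give a PCA-based variant.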

respuesta_final.csv contains the solution submitted to the Datathon, which is the result of the stacking prediction. Although the code produces .zip files with the solutions of each individual model, these are not provided. In any case, these solutions could only be checked against the actual conversions in the first quarter of 2019, and Kaggle does not publish this data, which was used to evaluate competitors.

Skills: Python, pandas, PCA, feature selection, scikit-learn (Decision Tree, SVM, RandomForest, Clustering), LGBM, XGBoost, CatBoost, Stacking.

iseka-dev/Galicia-datathon
