This GitHub repo relates to the Well Log competition of the FORCE 2020 Machine Learning Contest, sponsored by Equinor.
The objective of the competition is to correctly predict lithology labels for provided well logs, provided NPD lithostratigraphy and well X, Y position. The training dataset contains 98 wells, whereas the test dataset includes 10 wells.
Firstly, the team went through an Exploratory Data Analysis (EDA) phase, aiming to have a better understanding of the dataset by looking at different types of plots (boxplots, cross-plots, etc) and data completeness. Secondly, a preprocessing phase aimed to perform some cleaning and feature engineering in order to reinforce the model. The modelling strategy consisted in using XGB and CatBoost classifiers to predict the lithology labels based on the different features available. The model had been applied to regional clusters in an attempt to capture local geological information. Some learnings can also be taken away concerning data visualization (well logs visualization, map view, QC, etc).
A general presentation can be found on SharePoint following this link.
More information can be found on the Xeek platform for this event.
The link to the model weights is: link