Skip to content

equinor/force-ml-2020-wells

Repository files navigation

FORCE 2020 Machine Learning Contest - Well logs

This GitHub repo relates to the Well Log competition of the FORCE 2020 Machine Learning Contest, sponsored by Equinor.

The objective of the competition is to correctly predict lithology labels for provided well logs, provided NPD lithostratigraphy and well X, Y position. The training dataset contains 98 wells, whereas the test dataset includes 10 wells.

Firstly, the team went through an Exploratory Data Analysis (EDA) phase, aiming to have a better understanding of the dataset by looking at different types of plots (boxplots, cross-plots, etc) and data completeness. Secondly, a preprocessing phase aimed to perform some cleaning and feature engineering in order to reinforce the model. The modelling strategy consisted in using XGB and CatBoost classifiers to predict the lithology labels based on the different features available. The model had been applied to regional clusters in an attempt to capture local geological information. Some learnings can also be taken away concerning data visualization (well logs visualization, map view, QC, etc).

A general presentation can be found on SharePoint following this link.

More information can be found on the Xeek platform for this event.

The link to the model weights is: link

Note1: the force_h3g.py script, the test dataset and the model weights should be in the same folder.

Note2: the default of the force_h3g.py script is to run prediction for file 'test.csv'. If you want to train the models, change the parameter 'train' from False to True.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •