Skip to content

Files

Latest commit

author
Xiao Shi Liang
May 31, 2019
1789c91 · May 31, 2019

History

History

python-pctr

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
May 31, 2019
May 31, 2019
May 31, 2019
May 31, 2019

Introduction

This project presents our ML solution for IJCAI 2018 Alimama pCTR competition.

Composition

  • run.ipynb: Jupyter Notebook of our solution.
  • data_processing.py: Feature engineering part of our solution.
  • datasets: train and test datasets of the competition can be found here.

Feature Engineering

Following categories of features are extracted.

  • User Features: features describing user properties (such as age, gender) and user behaviors (such as user's queries during a day/hour).
  • Item Features: features describing item properties.
  • Shop Features: features describing shop properties.
  • Context Features: features describing context properties.
  • Other Features: such as instance id, day, hour, etc.

ML algorithm

Our algorithm is based on LightGBM, which is a fast, distrbuted and high performance gradient boosting framework based on decision tree algorithms. Specifically, we use the LGBMClassifier to efficiently train our model and predict on unknown instances.

Outcome

Our solution achieved top 10% scores in the competition.