Skip to content

This is the final project for JCDS 0506 BDG Program about Email Classification

Notifications You must be signed in to change notification settings

Johanklemantan/Shopee-Email-Campaign-Prediction

Repository files navigation

Shopee Email Campaign Prediction

By Johan Klemantan

This is the final project for JCDS 0506 BDG Program about Email Classification

Background

Sending email is one of the alternatives used by Shopee to tell and promote certain event sale to users. With the ability to be able to predict whether user is reading or not reading the email, enable Shopee to evaluate their crew works during campaign period until event launch. This is because, when user is open the email, the possibility of that user know there is an event or promo will be highly increased and also increasing the possibility of that user do transaction during event period. Therefore, with the making of this model, hopefully Shopee able to build a strategy for future marketing.

Source : Kaggle

Goals

The goals of this project is to build model that able to predict whether user is going to read the email sent by Shopee. In this case, Shopee's priority is user.

Data Used

  • 18 columns and 73539 rows for train data
  • 6 columns and 127886 rows for user data

Train Data Information

  1. country_code = Code Number of Country in Shopee's user
  2. grass_date = Date when the email was sent
  3. user_id = Unique number each user
  4. subject_line_length = Character Length for subject of email sent
  5. last_open_day = How many days this user last open his email
  6. last_login_day = How many days this user last open his Shopee account
  7. last_checkout_day = How many days this user last checkout at his Shopee account
  8. open_count_last_[10/30/60]_ days = Total email that open during N days
  9. login_count_last_[10/30/60]_ days = Total times this user open his Shopee account during N days
  10. checkout_count_last_[10/30/60]_ days = Total times this checkout from his Shopee account during N days
  11. open_flag = Target Variable Whether this user open his email or not
  12. row_id = Row number

User Data Information

  1. user_id = Unique number each user
  2. attr_[1/2] = General attribute each user. Boolean type
  3. attr_3 = General attribute each user. Int type
  4. age = age of the user
  5. domain = domain email of each user

Data Visualization

Relationship Age to Promotion Email Read Rate and Total of Shopee User

Target Variable and Relationship of User Country to Predicting Promotion Email Read Rate

Result

After searching the least Total Loss, found the most fitted model is :

  1. Logistic Regression with Benchamrk and Threshold 0.49
  2. Saved money = 290 million Rupiah/month (only for Shopee user in Indonesia)
  3. Unnecessary Email 14.11 %
  4. Loss Potential Customer 4.01 %
  5. Total Loss 34.18 %

Dashboard

Home

Background

Database

Business Problem Visualization

Prediction

Results

Thank You for Your Attention :)

About

This is the final project for JCDS 0506 BDG Program about Email Classification

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published