Skip to content

Customer Personality Analysis Mini-Project for SC1015 - Introduction to Data Science and Artificial Intelligence

Notifications You must be signed in to change notification settings

BoonHianLim/SC1015-Customer-Personality-Analysis

Repository files navigation

SC1015-Group9

About:

This is a Mini-Project for the course SC1015 (Introduction to Data Science and Artificial Intelligence). The dataset used in this project is extracted from Customer Personality Analysis.

The source code, in order, are as follows:

  1. Data Exploration
  2. Data Cleaning
  3. Data Visualization
  4. Data Splitting
  5. Linear Regression
  6. Poisson Regression

Contributors:

  • Lim Boon Hian - Data Exploration, Data Cleaning, Data Visualization
  • Lim Dong Wan - Linear Regression, Data Splitting
  • Marvell Ung Wew - Poisson Regression, Grid Search Cross Validation

Problem Definition

Predicting customer sales using customer information

Variables in Focus

Response variables: TotalPurchase, MntGroceryProducts, MntWines, MntGoldProds

Predictor variables:

  • Categorical:
    • Education
    • Marital_Status
    • HaveChild
    • YearRange
  • Numerical:
    • Income
    • TotalChild
    • NumWebVisitsMonth

Machine Learning Models Used

  1. Linear Regression
  2. Poisson Regression

Insights and Recommendation

Main insight: The company utilizes customer-focused sales tactics with customer’s income as a guideline.

Recommendations:

  1. Target audience should be customers who have / are:
    • fewer children
    • higher income
    • higher levels of education
    • born in 1940 - 1950
  2. Improve online sales by upgrading company website, so as to make it more attractive and appealing to customers.

Conclusion

  1. Customer income serves as the most important predictor to predict customer expenditure
  2. Linear Regression yield better results than Poisson Regression
  3. GridSearch CV only marginally improved the Linear Regression model

Things Learnt

  1. K-fold splitting
  2. Poisson Regression
  3. Plotly subplot
  4. Grid Search Cross Validation
  5. Altair interactive plot
  6. Get dummy values for categorical variables in Linear Regression
  7. Collaborating on Github :)

References

About

Customer Personality Analysis Mini-Project for SC1015 - Introduction to Data Science and Artificial Intelligence

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published