Skip to content

yimeitang/personalizeExpediaHotelSearches

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

personalizeExpediaHotelSearches

The original data set is obtained from Kaggle.com and consists of a representative sample of 99,917,530 hotels. Also, 52 features are included, such as: price, user history, competitors, etc.

  1. The booking rate is 2.74%.
  2. The click-through-rate is 4.44%
  3. The conversion rate is 61.6 %

Step 1: Analyze and clean the data(it is a sparse dataset with 10M rows and 52 features)

booking_vs_location booking_vs_review_score missing_data mice_imputation

Step 2: Conduct PCA ,CFA and other feature selection methods to choose most relevant features

pca extratreeclassifier feature_selection_results

Step 3: Build different models in CV to achieve better accuracy

decision_tree random_forest

Step 4: Compare model performance.

model performance comparison

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published