These are my projects for machine learning. I use regression analysis, logistic regression, hypothesis testing, time series and differnt models to train my data.
Analyze the SFO and LAX data sets and determine if either marketing campaign was successful in raising the average miles driven per Uber driver.
- Use logistic regression to predict when customers are going to transact
- Determine the causes for a transaction
- Evaluated the performance the model
- Use logistic regression to predict when people are going to leave a company
- Determine the causes for attrition
- Evaluated the performance the model
- Use regularization to predict salaries for a sports player
- Explain the output of the regularized models
- Remove/manipulate/transform features from the data set, remain only useful data
- Graphically and numerically describe model performance and find the relation between them
- Apply regression analysis techniques and EDA principles to find out what features will influence the rental price
- Trialed a list of different Machine Learning algorithms, such as Logistic Regression(with Lasso & Ridge), Decision Tree, KNN Classifier, and Random Forest Classifier, and Linear Regression to predict potential customer churn and customer life time value.
- Provided the best model that has achieved the highest AUC value with lowest MSE(Mean Squared Error).
- Contructed the particial dependece plot to discover how the most 6 importance features related to the customer churn.
- Implemented ARIMA model, analyzed 2 data sets to predict the values for the next 8 time periods and the subsequent 7 years (with confidence intervals), and make 3 observations about the data (i.e., describe its composition and characteristics).
- Using Moving Average, Exponential smoothing, AR and ARIMA model to forecast video CTR (click through rate)
- Select a performance measure for the model and pick the best performing model with lowest MSE.
- Determine the causes of active day
- Use multiple regression model to predict players' active day