SDAIA Machine Learning Bootcamp Projects

Overview

During my tenure as a Machine Learning Bootcamp Intern at SDAIA (Saudi Data and Artificial Intelligence Authority), I embarked on an intensive journey into the realm of machine learning. The experience has been instrumental in shaping my understanding of machine learning concepts and preparing me for future challenges in the field.

Learning Path

Starting with the fundamentals of data preprocessing, I progressed through a range of topics, including:

Supervised Learning: Implemented various supervised learning models such as linear and logistic regression, decision trees, and more.
Unsupervised Learning: Gained experience in clustering and other unsupervised techniques.
Model Evaluation: Conducted statistical evaluations of models to ensure their accuracy and effectiveness.

Skills Acquired

By the conclusion of the bootcamp, I emerged equipped with the knowledge and skills to:

Design Machine Learning Models: From scratch, based on problem requirements.
Train Algorithms: Build and train models using diverse data sets.
Evaluate Models: Use appropriate metrics to assess the performance of algorithms and make informed decisions.

Projects

1. Hotel Booking Data Cleaning

Objective: Clean and preprocess hotel booking data for further analysis.
Key Tasks:
- Handled missing and duplicate data.
- Performed feature selection to remove irrelevant or redundant features.
- Applied data transformation techniques for uniformity in data.

2. Loan Approval System

Objective: Develop a machine learning model to predict whether a loan will be approved based on applicant information.
Key Tasks:
- Built a logistic regression model to classify loan applications.
- Conducted feature engineering to create new variables that improved model performance.
- Evaluated the model using precision and recall to balance correct approvals and rejections.

3. Diabetes Prediction Using Linear Regression

Objective: Predict the likelihood of diabetes onset in patients based on medical data.
Key Tasks:
- Cleaned and preprocessed medical data.
- Applied linear regression to predict diabetes onset based on factors like BMI, glucose levels, and age.
- Evaluated the model using mean squared error and visualized accuracy through scatter plots.

4. Swine Flu Detection Using Decision Trees

Objective: Create a decision tree model to detect swine flu cases based on patient symptoms.
Key Tasks:
- Preprocessed patient symptom data and handled missing values.
- Built a decision tree to classify cases as swine flu positive or negative.
- Pruned the tree to reduce complexity and improve generalization.

5. Smoker Detection Using Classification Models

Objective: Predict whether a person is a smoker based on demographic and medical data.
Key Tasks:
- Implemented logistic regression and K-Nearest Neighbors to classify individuals as smokers or non-smokers.
- Compared model performance using confusion matrices and AUC-ROC curves.
- Focused on improving recall to better identify smokers.

6. Predictive Modeling Using Regression Techniques

Objective: Use multi-linear regression to predict continuous outcomes.
Key Tasks:
- Worked on datasets requiring prediction of variables like housing prices or insurance premiums.
- Optimized models using regularization techniques like Ridge and Lasso regression.
- Evaluated model performance with R-squared and root mean square error (RMSE).

Key Takeaways

Gained hands-on experience with both supervised and unsupervised machine learning models.
Acquired proficiency in transforming raw data into actionable insights through preprocessing and feature engineering.
Mastered model evaluation techniques to ensure high performance and generalization on unseen data.
Learned how to optimize models through hyperparameter tuning and regularization techniques to improve accuracy and reduce overfitting.

Conclusion

The SDAIA Machine Learning Bootcamp has been a transformative experience, allowing me to deeply engage with machine learning concepts and real-world applications. Through practical projects, I honed my ability to design, train, and evaluate machine learning models, ensuring they are optimized for performance and accuracy. This experience has solidified my confidence in tackling complex data science challenges and leveraging machine learning to create impactful, data-driven solutions.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Assignment 1\ Hotel Booking Data Cleaning		Assignment 1\ Hotel Booking Data Cleaning
Assignment 2\ Implementing a loan approval.ipynb		Assignment 2\ Implementing a loan approval.ipynb
Assignment 3\Linear_Regression_for_Diabetes_Prediction.ipynb		Assignment 3\Linear_Regression_for_Diabetes_Prediction.ipynb
Assignment 4\ A_fusion_of_multi_linear_&_Logistic_Regression.ipynb		Assignment 4\ A_fusion_of_multi_linear_&_Logistic_Regression.ipynb
Assignment 5\ Decision_Trees_for_Swine_Flu_Detection.ipynb		Assignment 5\ Decision_Trees_for_Swine_Flu_Detection.ipynb
Asssignmet_6_A_Better_Smoker_Detector.ipynb		Asssignmet_6_A_Better_Smoker_Detector.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SDAIA Machine Learning Bootcamp Projects

Overview

Learning Path

Skills Acquired

Projects

1. Hotel Booking Data Cleaning

2. Loan Approval System

3. Diabetes Prediction Using Linear Regression

4. Swine Flu Detection Using Decision Trees

5. Smoker Detection Using Classification Models

6. Predictive Modeling Using Regression Techniques

Key Takeaways

Conclusion

About

Releases

Packages

Languages

Sadcem/SDAIA-ML-Bootcamp

Folders and files

Latest commit

History

Repository files navigation

SDAIA Machine Learning Bootcamp Projects

Overview

Learning Path

Skills Acquired

Projects

1. Hotel Booking Data Cleaning

2. Loan Approval System

3. Diabetes Prediction Using Linear Regression

4. Swine Flu Detection Using Decision Trees

5. Smoker Detection Using Classification Models

6. Predictive Modeling Using Regression Techniques

Key Takeaways

Conclusion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages