You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This code demonstrates the basic end-to-end workflow of developing, training, and evaluating a deep artificial neural network classifier on a real-world classification problem involving preprocessing of categorical variables.
This repository contains one of the pre-requisite notebooks for my internship as a Data Analyst at Technocolabs. It includes some of the micro-courses from kaggle.
CUHK Course code: STAT 3011 | This course is designed to strengthen students' ability in statistical computing as well as in processing and analysing data. Students are required to participate in several term projects with emphasis on techniques of data management and analysis.
This repository explores various techniques for handling categorical variables in data preprocessing, focusing on methods such as one-hot encoding, label encoding, and their applications in machine learning models.
The project involves the study of performance analysis of the missForest imputation method for imputing continuous and categorical variables simultaneously.