Image Source: FORE School Of Management
This repo is developed as a course portfolio for the graduate class "INDE 577 - Data Science and Machine Learning" at Rice University, Sping 2023.
In this course, multiple algorithms for supervised, unsupervised, and reinforcement learning are discussed. The machine learning algorithms are implemented from scracth as well as uisng state-of-the-art packages such as Scikit-Learn, Keras, and Tensorflow.
Machine learning and data science are two rapidly growing fields in the world of technology. They are often used together to analyze and extract insights from large datasets.
Data science is the study of data, its structure, and its analysis. It involves using statistical and computational techniques to extract meaningful insights from data. Data science includes tasks such as data cleaning, data visualization, and statistical modeling.
Machine learning is a subset of artificial intelligence that involves training machines to learn patterns and make predictions based on data. It involves developing algorithms that can learn from data and improve their performance over time. Machine learning is often used for tasks such as image recognition, natural language processing, and predictive modeling.
Machine learning is a tool used by data scientists to extract insights from data. Data scientists use machine learning algorithms to identify patterns and trends in large datasets. Machine learning can also be used to develop predictive models that can be used to forecast future trends or make decisions based on data.
- Data science involves the study of data, while machine learning involves developing algorithms that can learn from data. Together, they form a powerful combination that can be used to solve complex problems and make data-driven decisions.
The following is the list of algorithms developed and discussed in this portfolio.
- Perceptron
- Gradient Descent
- Single Neuron Linear Regression
- Logistic Regression
- Deep Neural Network
- K-Nearest Neighbors
- Decision Trees
- Ensemble Learning - Random Forest
- Support Vector Machines
All datasets used in this work are publically available. The source and description of each dataset used is provided within the respective model.
Python, Visual Studio Code, Jupyter Nootbooks, Scikit-learn, TensorFlow, Keras, Matplotlib, Numpy, Pandas, Seaborn.
Kashif is a Ph.D. student in the department of Mechanical Engineering at Rice University. His areas of research are energy systems, thermal fluids, and machine learning. Check the following links to know more about Kashif: Personal Web, Google Scholar.