Skip to content
@MAT388E-Spring23

MAT388E-Data Analysis in Fund. Sciences course by Gül İnan.

MAT388E-Data Analysis in Fundamental Sciences

Course Instructor: Gül İnan

Course Summary:

MAT388E is an undergraduate level course which aims to provide an introduction to commonly used statistical methods for inference and prediction problems in data analysis. This course is designed such that:

  • The methods covered will include supervised learning algorithms with a focus on regression and classification problems and unsupervised learning algorithms with a focus on clustering problems,
  • Application of these methods to data analysis problems and their software implementation will be done via Python.

At the end of the semester, the students are expected:

  • To be fluent in the fundamental principles behind several statistical methods,
  • To be able to apply statistical methods to real life problems and data sets, and
  • To be prepared for more advanced coursework or industrial internship in machine learning and related fields.

Course GitHub Organization: https://github.com/MAT388E-Spring23.

Course Prerequisites:

Since the course also touches on the mathematical and statistical theory behind the methods and uses Python for implementation, this course requires the following background:

Course Tentative Plan

We will closely follow the weekly schedule given below. However, weekly class schedules are subject to change depending on the progress we make as a class.

Week 1. Exploratory data analysis.

Week 2. Introduction to simple linear regression. Basic optimization concepts used in regression analysis. Ordinary least squares estimation. Gradient descent algorithms.

Week 3. Models evaluation metrics for regression problems. Multiple linear regression. Maximum likelihood estimation.

Week 4. Polynomial regression. Bias-variance trade-off. Over-fitting and under-fitting.

Week 5. Regularization methods for regression problems. Ridge and lasso regression.

Week 6. Cross-validation. Unsupervised pre-processing. Grid search and hyper-parameter tuning. Pipelines.

Week 7. Introduction to classification. Logistic regression. Evaluation metrics for binary classification algorithms. Decision boundary concept.

Week 8. ITU Fall Break.

Week 9. Multi-class classification. Naive Bayes. K-nearest neighbors.

Week 10. Linear discriminant analysis. Quadratic discriminant analysis. (Midterm Week)

Week 11. Tree based methods. Bagging, Random forests, and Boosting.

Week 12. Unsupervised learning. Principal component analysis.

Week 13. No class due to Labor and Solidarity Day.

Week 14. Clustering methods.

Week 15. Final review and applications.

Popular repositories Loading

  1. MAT388E-Spring23.github.io MAT388E-Spring23.github.io Public

    Website for MAT388E-Data Analysis in Fundamental Sciences taught by Gül İnan at ITU Math Eng in Spring 2023.

    CSS 2

  2. .github .github Public

Repositories

Showing 2 of 2 repositories
  • MAT388E-Spring23.github.io Public

    Website for MAT388E-Data Analysis in Fundamental Sciences taught by Gül İnan at ITU Math Eng in Spring 2023.

    MAT388E-Spring23/MAT388E-Spring23.github.io’s past year of commit activity
    CSS 2 GPL-3.0 0 0 0 Updated May 27, 2023
  • .github Public
    MAT388E-Spring23/.github’s past year of commit activity
    0 0 0 0 Updated Jan 31, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…