Skip to content

Course Material for my part of the Data Mining (JBI030) course

License

Notifications You must be signed in to change notification settings

davidevdt/datamining_jbi030

Repository files navigation

JBI030: Data Mining course material

In this repository, you can find the course material - including jupyter notebooks - concerning my lectures of Data Mining (JBI030), valid for the Data Science program (TUE/UvT).

In particular, my part of the course will cover:

  1. Data Preprocessing
  2. Model Selection and Evaluation
  3. Logistic Regression
  4. Linear/Kernelized SVM
  5. Decision Trees
  6. K-Nearest-Neighbors
  7. Neural Networks
  8. Ensemble Learning

During the lectures, I will present the theory of the listed models/techniques. Jupyter notebooks contain the relevant python code needed to run such methods. In particular, we will make use of the scikit-learn package (and Keras, for the Neural Networks part). Notice that scikit-learn requires the installation of other packages, among which the main ones are:

  • numpy
  • pandas
  • matplotlib

See the file JBI030_course_software.pdf for further information.

Relevant Material

The main reference for the course is the scikit-learn documentation, which contains an excellent theoretical introduction to the various methodologies, as well as a detailed technical explanation of its functions.

Other suggested readings for more detailed insights are:

Cloning the repository

To clone the repository into your local machine, you can run from terminal:

git clone https://github.com/davidevdt/datamining_jbi030

New jupyter notebooks related to the correponding course lectures will be progressively added at the end of each class; to fetch the new lectures into the local folder, place your terminal into the folder directory and type

git pull

Releases

No releases published

Packages

No packages published