JBI030: Data Mining course material

This repository contains the course material, including Jupyter notebooks, for my lectures in Data Mining (JBI030), for the Data Science program (TUE/UvT).

In particular, my part of the course will cover:

Data Preprocessing
Model Selection and Evaluation
Logistic Regression
Linear/Kernelized SVM
Decision Trees
K-Nearest-Neighbors
Neural Networks
Ensemble Learning

During the lectures, I will present the theory behind the listed models and techniques. The Jupyter notebooks contain the Python code needed to run these methods. In particular, we will use the scikit-learn package (and Keras for the Neural Networks part). Note that the notebooks also rely on other packages alongside scikit-learn, the main ones being:

numpy
pandas
matplotlib
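
As a quick sanity check before opening the notebooks, the sketch below reports which of these packages are installed and in which version. The helper name check_packages is hypothetical (it is not part of the course material), and the package names passed to it are the usual PyPI distribution names, which is an assumption about how they were installed.

```python
from importlib.metadata import PackageNotFoundError, version


def check_packages(names):
    """Return a dict mapping each distribution name to its installed
    version string, or to None when the package is not installed.

    Hypothetical helper for illustration; not part of the course code.
    """
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


# Assumed PyPI distribution names for the packages used in the notebooks.
course_packages = ["numpy", "pandas", "matplotlib", "scikit-learn", "keras"]

for name, installed in check_packages(course_packages).items():
    print(f"{name}: {installed if installed is not None else 'NOT INSTALLED'}")
```

Any package reported as missing can then be installed following the instructions for your Python distribution.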

See the file JBI030_course_software.pdf for further information.
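
To give a rough idea of the kind of scikit-learn code the notebooks work with, here is a minimal sketch that chains preprocessing, a logistic regression classifier, and cross-validated evaluation. It is an illustration only and is not taken from the course notebooks: the iris dataset bundled with scikit-learn, the choice of 5 folds, and the max_iter value are all assumptions made for the example.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Example data shipped with scikit-learn (assumption: not the course data).
X, y = load_iris(return_X_y=True)

# Preprocessing and model in one pipeline, so that the scaler is fitted
# only on the training part of each cross-validation split.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# 5-fold cross-validation; the default score for a classifier is accuracy.
scores = cross_val_score(model, X, y, cv=5)
print(f"Mean cross-validated accuracy: {scores.mean():.3f}")
```

Wrapping the scaler and the classifier in a single pipeline keeps the evaluation honest, since no information from the held-out fold leaks into the preprocessing step.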

Relevant Material

The main reference for the course is the scikit-learn documentation, which contains an excellent theoretical introduction to the various methodologies, as well as a detailed technical explanation of its functions.

Other suggested readings for more detailed insights are:

Cloning the repository

To clone the repository to your local machine, run the following from a terminal:

git clone https://github.com/davidevdt/datamining_jbi030

New Jupyter notebooks for the corresponding lectures will be added progressively at the end of each class. To fetch the new notebooks into your local copy, open a terminal in the repository folder and type

git pull
