Example scripts from the MSc Big Data Mining lectures
- Create your virtual environment and install the required dependencies:
virtualenv -p `which python3` venv
source venv/bin/activate
pip install -r requirements.txt
# You will probably need to install a Jupyter kernel:
ipython kernel install --user --name=venv
Otherwise, copy and paste the scripts into Google Colab.
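Before running the scripts locally, it can help to confirm that the interpreter (or the Jupyter kernel) is really the one from `venv`. The snippet below is a minimal sketch for that check and is not part of the lecture scripts; the helper name `in_virtual_environment` is hypothetical. It relies only on the standard `sys` module.

```python
import sys


def in_virtual_environment() -> bool:
    """Return True when the interpreter runs inside a virtual environment."""
    # Legacy virtualenv releases set sys.real_prefix. The standard venv module
    # and current virtualenv releases instead make sys.prefix differ from
    # sys.base_prefix.
    if hasattr(sys, "real_prefix"):
        return True
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


if __name__ == "__main__":
    state = "active" if in_virtual_environment() else "not active"
    print(f"Virtual environment is {state}: {sys.prefix}")
```

Run it from the activated shell or from a notebook cell that uses the `venv` kernel. On Google Colab it will usually report that no virtual environment is active, which is expected there.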
- binning_example
- chi_square_example
- DWT (wavelet transform)
- PCA (principal components analysis on dummy dataset)
- normalization
- k-means discretization
- feature_selection
- decision_trees
- KNN
- clustering_examples
- Silhouette_coefficient
- Apriori_example
- Outlier detection example
- Data drift example
- Word2Vec example
- Language modeling
- A playground with various simple examples
- A movie recommendation mini project