Some examples of ML projects using datasets from UCI (https://archive.ics.uci.edu/ml/datasets.php) and Kaggle.
Built with pandas / NumPy / scikit-learn / (Keras).
- Regression:
- Airfoil Self-Noise (1503x5 - 1 output):
- Tags: Exploration / Correlation matrix / StandardScaler / Random Forest / Grid Search
- Combined Cycle Power Plant (9568x4 - 1 output):
- Tags: Exploration / Correlation matrix / StandardScaler / Random Forest / Grid Search
- Concrete Compressive Strength (1030x8 - 1 output):
- Tags: Exploration / Linear regression / KNN / MLP
- California house values (20640x9 - 1 output):
- From the excellent book "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition" by A. Géron. Notebook can be found here
- Tags: Exploration / Visualization / Correlation matrix / Imputer / OrdinalEncoder / OneHotEncoder / Pipeline / StandardScaler / ColumnTransformer / LinearRegression / DecisionTree / RandomForest / SVM / Grid Search / RandomizedSearchCV
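
Several of the regression projects above share the same StandardScaler / Random Forest / Grid Search workflow. The block below is a minimal sketch of that pattern, not the notebooks' actual code: it assumes synthetic data from `make_regression` as a stand-in for a UCI dataset, and the step names (`scaler`, `forest`) and the parameter grid are illustrative choices.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Assumption: synthetic data standing in for a real dataset (e.g. 5 features, 1 output).
X, y = make_regression(n_samples=300, n_features=5, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Scaling and the model live in one pipeline, so the scaler is fit
# only on the training folds during cross-validation.
pipeline = Pipeline([
    ("scaler", StandardScaler()),
    ("forest", RandomForestRegressor(random_state=0)),
])

# Hypothetical grid; real projects would tune a wider range.
param_grid = {
    "forest__n_estimators": [50, 100],
    "forest__max_depth": [None, 10],
}

search = GridSearchCV(pipeline, param_grid, cv=3, scoring="r2")
search.fit(X_train, y_train)

# R^2 of the best estimator on the held-out test set.
test_r2 = search.score(X_test, y_test)
print(search.best_params_, round(test_r2, 3))
```

Tree ensembles do not strictly need feature scaling, but keeping `StandardScaler` in the pipeline makes it easy to swap in a scale-sensitive model such as KNN or an MLP later.
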
- Classification:
- Titanic dataset (the famous one...)
- Tags: Exploration / LabelEncoder / StandardScaler / SVM / Grid Search
- Glass Identification (214x8 - 7 classes)
- Tags: Exploration / Correlation matrix / StandardScaler / Random Forest / Confusion Matrix (std & normalized)
- Wine Quality / Color (6497x12 - 2 classes)
- Tags: Exploration / Correlation matrix / StandardScaler / KNN / Confusion Matrix
- Fingers (https://www.kaggle.com/koryakinp/fingers)
- Tags: Exploration / Image manipulation / PCA / LabelEncoder / SGD / Confusion Matrix (std & normalized) / MLP / Keras ANN
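
The classification projects above repeatedly combine StandardScaler, an SVM tuned by grid search, and standard and normalized confusion matrices. The following is a rough sketch of that combination under stated assumptions: the data is synthetic (`make_classification`), and the step names and grid values are hypothetical rather than taken from the notebooks.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Assumption: synthetic two-class data standing in for a real dataset.
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=4, n_classes=2, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# SVMs are sensitive to feature scale, so scaling is part of the pipeline.
pipeline = Pipeline([
    ("scaler", StandardScaler()),
    ("svm", SVC()),
])

# Hypothetical grid over the regularization strength and kernel.
param_grid = {
    "svm__C": [0.1, 1, 10],
    "svm__kernel": ["linear", "rbf"],
}

search = GridSearchCV(pipeline, param_grid, cv=3)
search.fit(X_train, y_train)

y_pred = search.predict(X_test)

# Standard matrix holds raw counts; the normalized one divides each row
# by the number of true samples in that class.
cm = confusion_matrix(y_test, y_pred)
cm_normalized = confusion_matrix(y_test, y_pred, normalize="true")
print(cm)
print(cm_normalized)
```

The normalized matrix is easier to read when classes are imbalanced, since each row then shows per-class recall rather than raw counts.
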