Skip to content

iuriishamkin/data-science-final-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Supervised, Unsupervised Learning & Data Visualization

Introduction

This is a final project for the Applied Data Science course. Main topics covered are data visualization, unsupervised learning (Multidimensional scaling, t-Distributed Stochastic Neighbor Embedding, PAM clustering), supervised learning (Multiple Linear and Log-linear Regressions, Decision (regression) trees and Random Forests) and related metrics. This project was planned to not only explore separate topics of supervised and unsupervised learning but also to compare them whereas possible, to determine the best approach to use considering presented data.

Working with two personal datasets, I performed data preparation, various data visualizations including regression trees, comparison of performing dimension reduction, clustering, and comparison of predictive models.

Datasets

Two personal datasets I've used:

About

This repo contains R code, datasets and a report for a final project in MATH 4990 (Data Science) at Thompson Rivers University, Winter 2018

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages