Skip to content

UOADataScience/machine-learning-tutorial

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

What is Machine Learning?

The difference between statistics and machine learning is that

  • Statistics' goal is to explain the relationship between different data, whereas
  • Machine learning aims to create a prediction based on collected data

Supervised Learning

Supervised learning are algorithms that are given examples for the right answers. The algorithms then learn from these examples, and generate predictions for new data.

Linear Regression

Linear regression is the most basic form of learning from data. Most of you who have taken statistic or math papers should have came across this topic frequently.

Essentially, the idea behind linear regression is that a "line of best fit" is drawn for the collected data, with the main objective of finding a relationship between 2 or more sets of data. An example below

Given a training set, a learning algorithm will generate weights based on collected data. Once the solution has converged, any new input will be passed through the calculated weights and a prediction will be generated.

More details are available on the wiki page.

Classification and Logistic Regression

For classification purposes, unlike linear regression, the output for logistic regression will only take on a small amount of discrete values. For an example, telling apart spam messages and otherwise.

Neural Network

"Neural network" is a name given to a nonlinear function approximator, whereby inputs are mapped to output through a series of complex functions.

Neural networks are typically drawn as such

where each node represents a function, and each edge represents a weight. For this diagram,

  • Inputs are represented by the green nodes
  • Each input node is multipled by 5 different weights
  • These outputs are then summed at each hidden layer node, and mapped to another output
  • Again, these outputs, represented by the 5 edges on the right of the hidden layer, are multipled by different weights
  • The output layer then sums these values and then maps them to a single output

Unsupervised Learning

K-Means Clustering Algorithm

Principal Component Analysis

Independent Component Analysis

Reinforcement Learning

Optimization

Practical Advice

Resources

Books

Online Lectures

MOOC

Available Packages

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published