ML

Linear Regression

Linear regression is used to predict a real-valued (or near real-valued) output for a given data set. It analyses the trend in the existing data and predicts new values accordingly.

The line represents the predicted values, and the dots (above and below it) represent the actual values.

The general equation of a line is: y = mx + b, where m is the slope and b is the intercept.

Our objective is to minimize the distance between the two. To do this, we calculate an error function.

Error Function

This gives the mean squared distance between the data set values (the dots on the graph above) and the predicted values (the line). It is given by:

$$E(m, b) = \frac{1}{N} \sum_{i=1}^{N} \left( y_i - (m x_i + b) \right)^2$$

where y_i represents the actual data values, m x_i + b represents the predicted values, and N is the number of data points. The parameters m and b can be calculated by gradient descent.
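As a minimal sketch of the error function described above, the mean squared error can be computed in plain Python. The function name `mean_squared_error` and the sample points are assumptions made for illustration; they are not taken from `linear_regression.py`.

```python
def mean_squared_error(points, m, b):
    """Average squared vertical distance between each point and the line y = m*x + b."""
    total = 0.0
    for x, y in points:
        total += (y - (m * x + b)) ** 2
    return total / len(points)


# Made-up sample points that lie exactly on y = 2x + 1 (illustration only).
points = [(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]

print(mean_squared_error(points, m=2.0, b=1.0))  # perfect fit, so the error is 0.0
print(mean_squared_error(points, m=2.0, b=0.0))  # every point is off by 1, so the error is 1.0
```

The closer the line is to the dots, the smaller this value becomes, which is why it is the quantity to minimize.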

We have to find the values of m and b for which the error function is at its minimum.

Gradient Descent

The gradient at a point gives the slope of the tangent to the error curve we are traversing. It tells us the direction in which to move, up or down the curve, in order to reduce the error.
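One way to sketch this for the line y = mx + b is to take the partial derivatives of the mean squared error with respect to m and b and move both parameters a small step against them. The names `gradient_step` and `fit`, the default step size, the iteration count, and the sample points are all assumptions for illustration; the repository's own script may differ.

```python
def gradient_step(points, m, b, learning_rate):
    """Move m and b one step against the gradient of the mean squared error."""
    n = len(points)
    grad_m = 0.0
    grad_b = 0.0
    for x, y in points:
        residual = y - (m * x + b)
        grad_m += -(2.0 / n) * x * residual  # partial derivative with respect to m
        grad_b += -(2.0 / n) * residual      # partial derivative with respect to b
    return m - learning_rate * grad_m, b - learning_rate * grad_b


def fit(points, m=0.0, b=0.0, learning_rate=0.05, iterations=2000):
    """Repeat the gradient step a fixed number of times, starting from m and b."""
    for _ in range(iterations):
        m, b = gradient_step(points, m, b, learning_rate)
    return m, b


# Made-up points on y = 2x + 1 (illustration only).
points = [(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]
m, b = fit(points)
print(round(m, 4), round(b, 4))
```

On these points the parameters should settle very close to m = 2 and b = 1, the line the data was generated from.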

Learning Rate

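The learning rate determines how fast the model learns, that is, how large a step is taken at each update. It can be neither too high nor too low: if it is too high the updates can overshoot the minimum, and if it is too low convergence is slow. It has to be chosen according to the given data set.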
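The effect of the learning rate can be seen on a deliberately simple stand-in problem: minimizing f(w) = w², whose gradient is 2w and whose minimum is at w = 0. This toy function, the helper name `descend`, and the three rates are assumptions chosen for illustration; they are not part of the repository.

```python
def descend(learning_rate, start=1.0, steps=20):
    """Run gradient descent on f(w) = w**2, whose gradient is 2*w."""
    w = start
    for _ in range(steps):
        w -= learning_rate * 2 * w
    return w


# A rate that is too low, one that works well, and one that is too high.
for rate in (0.01, 0.4, 1.1):
    print(rate, descend(rate))
```

With the smallest rate, w has barely moved towards 0 after 20 steps. With the middle rate it is essentially at the minimum. With the largest rate each step overshoots, and w moves further away from 0 instead of closer.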

Spam Classifier

This classifier is based on Bayes' theorem (reference: ML for Hackers). The text is extracted from all the e-mails, and the occurrence probability of each term is calculated for each type of corpus (easy ham, hard ham, spam).
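A rough sketch of the occurrence calculation is shown here in Python rather than R. The repository's classifier is `spamClassifier.R`, so this is not the author's code. It assumes "occurrence probability" means the fraction of messages in a corpus that contain a given word; the function name `word_occurrence` and the tiny corpora are made up for illustration.

```python
from collections import Counter


def word_occurrence(messages):
    """Fraction of messages in which each word appears at least once."""
    counts = Counter()
    for text in messages:
        # set() so that a word repeated within one message is counted once
        counts.update(set(text.lower().split()))
    return {word: n / len(messages) for word, n in counts.items()}


# Tiny made-up corpora standing in for the real e-mail folders.
spam = ["win free money now", "free offer click now"]
ham = ["meeting notes attached", "lunch at noon"]

spam_probs = word_occurrence(spam)
ham_probs = word_occurrence(ham)

print(spam_probs["free"])   # "free" is in both spam messages: 1.0
print(spam_probs["win"])    # "win" is in one of two spam messages: 0.5
print("free" in ham_probs)  # "free" never appears in the ham corpus: False
```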

File Descriptions

All emails are in text format.

easyham - Messages that can be easily classified as ham (not spam).

hardham - Messages that cannot be easily classified as ham (not spam).

spam - Messages that can be easily classified as spam.

The probability of each keyword is calculated and tested against the spam classifier.

Extract the text of each message and compare it against the obtained values.
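The comparison step could look roughly like the following naive Bayes sketch, again in Python and not taken from `spamClassifier.R`. It assumes equal priors for both classes, a small fallback probability for words never seen in a corpus, and log probabilities to avoid numeric underflow. The names `log_score` and `classify` and the probability tables are hypothetical.

```python
import math


def log_score(text, word_probs, prior=0.5, unseen=1e-6):
    """Log of prior times the product of each word's occurrence probability."""
    score = math.log(prior)
    for word in set(text.lower().split()):
        # Words absent from the corpus get a small assumed probability.
        score += math.log(word_probs.get(word, unseen))
    return score


def classify(text, spam_probs, ham_probs):
    """Label a message with whichever corpus gives it the higher score."""
    spam_score = log_score(text, spam_probs)
    ham_score = log_score(text, ham_probs)
    return "spam" if spam_score > ham_score else "ham"


# Hypothetical occurrence probabilities (illustration only).
spam_probs = {"free": 1.0, "now": 1.0, "money": 0.5}
ham_probs = {"meeting": 0.5, "lunch": 0.5, "notes": 0.5}

print(classify("free money now", spam_probs, ham_probs))  # spam
print(classify("lunch meeting", spam_probs, ham_probs))   # ham
```

Running every message in a folder through such a function and counting the correct labels gives a per-folder accuracy.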

This model achieved the following accuracy:

spam: 73%

hardham: 93%

ham: 98%

Special Thanks

Siraj Raval and Andrew Ng for awesome teaching. -XOXO

PixelSenseiAvi/ML
