- i wanted to kill two birds with one stone and do machine learning in golang
- at least now i understand why we use python for data science :D
Ah, regression. The statistical process of modelling the relationship between a dependent variable and one or more independent variables, enabling us to predict new values. Note that regression techniques are generally concerned with predicting continuous values, as opposed to a discrete set of categories.
This brings us to possibly the most fundamental model, linear regression, expressed with the battle tested equation:
y = mx + b
Which describes a line with gradient m and y-intercept b.
One way of actually computing m and b is with the ordinary least squares method:
- Randomise values for both m and b to create an example line
- Find the vertical distance between the example line and each point in the dataset (these distances are called 'errors'): eᵢ = yᵢ − (mxᵢ + b)
- Sum the squares of these errors: SSE = Σ eᵢ² (a small Go sketch of both steps follows this list)
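Concretely, here's a minimal sketch of those two steps in Go (the function and variable names are my own):

```go
package main

import "fmt"

// sse returns the sum of squared errors of the line y = m*x + b
// against the points (xs[i], ys[i]).
func sse(m, b float64, xs, ys []float64) float64 {
	var sum float64
	for i := range xs {
		e := ys[i] - (m*xs[i] + b) // the error for point i
		sum += e * e
	}
	return sum
}

func main() {
	xs := []float64{1, 2, 3}
	ys := []float64{2, 4, 7}
	// Line y = 2x: errors are 0, 0, 1, so SSE is 1.
	fmt.Println(sse(2, 0, xs, ys))
}
```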
Now, we iteratively adjust the values of m and b to minimize this sum. A ubiquitous optimization technique for finding a local minimum is called gradient descent, but that's a topic for another day :)
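Still, to make the loop concrete, here's a rough Go sketch of the whole fitting process using gradient descent on the mean squared error. The learning rate and step count are arbitrary picks of mine, I start m and b at zero rather than random values to keep it short, and the gradient formulas are the "another day" part, so take them on faith here:

```go
package main

import "fmt"

// fit adjusts m and b with plain gradient descent on the
// mean squared error. lr is the learning rate.
func fit(xs, ys []float64, lr float64, steps int) (m, b float64) {
	n := float64(len(xs))
	for step := 0; step < steps; step++ {
		var gradM, gradB float64
		for i := range xs {
			e := ys[i] - (m*xs[i] + b) // the error for point i
			gradM += -2 * e * xs[i] / n
			gradB += -2 * e / n
		}
		// Step both parameters downhill along the gradient.
		m -= lr * gradM
		b -= lr * gradB
	}
	return m, b
}

func main() {
	xs := []float64{1, 2, 3, 4}
	ys := []float64{3, 5, 7, 9} // exactly y = 2x + 1
	m, b := fit(xs, ys, 0.05, 5000)
	fmt.Printf("m ≈ %.3f, b ≈ %.3f\n", m, b)
}
```

With this toy data it settles near m = 2, b = 1, the line the points were generated from.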
The accuracy and performance of linear regression depend on its assumptions:
- Linearity: there is a linear relationship between the dependent variable and the independent variable(s)
- Normality: the residuals (the errors above) are normally distributed
- No multicollinearity: your independent variables should not be predictors of each other, almost by definition (a quick check follows this list)
- No auto-correlation: a fancy way of saying your observations should not depend on their own past values, i.e. they are not points in a time series; Tesla's share price, for example
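As a rough sanity check for the multicollinearity point, you can compute the pairwise Pearson correlation between candidate independent variables. This sketch, and the 0.9 threshold, are just my own rule of thumb:

```go
package main

import (
	"fmt"
	"math"
)

// pearson returns the Pearson correlation coefficient between xs and ys.
func pearson(xs, ys []float64) float64 {
	n := float64(len(xs))
	var sumX, sumY float64
	for i := range xs {
		sumX += xs[i]
		sumY += ys[i]
	}
	meanX, meanY := sumX/n, sumY/n
	var cov, varX, varY float64
	for i := range xs {
		dx, dy := xs[i]-meanX, ys[i]-meanY
		cov += dx * dy
		varX += dx * dx
		varY += dy * dy
	}
	return cov / math.Sqrt(varX*varY)
}

func main() {
	a := []float64{1, 2, 3, 4}
	b := []float64{2, 4, 6, 8.1} // nearly a multiple of a
	if r := pearson(a, b); math.Abs(r) > 0.9 {
		fmt.Printf("r = %.3f: these predictors look collinear\n", r)
	}
}
```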
Pitfalls:
- Extrapolating beyond the range of the observed data can quickly become very inaccurate
- Extreme outliers can throw off the model; because the errors are squared, a single distant point can dominate the sum

