armeetjatyani/autograd

Tiny autograd πŸ› οΈ and neural-network 🧠 library

Scalar value autograd engine and neural network library with a PyTorch-like API, written from scratch with no dependencies (excluding random) in ~500 lines of code. A recreation of micrograd.

autograd/nn.py - simple nn library

The nn library is built on top of the autograd engine (see below). Read about the Neuron, Layer, and Net objects below. Here is a demo showing how a simple network can be trained. Note that this library is meant as an educational exercise: everything runs on the CPU and nothing is parallelized, so we use tiny examples to show the capabilities of this tiny library.

tiny_train (demo)

Train a simple nn.Net to fit a tiny 4-example dataset.

Demo Code
from autograd.engine import Value
import autograd.nn as nn
import tqdm

X = [[1, 2, 3], [2, 3, 4], [-3, 4, -2.3], [-5, 6, -2]]
y = [0, 1, 0, 1]

# construct model
model = nn.Net(3, [10, 1])
print(model)
print(f"trainable_params={model.nparams()}")

lr = 0.01
epochs = 5000

pbar = tqdm.trange(1, epochs + 1, desc="[loss: xxxxxx]")
for epoch in pbar:
    loss = 0
    # forward
    for x, yi in zip(X, y):
        loss += (model(x) - yi) ** 2
    pbar.set_description(f"[loss={str(loss.data)[:6]}][epoch={epoch}]")
    # backward
    model.zero_grad()
    loss.backward()

    # update params
    # does same thing as model.step(lr)
    for p in model.params():
        p.data -= lr * p.grad

print("y_actual:", y)
print("y_pred(class):",[round(model(x).data) for x in X])
print("y_pred:", [model(x).data for x in X])
tiny_train.mp4

sin_train (demo)

In this demo, we train a neural network to fit the sine function.

Demo Code
import math
import random

import numpy as np
import matplotlib.pyplot as plt
import tqdm

from autograd.engine import Value
import autograd.nn as nn

"""
generate a dataset for the target function
f(x) = sin(x) + noise
"""
noise = lambda: random.uniform(-0.1, 0.1)
f = lambda x: math.sin(x[0])
X = [[random.uniform(-2 * math.pi, 2 * math.pi)] for i in range(50)]
y = [f(x) + noise() for x in X]

# construct model
model = nn.Net(1, [20, 20, 1])
print(model)
print(f"trainable_params={model.nparams()}")

lr = 0.001
epochs = 500

pbar = tqdm.trange(1, epochs + 1, desc="[loss: xxxxxx]")
for epoch in pbar:
    # forward
    loss = 0
    for x, yi in zip(X, y):
        loss += (model(x) - yi) ** 2
    pbar.set_description(f"[loss={str(loss.data)[:6]}][epoch={epoch}]")

    # backward
    model.zero_grad()
    loss.backward()

    # update params
    model.step(lr)

plt.figure()
X_true = list(np.linspace(-2 * math.pi, 2 * math.pi, 100))
y_true = [math.sin(x) for x in X_true]
plt.plot(X_true, y_true, label="actual", c="gray")
plt.scatter(X_true, [model([x]).data for x in X_true], label="predicted")
plt.legend()
plt.show()
Net([1, 20, 20, 1]): ['Layer(N_in=1, N_out=20)', 'Layer(N_in=20, N_out=20)', 'Layer(N_in=20, N_out=1)']]
trainable_params=481
[loss=2.9641][epoch=999]: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1000/1000 [09:38<00:00,  1.73it/s]

sin_train

nn.Neuron

Every neuron stores its parameters (weights and bias) as Value objects that keep track of gradients. To evaluate a neuron, the inputs are multiplied by the weights, summed, and added to the bias. Finally, the resulting signal is passed through a sigmoid activation. The entire expression is represented internally as a DAG, which lets autograd.engine compute gradients on the backward pass.

Usage:

from autograd.nn import Neuron
n = Neuron(10)

print(n.nparams())
# 11

print(n.params())
# [Value(-0.8269341980053548, grad=0.0), Value(-0.9198909182804311, grad=0.0), Value(0.23878371951669064, grad=0.0), Value(-0.9616815732362081, grad=0.0), Value(0.7005652557465922, grad=0.0), Value(-0.34538779319877766, grad=0.0), Value(0.8949940702869532, grad=0.0), Value(-0.9398044368005902, grad=0.0), Value(0.009769044293206797, grad=0.0), Value(0.03367950339845449, grad=0.0), Value(0.0, grad=0.0)]

# evaluate the neuron, pass in 10 inputs
print(n([1, 2, 3, 4, 5, 6, 7, 8, 9, 10]))
# Value(0.9944882835077717, grad=0.0)

We initialize this neuron to take in 10 inputs. This neuron has 11 total parameters (10 weights + 1 bias).
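Under the hood, the forward pass is just a weighted sum of the inputs plus the bias, squashed through a sigmoid. The sketch below is a hypothetical re-implementation (not the actual nn.Neuron code; details such as weight initialization may differ) that illustrates the idea using the engine's Value class:

import random
from autograd.engine import Value

# hypothetical sketch of a neuron's forward pass; nn.Neuron may differ
class SketchNeuron:
    def __init__(self, n_in):
        self.w = [Value(random.uniform(-1, 1)) for _ in range(n_in)]  # weights
        self.b = Value(0.0)                                           # bias

    def __call__(self, x):
        # weighted sum of inputs plus bias, passed through a sigmoid
        act = sum((wi * xi for wi, xi in zip(self.w, x)), self.b)
        return act.sigmoid()

n = SketchNeuron(10)
print(n(list(range(1, 11))))  # a single Value, ready for .backward()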

nn.Layer

A Layer is an abstraction built on top of the Neuron class. Layers contain identically shaped neurons; that is, each neuron in the layer takes in the same number of inputs. The number of neurons in the layer is the number of outputs of the layer.

In summary, a layer takes in N_in inputs and returns N_out outputs.

Usage:

from autograd.nn import Layer
l = Layer(3, 10)
print(l)
# Layer(N_in=3, N_out=10)

print(l.nparams())
# 40

# evaluate the layer (pass in 3 inputs, get 10 outputs)
print(l([1, 2, 3]))
"""
[Value(0.48653402984121236, grad=0.0),
 Value(0.21033094069723815, grad=0.0),
 Value(0.9697836303741563, grad=0.0),
 Value(0.06583501067003876, grad=0.0),
 Value(0.4700712216294792, grad=0.0),
 Value(0.8908593203427052, grad=0.0),
 Value(0.2655114954121276, grad=0.0),
 Value(0.32645743048687054, grad=0.0),
 Value(0.18878689965997322, grad=0.0),
 Value(0.8124174289699577, grad=0.0)]
"""

In this example we build a layer with 10 neurons. Each neuron has 4 trainable parameters (3 weights + 1 bias). This yields a total of 40 trainable parameters.
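Conceptually, evaluating a layer just means evaluating each of its neurons on the same inputs. A minimal sketch of the idea (a hypothetical illustration built on nn.Neuron, not the actual Layer code):

from autograd.nn import Neuron

class SketchLayer:
    def __init__(self, n_in, n_out):
        # a layer is a list of identically shaped neurons
        self.neurons = [Neuron(n_in) for _ in range(n_out)]

    def __call__(self, x):
        # each neuron sees the same N_in inputs; the layer returns N_out outputs
        return [n(x) for n in self.neurons]

l = SketchLayer(3, 10)
print(len(l([1, 2, 3])))  # 10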

nn.Net

Net is a multilayer perceptron built by chaining together multiple Layer objects. Layers are fully connected.

Usage:

from autograd.nn import Net

net = Net(3, [100, 200, 2])
print(net)
# Net([3, 100, 200, 2]): ['Layer(N_in=3, N_out=100)', 'Layer(N_in=100, N_out=200)', 'Layer(N_in=200, N_out=2)']]

print(net.nparams())
# 21002

print(net([1, 2, 3]))
# [Value(0.9786990460374885, grad=0.0), Value(0.9993422981186754, grad=0.0)]

In this example, we create a net that has 3 fully connected layers. The net ultimately takes in 3 inputs and yields 2 outputs (ignoring the hidden layers).

  • Layer 1: 3 inputs --> 100 outputs
  • Layer 2: 100 inputs --> 200 outputs
  • Layer 3: 200 inputs --> 2 outputs
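
The parameter count follows directly from the layer shapes: a fully connected layer with N_in inputs and N_out outputs contributes N_out * (N_in + 1) parameters (N_in weights plus a bias per neuron). A quick sanity check:

# parameters per layer: N_out * (N_in + 1)
sizes = [3, 100, 200, 2]
nparams = sum(n_out * (n_in + 1) for n_in, n_out in zip(sizes, sizes[1:]))
print(nparams)  # 21002, matching net.nparams()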

autograd/engine.py - scalar value autograd engine

engine.Value

We can construct expressions using the Value class. Internally, the library maintains a graph of variables and their dependencies. Calling .backward() on the result of an expression computes the gradient of that result with respect to every Value in the expression. Value can be thought of as a simplified version of PyTorch's Tensor, holding a single scalar float rather than a tensor.

from autograd.engine import Value
x = Value(2)
y = Value(2.3)
print(x / y) # Value(0.8695652173913044, grad=0.0)

# simple expressions
print(2 + x) # Value(4.0, grad=0.0)
print(x / 0) # ZeroDivisionError
print(x ** 3) # Value(8.0, grad=0.0)
print(x.exp()) # Value(7.3890560989306495, grad=0.0)

# non-linearities
z = Value(-3)
print(z.relu()) # Value(0.0, grad=0.0)
print(z.sigmoid()) # Value(0.04742587317756678, grad=0.0)
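Calling .backward() on the result of an expression populates the .grad field of every Value it depends on, as in the training loops above. A minimal example: with f = x * y + x, we expect df/dx = y + 1 and df/dy = x.

from autograd.engine import Value

x = Value(2)
y = Value(3)
f = x * y + x
f.backward()   # compute gradients through the expression DAG

print(x.grad)  # df/dx = y + 1 = 4.0
print(y.grad)  # df/dy = x     = 2.0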

Minimize (demo)

In the following example, we'll minimize f = relu((x - y)^3 + e^y) + y over many iterations. In each step, we perform a forward pass, a backward pass (to calculate gradients), and then update the parameters x and y according to their calculated gradients.

import numpy as np
from autograd.engine import Value

# initialize params
x = Value(7)
y = Value(3)
verbose = True
lr = 0.001
epochs = 20000
xs = np.arange(0, epochs + 1, 1)
ys = []

for i in range(epochs + 1):
    # forward
    f = (x - y) ** 3 + y.exp()
    f = f.relu() + y
    ys.append(f.data)

    # backward
    x.zero_grad()
    y.zero_grad()
    f.backward()

    # update params
    x.data -= lr * x.grad
    y.data -= lr * y.grad

The following video shows the value of the expression over thousands of iterations. We expect the value to grow smaller and smaller.

minimize.mp4
