gonet is a neural network library written in Go as a project for learning,
experimentation, and fun. It is not intended for production use. The goal is to
make neural-network internals easier to inspect by implementing the core pieces
directly: fully connected layers, embeddings, attention, normalization,
losses, training loops, and automatic differentiation.
The repository contains two independent implementations:
- The root `gonet` package builds dynamic computation graphs and performs reverse-mode automatic differentiation.
- The `arrimpl` package contains an array-based fully connected network with explicit forward and backward propagation, intended as a cross-check against the graph implementation.
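To give a feel for what reverse-mode autodiff on a dynamic graph involves, here is a minimal, self-contained micrograd-style sketch. The `Value` type and the `Add`, `Mul`, and `Backward` functions below are illustrative names only, not gonet's actual API:

```go
package main

import "fmt"

// Value is a node in a dynamic computation graph. It records which nodes
// produced it and how to push gradients back into them.
// (Illustrative sketch; gonet's real types and method names may differ.)
type Value struct {
	Data     float64
	Grad     float64
	parents  []*Value
	backward func() // accumulates this node's gradient into its parents
}

// Add builds a node for a+b and records its local backward rule.
func Add(a, b *Value) *Value {
	out := &Value{Data: a.Data + b.Data, parents: []*Value{a, b}}
	out.backward = func() {
		a.Grad += out.Grad
		b.Grad += out.Grad
	}
	return out
}

// Mul builds a node for a*b and records its local backward rule.
func Mul(a, b *Value) *Value {
	out := &Value{Data: a.Data * b.Data, parents: []*Value{a, b}}
	out.backward = func() {
		a.Grad += b.Data * out.Grad
		b.Grad += a.Data * out.Grad
	}
	return out
}

// Backward runs reverse-mode autodiff: topologically sort the graph that
// produced v, then apply each node's backward rule in reverse order.
func Backward(v *Value) {
	var order []*Value
	visited := map[*Value]bool{}
	var visit func(n *Value)
	visit = func(n *Value) {
		if visited[n] {
			return
		}
		visited[n] = true
		for _, p := range n.parents {
			visit(p)
		}
		order = append(order, n)
	}
	visit(v)

	v.Grad = 1 // d(v)/d(v) = 1
	for i := len(order) - 1; i >= 0; i-- {
		if order[i].backward != nil {
			order[i].backward()
		}
	}
}

func main() {
	// y = a*b + a, so dy/da = b + 1 = 4 and dy/db = a = 2.
	a := &Value{Data: 2}
	b := &Value{Data: 3}
	y := Add(Mul(a, b), a)
	Backward(y)
	fmt.Println(y.Data, a.Grad, b.Grad) // 8 4 2
}
```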
The examples/ directory contains small demonstration projects that exercise
different parts of the framework:
- Binary classifier - trains a small neural-net binary classifier inspired by Karpathy's micrograd introduction. It can run with either the computation-graph implementation or the array-based MLP.
- Digit OCR - trains a digit classifier on sklearn digits or MNIST-style data, again with both graph and array-based training modes.
- Word embedding - trains a tiny word embedding model and shows how similar words can converge toward similar learned vectors (see the cosine-similarity sketch after this list).
- Makemore neural bigram - implements a character-level neural bigram language model, with both one-hot linear and embedding-based variants.
- Makemore neural quadgram - implements an MLP character language model in the style of Bengio et al. 2003, using multiple previous characters as context.
- Makemore WaveNet - builds a character language model with a WaveNet-like hierarchical structure.
- Makemore decoder-only transformer - trains a small character-level GPT-style model with masked self-attention, multi-head attention, attention blocks, and token generation.
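In the word embedding example, "similar vectors" is typically judged by cosine similarity between the learned rows of the embedding table. A minimal sketch (the vectors and words below are made up; in the example they would come from the trained model):

```go
package main

import (
	"fmt"
	"math"
)

// cosine returns the cosine similarity of two embedding vectors:
// dot(a, b) / (||a|| * ||b||). Values near 1 mean similar directions.
func cosine(a, b []float64) float64 {
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

func main() {
	// Hypothetical learned vectors for three words.
	cat := []float64{0.9, 0.1, 0.3}
	dog := []float64{0.8, 0.2, 0.35}
	car := []float64{-0.5, 0.7, -0.1}
	fmt.Printf("cat~dog: %.2f  cat~car: %.2f\n", cosine(cat, dog), cosine(cat, car))
}
```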
The decoder-only transformer is currently the most sophisticated example in the repository. It demonstrates that this small Go deep-learning framework can express and train a relatively complex model: token embeddings, positional/context handling, masked self-attention, multi-head attention, stacked transformer blocks, and autoregressive character generation.
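As a rough, framework-free sketch of the masked self-attention step, here is a single head over plain float64 slices. The example itself presumably builds this from the framework's differentiable graph operations, so the function and shapes below are illustrative only:

```go
package main

import (
	"fmt"
	"math"
)

// maskedAttention computes one attention head over T positions of width d:
// softmax(Q·Kᵀ/√d with a causal mask)·V, so position t only attends to
// positions <= t. (Illustrative sketch, not the example's actual code.)
func maskedAttention(Q, K, V [][]float64) [][]float64 {
	T, d := len(Q), len(Q[0])
	out := make([][]float64, T)
	for t := range out {
		// Scaled dot-product scores against the allowed (<= t) positions.
		scores := make([]float64, t+1)
		for s := 0; s <= t; s++ {
			var dot float64
			for i := 0; i < d; i++ {
				dot += Q[t][i] * K[s][i]
			}
			scores[s] = dot / math.Sqrt(float64(d))
		}
		// Softmax over the unmasked scores (shifted by the max for stability).
		max := scores[0]
		for _, v := range scores {
			if v > max {
				max = v
			}
		}
		var sum float64
		for s := range scores {
			scores[s] = math.Exp(scores[s] - max)
			sum += scores[s]
		}
		// Output for position t is the attention-weighted sum of the values.
		out[t] = make([]float64, d)
		for s := 0; s <= t; s++ {
			w := scores[s] / sum
			for i := 0; i < d; i++ {
				out[t][i] += w * V[s][i]
			}
		}
	}
	return out
}

func main() {
	// Three positions, width-2 head; position 0 can only attend to itself.
	x := [][]float64{{1, 0}, {0, 1}, {1, 1}}
	fmt.Println(maskedAttention(x, x, x))
}
```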
The example is still primarily educational. Performance is not competitive with production deep-learning frameworks (e.g., PyTorch), but that is also the point: the model is implemented in a way that keeps the mechanics visible and hackable instead of hiding them behind highly optimized tensor kernels.