ABC Transformer from scratch in rust

[!WARNING] This is an experimental repository that aims to test out the following idea. At this stage, I do not intent to put a lot of effort into making this serviceable to others in all aspects.

I want to write more rust + build and train a transformer from scratch. By from scratch I mean I am minimizing the use of external libraries, not even ndarray.
I am wondering just how efficient I can make training a transformer be. Training any neural network on a GPU is the default these days, but based on some papers 1, 2, I am curious whether there are ways to not rely on massive dense computation as much.

Map of this repo

You can find a working note in note.typ/note.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Readme.md		Readme.md
note.pdf		note.pdf
note.typ		note.typ

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ABC Transformer from scratch in rust

Map of this repo

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ABC Transformer from scratch in rust

Map of this repo

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages