fast-tno

This repository aims to optimize the Tno operator proposed in the paper Toeplitz Neural Network for Sequence Modeling. The operator has the following form:

$$ \mathrm{Tno}: \mathbf x \to \mathbf o,\\ \mathbf o= \mathbf T \mathbf x , \mathbf T\in \mathbb R^{n\times n}, \mathbf x, \mathbf o \in \mathbb R^{n\times 1}. $$

In practice, the Tno operator is applied independently to each feature dimension $i$, so the complete formula is:

$$ \mathbf O[:, i]= \mathbf T_i \mathbf X[:, i], \\ \mathbf O[:, i]\in \mathbb R^{n\times 1}, \mathbf T_i\in \mathbb R^{n\times n}, \mathbf X[:, i]\in \mathbb R^{n\times 1}. $$
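As a reference point, the per-feature formula above can be sketched directly in NumPy. This is a minimal, quadratic-time illustration and not the repository's implementation. It assumes each $\mathbf T_i$ is a Toeplitz matrix with entries $T[a, b] = t_{a-b}$, stored as two hypothetical arrays: `t_pos` holds $t_0, \dots, t_{n-1}$ and `t_neg` holds $t_{-1}, \dots, t_{-(n-1)}$. The function names are illustrative only.

```python
import numpy as np

def toeplitz_matrix(t_pos, t_neg):
    """Build the n x n Toeplitz matrix with T[a, b] = t_{a-b}.

    t_pos: t_0, t_1, ..., t_{n-1}      (first column of T)
    t_neg: t_{-1}, ..., t_{-(n-1)}     (first row of T without t_0)
    """
    n = len(t_pos)
    T = np.empty((n, n), dtype=float)
    for a in range(n):
        for b in range(n):
            k = a - b
            T[a, b] = t_pos[k] if k >= 0 else t_neg[-k - 1]
    return T

def tno_naive(t_pos, t_neg, X):
    """Compute O[:, i] = T_i @ X[:, i] for every feature i.

    X: (n, d), t_pos: (n, d), t_neg: (n - 1, d). Costs O(n^2 d).
    """
    n, d = X.shape
    O = np.empty((n, d), dtype=float)
    for i in range(d):
        T_i = toeplitz_matrix(t_pos[:, i], t_neg[:, i])
        O[:, i] = T_i @ X[:, i]
    return O

# Tiny example with n = 3, d = 1: T = [[1, 4, 5], [2, 1, 4], [3, 2, 1]].
t_pos = np.array([[1.0], [2.0], [3.0]])
t_neg = np.array([[4.0], [5.0]])
X = np.array([[0.0], [1.0], [0.0]])          # picks out the second column of T
print(tno_naive(t_pos, t_neg, X).ravel())    # [4. 1. 2.]
```

A dense version like this is mainly useful as a correctness check for faster implementations.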

Although the theoretical complexity of Tno is $O(nd\log n)$, in practice it is slower than attention when the sequence length $n$ is small, so there is still considerable room for optimization.
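The $O(nd\log n)$ cost is typically reached by embedding each Toeplitz matrix in a circulant matrix of size $2n$ and multiplying with the FFT. The sketch below shows that standard technique in NumPy. It is an assumption about how the bound is obtained, not a copy of this repository's kernels, and the names `tno_fft`, `t_pos`, and `t_neg` are hypothetical (`t_pos` holds $t_0, \dots, t_{n-1}$ and `t_neg` holds $t_{-1}, \dots, t_{-(n-1)}$ per feature).

```python
import numpy as np

def tno_fft(t_pos, t_neg, X):
    """Toeplitz matrix-vector product per feature via a 2n-point FFT.

    X: (n, d), t_pos: (n, d), t_neg: (n - 1, d).
    Returns O of shape (n, d) with O[:, i] = T_i @ X[:, i],
    where T_i[a, b] = t_{a-b} for feature i.
    """
    n, d = X.shape
    # First column of the 2n x 2n circulant embedding:
    # [t_0, ..., t_{n-1}, 0, t_{-(n-1)}, ..., t_{-1}]
    zero = np.zeros((1, d), dtype=X.dtype)
    c = np.concatenate([t_pos, zero, t_neg[::-1]], axis=0)   # (2n, d)
    # Circular convolution along the sequence axis in the frequency domain.
    C_f = np.fft.rfft(c, axis=0)
    X_f = np.fft.rfft(X, n=2 * n, axis=0)                    # zero-pads X to 2n
    out = np.fft.irfft(C_f * X_f, n=2 * n, axis=0)
    # The first n rows equal the Toeplitz product; the rest is discarded.
    return out[:n]

rng = np.random.default_rng(0)
n, d = 8, 4
t_pos = rng.standard_normal((n, d))
t_neg = rng.standard_normal((n - 1, d))
X = rng.standard_normal((n, d))
O = tno_fft(t_pos, t_neg, X)
print(O.shape)  # (8, 4)
```

Setting `t_neg` to zeros makes every $\mathbf T_i$ lower triangular, which corresponds to the causal case. For small $n$ the fixed overhead of padding and three FFTs can outweigh the asymptotic savings.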

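Speed test

n vs time

(b = 8, d = 64).

[Figure: forward-mode time vs. n]

[Figure: backward-mode time vs. n]

d vs time

(b = 8, n = 2048).

[Figure: forward-mode time vs. d]

[Figure: backward-mode time vs. d]

b vs time

(d = 512, n = 2048).

[Figure: forward-mode time vs. b]

[Figure: backward-mode time vs. b]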

Training speed compared to transformer

## fp32

| model | size | wps | ups | wpb |
| --- | --- | --- | --- | --- |
| transformer | small | 5426.5 | 1.32 | 4096 |
| transformer | medium | 2630.2 | 0.64 | 4096 |
| tnn(cuda) | small | 6100.3 | 1.49 | 4096 |
| tnn(cuda) | medium | 3081.4 | 0.75 | 4096 |
| tnn(naive) | small | 5354.6 | 1.31 | 4096 |
| tnn(naive) | medium | 2687.5 | 0.66 | 4096 |

## fp16

| model | size | wps | ups | wpb |
| --- | --- | --- | --- | --- |
| transformer | small | 30159.9 | 7.36 | 4096 |
| transformer | medium | 17607 | 4.3 | 4096 |
| tnn(cuda) | small | 21793.7 | 5.32 | 4096 |
| tnn(cuda) | medium | 12290.8 | 3 | 4096 |
| tnn(naive) | small | 13463 | 3.29 | 4096 |
| tnn(naive) | medium | 7453.3 | 1.82 | 4096 |
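To make the comparison easier to read, the relative throughput can be computed from the `wps` figures listed above. The snippet below is a small convenience sketch: the numbers are copied from the logs, while the dictionary layout and the helper name `relative_wps` are hypothetical.

```python
# wps values copied from the fp32 and fp16 results above.
wps = {
    "fp32": {
        "transformer": {"small": 5426.5, "medium": 2630.2},
        "tnn(cuda)": {"small": 6100.3, "medium": 3081.4},
        "tnn(naive)": {"small": 5354.6, "medium": 2687.5},
    },
    "fp16": {
        "transformer": {"small": 30159.9, "medium": 17607.0},
        "tnn(cuda)": {"small": 21793.7, "medium": 12290.8},
        "tnn(naive)": {"small": 13463.0, "medium": 7453.3},
    },
}

def relative_wps(precision, model, size, baseline="transformer"):
    """Throughput of `model` divided by the baseline's, same precision and size."""
    return wps[precision][model][size] / wps[precision][baseline][size]

for precision in wps:
    for model in ("tnn(cuda)", "tnn(naive)"):
        for size in ("small", "medium"):
            ratio = relative_wps(precision, model, size)
            print(f"{precision} {model:<10} {size:<6} {ratio:.2f}x")
```

On these numbers, tnn(cuda) reaches roughly 1.12x (small) and 1.17x (medium) of the transformer's throughput in fp32, but only about 0.72x and 0.70x in fp16.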
