S5: Simplified State Space Layers for Sequence Modeling

This repository provides the official implementation and experiments for the paper Simplified State Space Layers for Sequence Modeling. The preprint is available here. The core contribution is the S5 layer, which simplifies the prior S4 approach while retaining its performance and efficiency.
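
For orientation, the following is a minimal sketch of the recurrence an S5-style layer computes: a diagonal (complex-valued) state matrix applied across the sequence with a parallel scan. The function and variable names are illustrative assumptions, not the repository's API; see src/ssm.py for the actual implementation.

import jax
import jax.numpy as jnp

def apply_diagonal_ssm(Lambda, B, C, u):
    """Apply x_k = Lambda * x_{k-1} + B u_k, y_k = Re(C x_k) over an input sequence u of shape (L, H)."""
    L = u.shape[0]
    Bu = u @ B.T                                      # (L, P): inputs lifted into the state space
    Lambda_elems = jnp.tile(Lambda[None, :], (L, 1))  # (L, P): one copy of the diagonal per step

    def binop(elem_i, elem_j):
        # Associative operator that composes two segments of the linear recurrence.
        a_i, b_i = elem_i
        a_j, b_j = elem_j
        return a_i * a_j, a_j * b_i + b_j

    _, xs = jax.lax.associative_scan(binop, (Lambda_elems, Bu))  # states x_1..x_L, computed in parallel over L
    return (xs @ C.T).real                            # (L, M): real-valued outputs

The associative scan makes the linear recurrence parallel over the sequence length, which is what allows the layer to be trained efficiently while still admitting a recurrent view.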

While it has since diverged a fair amount, this repository originally started from the JAX implementation of S4 in the Annotated S4 blog post by Sasha Rush (available here).

Experiments

The Long Range Arena and Speech Commands experiments in the paper were performed using the dataloaders from the official S4 repository. We are currently adding dataloaders better suited to our JAX implementation.

The LRA-Text (character-level IMDb classification) experiment can currently be run in a Google Colab notebook (Open in Colab). The remaining experiments will be added soon!

Requirements

If using the Google Colab environment, all requirements are already installed.

To run the code on your own machine, first install JAX following the instructions at https://github.com/google/jax#installation, then install the remaining dependencies from requirements.txt.

Repository Structure

data/            default location of data files
Examples/        Colab notebook examples of experiments
src/             source code for models, datasets, etc.
    dataloading.py   data loading functions
    layers.py        S5 layer, wrapping the S5 SSM with nonlinearity, norm, dropout, etc. (sketched below)
    seq_model.py     deep sequence models built from stacks of S5 layers
    ssm.py           S5 SSM implementation
    ssm_init.py      helper functions for initializing the S5 SSM
    train.py         training loop entrypoint
    train_helpers.py functions for optimization, training, and evaluation steps
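
As a rough illustration of the wrapping performed in layers.py, the sketch below applies a pre-norm residual block of the form norm -> S5 SSM -> nonlinearity -> dropout -> skip connection. The parameter names, the GELU activation, and the pre-norm placement are assumptions made for illustration, not necessarily the repository's exact configuration.

import jax
import jax.numpy as jnp

def layer_norm(x, scale, bias, eps=1e-6):
    # Normalize over the feature dimension, then apply a learned scale and bias.
    mean = x.mean(-1, keepdims=True)
    var = x.var(-1, keepdims=True)
    return (x - mean) / jnp.sqrt(var + eps) * scale + bias

def s5_block(params, u, ssm_apply, rng=None, p_drop=0.0, training=False):
    """One residual block: LayerNorm -> S5 SSM -> GELU -> dropout -> skip connection."""
    x = layer_norm(u, params["norm_scale"], params["norm_bias"])
    x = ssm_apply(params["ssm"], x)       # e.g. the diagonal SSM sketched earlier in this README
    x = jax.nn.gelu(x)
    if training and p_drop > 0.0:
        keep = jax.random.bernoulli(rng, 1.0 - p_drop, x.shape)
        x = jnp.where(keep, x / (1.0 - p_drop), 0.0)  # inverted dropout
    return u + x                          # residual connection; seq_model.py stacks blocks like this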

Citation

Please use the following when citing our work:

TBD

Please reach out if you have any questions.

-- The S5 authors.
