Abstract: Capturing long-range dependencies is a fundamental challenge in many machine learning tasks, including natural language processing and time series analysis. In a recent series of works, State Space Models (SSMs) have emerged and proven extremely effective at modeling such dependencies, notably surpassing transformers on benchmarks such as Long Range Arena (LRA). At their core, SSMs are recurrent neural networks (RNNs) with a linear update to the hidden state, enabling efficient implementation and training on very long sequences. Many SSM variants with different architectural designs have been proposed, yet to date there is no clear theoretical explanation of why SSMs are so effective. This paper empirically investigates the effect of different design choices on the optimization and generalization of SSMs.
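To make the "linear update to the hidden state" concrete, here is a minimal NumPy sketch of the recurrence at the core of an SSM layer. This is an illustration, not the code in this repository: the matrices `A`, `B`, `C` are random placeholders, whereas practical SSM variants (S4, S5, etc.) use structured initializations such as HiPPO and learn these matrices during training.

```python
import numpy as np

def ssm_scan(A, B, C, xs):
    """Linear state-space recurrence over a 1-D input sequence xs:
    h_t = A h_{t-1} + B x_t,  y_t = C h_t."""
    d = A.shape[0]
    h = np.zeros(d)
    ys = []
    for x in xs:                      # note: no nonlinearity applied to h
        h = A @ h + B.flatten() * x   # linear hidden-state update
        ys.append(C @ h)              # linear readout
    return np.array(ys)

rng = np.random.default_rng(0)
d = 4                                 # hidden state dimension (placeholder)
A = 0.9 * np.eye(d)                   # stable placeholder dynamics
B = rng.normal(size=(d, 1))
C = rng.normal(size=(1, d))
ys = ssm_scan(A, B, C, rng.normal(size=16))
print(ys.shape)  # (16, 1)
```

Because the update is linear in `h`, the entire output sequence can equivalently be computed as a convolution or with a parallel scan rather than a sequential loop, which is what makes SSMs efficient to train on very long sequences.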
| File name | Content |
|---|---|
| /configs/table2/mnist_guess_rnn.yaml | Configuration file for the different experiments |
| train_distributed_same_seeds.py | Script to reproduce the results presented in the project's report |
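As a hypothetical usage sketch (the command-line interface of `train_distributed_same_seeds.py` is not documented here), the experiment configuration can be inspected with PyYAML before launching a run. The key names inside the file are unknown; only the file path comes from the table above.

```python
import yaml  # requires PyYAML (pip install pyyaml)

# Path relative to the repository root; see the file table above.
with open("configs/table2/mnist_guess_rnn.yaml") as f:
    config = yaml.safe_load(f)

# Print the loaded experiment settings as a plain dict.
print(config)
```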