Efficient transformers for financial data

Authors: Baikalov Vladimir, Kovaleva Maria, Shlychkov Konstantin, Vo Ngoc Bich Uyen

Problem

The attention-based methods and transformers made a significant breakthrough in the deep learning area and greatly impacted NLP task solutions ¹. Although recent works show that they could potentially improve results in different tasks domains, the application of transformer for financial data in particular transactions data is underexplored.

While applying attention mechanisms, one can face the apparent restriction on input sequence length due to the method's quadratic complexity. Recent papers proposed different ways to overcome this problem, but we want to concentrate on two promising approaches: Informer² and Performer ³.

The Informer is the most current and prospective approach. Its main assumption is that the model should have an "infinite memory" and fit a sequence with arbitrary length. The Performer model shows good results in the NLP task but is not well-explored for other datatypes. Its main idea is to use some trigonometric approximation of the attention matrix to decrease memory consumption.

To sum up, the project aims to compare several recent methods proposed to decrease the evaluation complexity in particular tasks predicting the user's gender based on transactions.

What have been done

baseline model (by Baikalov Vladimir)
training and data processing pipeline (by Baikalov Vladimir)
performer attention (by Shlychkov Konstantin)
informer attention (by Kovaleva Maria)
banchmarking all models in terms of speed and memory consumption (by Shlychkov Konstantin)
report (by all team members)
presentation (by all team members)

Code

You can see all models realization and results of experiments in this notebook, also you can repeat by yourself.

Or run:

python3 ./train.py --params ../configs/baseline_config_train.json
python3 ./train.py --params ../configs/performer_config_train.json
python3 ./train.py --params ../configs/informer_config_train.json

for training (it is required to use cuda)

and

python3 ./inference.py --params ../configs/baseline_config_inference.json
python3 ./inference.py --params ../configs/performer_config_inference.json
python3 ./inference.py --params ../configs/informer_config_inference.json

for experiments.

Results

Results of training

Model	Result of training
Baseline
Performer
Informer

Results of comparison in time and memory consumption

Time consumption comparison	Memory consumption comparison

Conclusion

As you can see models with performer and informer attention layers work faster and require and require much less memory than model with regular multihead attention. While memory and time consumption grows quadratically from the sequence length for the baseline model, models with informer and performer layers consume memory and time linearly.

Literature

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. - "Full attention" model ↩
Martins, Pedro Henrique, Zita Marinho, and André FT Martins. "∞-former: Infinite Memory Transformer." - "Informer" model ↩
Choromanski, Krzysztof, et al. "Rethinking attention with performers." arXiv preprint arXiv:2009.14794 (2020). - "Performer" model ↩

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
configs		configs
figures		figures
notebooks		notebooks
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Efficient transformers for financial data

Problem

What have been done

Code

Results

Results of training

Results of comparison in time and memory consumption

Conclusion

Literature

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Efficient transformers for financial data

Problem

What have been done

Code

Results

Results of training

Results of comparison in time and memory consumption

Conclusion

Literature

Footnotes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages