@AnFreTh AnFreTh commented Sep 28, 2024

New Model Architectures:

  • MambAttn Class: Introduced a new model class, MambAttn, that alternates between Mamba blocks and attention layers, providing a flexible architecture for various deep learning tasks; a minimal sketch of the alternating pattern follows this list. (mambular/arch_utils/mambattn_arch.py)
  • ConvRNN Class: Added the ConvRNN class, which combines convolutional layers with RNN layers, supports multiple RNN types (RNN, LSTM, GRU), and offers optional residual connections; see the second sketch after this list. (mambular/arch_utils/rnn_utils.py)
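
A minimal sketch of the alternating pattern, assuming a standard pre-norm residual layout. This is not the mambular implementation: MambaBlock below is a simplified stand-in for a real selective state-space block, and all names, defaults, and the even/odd alternation rule are assumptions.

```python
import torch
import torch.nn as nn


class MambaBlock(nn.Module):
    """Simplified stand-in for a real Mamba (selective state-space) block."""

    def __init__(self, d_model: int):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.mixer = nn.Linear(d_model, d_model)  # placeholder for the SSM mixer

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.mixer(self.norm(x))  # pre-norm residual


class MambAttnSketch(nn.Module):
    """Alternates Mamba-style blocks with self-attention layers."""

    def __init__(self, d_model: int = 64, n_layers: int = 4, n_heads: int = 4):
        super().__init__()
        self.layers = nn.ModuleList(
            MambaBlock(d_model)
            if i % 2 == 0
            else nn.MultiheadAttention(d_model, n_heads, batch_first=True)
            for i in range(n_layers)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            if isinstance(layer, nn.MultiheadAttention):
                attn_out, _ = layer(x, x, x)
                x = x + attn_out  # residual around attention
            else:
                x = layer(x)
        return x
```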

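A similarly hedged sketch of the conv-then-RNN combination: the RNN type is selected by name and the residual connection is optional. The interface and defaults are assumptions, not the actual rnn_utils.py code.

```python
import torch
import torch.nn as nn

_RNN_TYPES = {"RNN": nn.RNN, "LSTM": nn.LSTM, "GRU": nn.GRU}


class ConvRNNSketch(nn.Module):
    def __init__(self, d_model=64, rnn_type="LSTM", residual=True, kernel_size=3):
        super().__init__()
        # Length-preserving 1D convolution over the sequence dimension.
        self.conv = nn.Conv1d(d_model, d_model, kernel_size, padding=kernel_size // 2)
        self.rnn = _RNN_TYPES[rnn_type](d_model, d_model, batch_first=True)
        self.residual = residual

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model); Conv1d expects channels second.
        h = self.conv(x.transpose(1, 2)).transpose(1, 2)
        out, _ = self.rnn(h)  # works for RNN, LSTM, and GRU alike
        return x + out if self.residual else out
```
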
Integration and Configuration:

  • MambAttention Model: Implemented the MambAttention model, which builds on the MambAttn architecture and supports several normalization techniques and pooling methods; a hedged sketch of the pooling step follows this list. (mambular/base_models/mambattn.py)
  • Model Registration: Registered the MambAttn model in the __init__.py of base_models so it is accessible within the module; see the registration snippet below. (mambular/base_models/__init__.py)
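
How the pooling step of such a model might be dispatched, as a hedged sketch; the method names ("avg", "max", "cls") are assumptions, not necessarily mambular's exact options.

```python
import torch


def pool_sequence(x: torch.Tensor, pooling: str = "avg") -> torch.Tensor:
    """Reduce a (batch, seq_len, d_model) sequence to (batch, d_model)."""
    if pooling == "avg":
        return x.mean(dim=1)
    if pooling == "max":
        return x.max(dim=1).values
    if pooling == "cls":
        return x[:, 0]  # treat the first token as a [CLS]-style summary
    raise ValueError(f"unknown pooling method: {pooling}")
```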

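For the registration itself, the usual pattern in a package's __init__.py looks like this (illustrative only; the exact import path and __all__ contents are assumptions):

```python
# mambular/base_models/__init__.py (sketch)
from .mambattn import MambAttention

__all__ = ["MambAttention"]
```
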
Optimization Enhancements:

  • Early Pruning and Optimizer Configuration: Enhanced lightning_wrapper.py to include early pruning based on validation loss and dynamic optimizer configuration, allowing for more flexible and efficient training. Also added automatic Bayesian HPO for all models, with a config-mapper for automatic hyperparameter-range detection; two hedged sketches follow this list.
    (mambular/base_models/lightning_wrapper.py)
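
A hedged sketch of the two wrapper features named above: pruning a run early when validation loss stops improving, and resolving the optimizer from configuration at runtime. The attribute names, patience logic, and loss function are assumptions, not mambular's lightning_wrapper.py.

```python
import pytorch_lightning as pl
import torch
import torch.nn.functional as F


class WrapperSketch(pl.LightningModule):
    def __init__(self, model, lr=1e-3, optimizer_name="AdamW", patience=5):
        super().__init__()
        self.model = model
        self.lr = lr
        self.optimizer_name = optimizer_name
        self.patience = patience
        self.best_val = float("inf")
        self.stale_epochs = 0

    def training_step(self, batch, batch_idx):
        x, y = batch
        loss = F.mse_loss(self.model(x), y)
        self.log("train_loss", loss)
        return loss

    def validation_step(self, batch, batch_idx):
        x, y = batch
        self.log("val_loss", F.mse_loss(self.model(x), y), prog_bar=True)

    def on_validation_epoch_end(self):
        # Early pruning: stop the trainer once val loss stagnates.
        val_loss = self.trainer.callback_metrics.get("val_loss")
        if val_loss is None:
            return
        if float(val_loss) < self.best_val:
            self.best_val = float(val_loss)
            self.stale_epochs = 0
        else:
            self.stale_epochs += 1
            if self.stale_epochs >= self.patience:
                self.trainer.should_stop = True

    def configure_optimizers(self):
        # Dynamic optimizer configuration: resolve the class by name.
        opt_cls = getattr(torch.optim, self.optimizer_name)
        return opt_cls(self.model.parameters(), lr=self.lr)
```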

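And a sketch of what the config-mapper for Bayesian HPO could look like, assuming an Optuna-style search; the range table, field names, and tuple encoding are illustrative, not the actual mapping.

```python
import optuna

# Hypothetical mapping from config fields to search ranges, as a config-mapper
# might derive them from a model's config class.
HPARAM_RANGES = {
    "lr": ("float", 1e-5, 1e-2, True),  # (kind, low, high, log-scale)
    "d_model": ("int", 32, 256),  # (kind, low, high)
    "pooling": ("categorical", ["avg", "max", "cls"]),
}


def suggest_hparams(trial: optuna.Trial) -> dict:
    """Turn the range table into concrete suggestions for one trial."""
    params = {}
    for name, spec in HPARAM_RANGES.items():
        if spec[0] == "float":
            params[name] = trial.suggest_float(name, spec[1], spec[2], log=spec[3])
        elif spec[0] == "int":
            params[name] = trial.suggest_int(name, spec[1], spec[2])
        else:
            params[name] = trial.suggest_categorical(name, spec[1])
    return params
```
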
@AnFreTh AnFreTh merged commit ed5a0f3 into develop Sep 28, 2024
@AnFreTh AnFreTh deleted the attn branch November 5, 2024 14:58