
fine_tune to a text classification task #32

Open
WilliamHoo opened this issue Dec 8, 2023 · 6 comments

Comments
@WilliamHoo

I am trying to get mamba working for a text classification task by adding a classification head after the model.

For transformer models, people usually use the last_hidden_state as the input to the classification head. Any suggestions for mamba?
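
For reference, the usual transformer pattern looks roughly like this (a minimal sketch; `bert-base-uncased`, the [CLS] pooling, and the 2-class head are arbitrary placeholders, not a recommendation):

```python
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")
head = nn.Linear(encoder.config.hidden_size, 2)  # illustrative 2-class head

batch = tokenizer(["an example sentence"], return_tensors="pt")
hidden = encoder(**batch).last_hidden_state  # (batch, seq_len, hidden_size)
logits = head(hidden[:, 0])                  # pool via the [CLS] token
```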

@WilliamHoo
Author

also, any recommendations on tokenizers?

@albertfgu
Contributor

I don't know much about tokenizers for fine-tuning. For the classification head, many variations are possible: you could grab the final recurrent state of the model (although that might be unsupported in the currently released version); you could grab the output at the last timestep; or you could average the outputs across all timesteps.
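
Sketched out, the last two options might look like this (a minimal sketch, not a released API: `backbone` is a placeholder for any Mamba module that returns per-timestep hidden states of shape (batch, seq_len, d_model)):

```python
import torch.nn as nn

class MambaClassifier(nn.Module):
    """Hypothetical wrapper: a Mamba backbone plus a linear classification head.

    Assumes `backbone(input_ids)` returns per-timestep hidden states of
    shape (batch, seq_len, d_model).
    """

    def __init__(self, backbone, d_model, num_classes, pooling="mean"):
        super().__init__()
        self.backbone = backbone
        self.pooling = pooling
        self.head = nn.Linear(d_model, num_classes)

    def forward(self, input_ids):
        hidden = self.backbone(input_ids)  # (batch, seq_len, d_model)
        if self.pooling == "last":
            pooled = hidden[:, -1]         # output at the last timestep
        else:
            pooled = hidden.mean(dim=1)    # average outputs over all timesteps
        return self.head(pooled)           # (batch, num_classes)
```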

@turian

turian commented Dec 20, 2023

@WilliamHoo If you do figure out how to do this, I would be curious

@jmunozmendiFunditec

I would also be interested. Any news?

@maksymdolgikh

See my comment here #163 for a suggestion.

@getorca

getorca commented Apr 29, 2024

I've put https://github.com/getorca/mamba_for_sequence_classification together; it's compatible with HF, so you can use mamba for sequence classification.
