Thanks for the amazing work! Can the train_mamba.py be used to pretrain the model?
Thanks for the amazing work! Can the train_mamba.py be used to pretrain the model?