This repository was archived by the owner on Apr 12, 2026. It is now read-only.
Now with mBART translations!
The main highlight of this release is the addition of mBART training as a task, so far slightly different from the original one, but similar enough to work in our tests.
Added
- The
--tf32-modeoption allows to select the level of NVidia Ampère matmul otpimisations. - The
--seedoption allows to fix a random seed. - The
mbarttask allows training general seq2seq and translation models. - A
zeldarosecommand that serves as entry point for both tokenizer and transformer training.
Changed
- BREAKING
--use-fp16has been replaced by--precision, which allows to also use fp64 and
bfloat. Previous behaviour can be emulated with--precision 16. - Remove the GPU stats logging from the profile mode since Lightning stopped supporting it
- Switched TOML library from toml to
tomli - BREAKING Bumped the min version of several dependency
pytorch-lightning >= 1.8.0torch >= 1.12
- Bumped max version of several dependency
datasets < 2.10pytorch-lightning < 1.9tokenizers < 0.14