This repository was archived by the owner on Apr 12, 2026. It is now read-only.
v0.2.0 – Now eating less RAM
Added
--checkpointoption to load an existing lightning checkpoint- DDP sharding is now also possible with
ddp_spawn
Changed
- Text datasets are now loaded line-by-line by default and the block mode has been removed.
- We now use 🤗 datasets as backend, so the datasets are
implemented as memory-mapped files with dynamic loaders instead of being held in RAM. This
significantly decrease RAM consumption for a very decent speed cost and allows us to train on much
larger datasets. - GPU usage is now logged in
--profilemode when relevant. - LR is now logged.
Removed
- The
--line-by-lineflag has been removed, since this is now the default behaviour. - The
zeldarose-create-cachehas been removed, since dataset processing now works correctly in
ddp. - The
datamodule has been completely rewritten and the Dataset classes are no more. mlm.masked_accuracysince it was not used anywhere.
Fixed
- Logging has been improved for internal pytorch warnings and pytorch-lightning and 🤗 transformers.