This repository was archived by the owner on Apr 12, 2026. It is now read-only.

v0.2.0 – Now eating less RAM

LoicGrobol released this 23 Apr 08:15

fb6bdaa

Added

--checkpoint option to load an existing lightning checkpoint
DDP sharding is now also possible with ddp_spawn

Changed

Text datasets are now loaded line-by-line by default and the block mode has been removed.
We now use 🤗 datasets as backend, so the datasets are
implemented as memory-mapped files with dynamic loaders instead of being held in RAM. This
significantly decrease RAM consumption for a very decent speed cost and allows us to train on much
larger datasets.
GPU usage is now logged in --profile mode when relevant.
LR is now logged.

Removed

The --line-by-line flag has been removed, since this is now the default behaviour.
The zeldarose-create-cache has been removed, since dataset processing now works correctly in
ddp.
The data module has been completely rewritten and the Dataset classes are no more.
mlm.masked_accuracy since it was not used anywhere.

Fixed

Logging has been improved for internal pytorch warnings and pytorch-lightning and 🤗 transformers.

Assets 2