Skip to content
This repository was archived by the owner on Apr 12, 2026. It is now read-only.

v0.2.0 – Now eating less RAM

Choose a tag to compare

@LoicGrobol LoicGrobol released this 23 Apr 08:15

Added

  • --checkpoint option to load an existing lightning checkpoint
  • DDP sharding is now also possible with ddp_spawn

Changed

  • Text datasets are now loaded line-by-line by default and the block mode has been removed.
  • We now use 🤗 datasets as backend, so the datasets are
    implemented as memory-mapped files with dynamic loaders instead of being held in RAM. This
    significantly decrease RAM consumption for a very decent speed cost and allows us to train on much
    larger datasets.
  • GPU usage is now logged in --profile mode when relevant.
  • LR is now logged.

Removed

  • The --line-by-line flag has been removed, since this is now the default behaviour.
  • The zeldarose-create-cache has been removed, since dataset processing now works correctly in
    ddp.
  • The data module has been completely rewritten and the Dataset classes are no more.
  • mlm.masked_accuracy since it was not used anywhere.

Fixed

  • Logging has been improved for internal pytorch warnings and pytorch-lightning and 🤗 transformers.