Skip to content

Releases: microsoft/DeepSpeed

v0.5.3: Patch release

18 Sep 05:07
30537e7
Compare
Choose a tag to compare
[zero_to_fp32] adapt to 4-bytes alignment in z2 (#1372)

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>

v0.5.2: Patch release

14 Sep 22:50
8e577c9
Compare
Choose a tag to compare
Update setup.py (#1361)

updated classifiers

v0.5.1: Patch release

26 Aug 22:24
49b6a63
Compare
Choose a tag to compare
Reducing the memory-overhead of creating model for multi-GPU run (#1244)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

DeepSpeed v0.5.0

17 Aug 05:29
f284324
Compare
Choose a tag to compare
  • Mixture of Experts (MoE) support
  • Curriculum learning

v0.4.5: Patch release

10 Aug 17:45
c543a41
Compare
Choose a tag to compare
Use correct input size for splits (#1284)

* Use correct input size for splits

* Use smarter partitioning

v0.4.4: Patch release

30 Jul 19:50
40c381d
Compare
Choose a tag to compare
[Doc] round_robin_gradients (#1261)

* Fix docstring

* Make screenshots clickable for easier viewing

* Navigation menu in alphabetical order; More clicable screenshots

* Rename 1Cycle doc

* Tweak naming

* Remove no longer used flag

* ZeRO3 Offload release

* Single GPU results

* Rearrange figures

* Single GPU text

* tweak intro

* zero3-offload section

* Add asynchronous i/o docs

* Fix print_per_steps doc

* Document round_robin_gradients

* Tweak description

* Trigger CI

v0.4.3: Patch release

13 Jul 16:14
bc451c0
Compare
Choose a tag to compare
revert part of #1220 (#1221)

https://github.com/microsoft/DeepSpeed/pull/1220 fixed the leak, but lead to another problem. reverting that part so that we could do release and will work on it after the release.

@jeffra

v0.4.2: Patch release

01 Jul 17:00
a029239
Compare
Choose a tag to compare
clean up logging (#1190)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

v0.4.1: Patch release

23 Jun 17:19
3b68984
Compare
Choose a tag to compare
remove torchvision dependency (#1178)

DeepSpeed v0.4.0