Releases · microsoft/DeepSpeed
v0.5.3: Patch release
[zero_to_fp32] Adapt to 4-byte alignment in ZeRO stage 2 (#1372)
v0.5.2: Patch release
Update setup.py classifiers (#1361)
v0.5.1: Patch release
Reduce the memory overhead of creating a model for multi-GPU runs (#1244)
DeepSpeed v0.5.0
- Mixture of Experts (MoE) support (see the sketch after this list)
- Curriculum learning
v0.4.5: Patch release
Use correct input size for splits; use smarter partitioning (#1284)
v0.4.4: Patch release
[Doc] Document round_robin_gradients (#1261); a config sketch follows below. The same PR also fixes the print_per_steps docstring, makes screenshots clickable, puts the navigation menu in alphabetical order, renames the 1Cycle doc, removes a no-longer-used flag, and adds ZeRO-3 Offload release notes (including single-GPU results) and asynchronous I/O docs.
v0.4.3: Patch release
Revert part of #1220 (#1221). PR #1220 fixed the leak but led to another problem; that part is reverted so the release can go out, and the fix will be revisited afterwards.
v0.4.2: Patch release
Clean up logging (#1190)
v0.4.1: Patch release
Remove torchvision dependency (#1178)
DeepSpeed v0.4.0
- [Press release] DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression
- New inference API (see the inference setup guide); a usage sketch follows this list
- DeepSpeed Inference: multi-GPU inference with customized inference kernels and quantization support
- Mixture-of-Quantization (MoQ): a novel quantization approach for reducing model size with minimal accuracy impact
- See the MoQ tutorial for more details.