Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

microsoft / DeepSpeed Public

Notifications You must be signed in to change notification settings
Fork 4.1k
Star 35.3k

Code
Issues 956
Pull requests 122
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: microsoft/DeepSpeed

Labels 33 Milestones 0

Labels 33 Milestones 0

New pull request New

122 Open 2,891 Closed

122 Open 2,891 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

fix issue #6671

#6698 opened Nov 1, 2024 by stephen-nju

Loading…

2

Reduce the device bubble introduced by heavy loop synchronization in coalesced fetch/release(z3_leaf_module)

#6694 opened Oct 31, 2024 by inkcherry

Loading…

Update MII tests to support transformers latest

#6686 opened Oct 29, 2024 by loadams

Loading…

Add no_sync context manager

#6675 opened Oct 27, 2024 by tjruwase

Loading…

2

Allow to compile collective for PT > 2.3

#6674 opened Oct 27, 2024 by nelyahu

Loading…

1

Use one param coordinator for both train/inference scenarios

#6662 opened Oct 23, 2024 by tohtana

Loading…

A faster and more memory-efficient implementation of zero_to_fp32

#6658 opened Oct 23, 2024 by xu-song

Loading…

Support the parallel conversion from ZeRO checkpoints to FP32/FP16/BF16 param weight

#6655 opened Oct 23, 2024 by xylian86

Loading…

5 tasks done

2

add zero3 coalesced parameters fetch to zero optimization.

#6649 opened Oct 21, 2024 by inkcherry

Loading…

8

AIO File Offsets

#6641 opened Oct 18, 2024 by jomayeri

Loading…

[DO NOT MERGE] Log test results to file

#6627 opened Oct 15, 2024 by tohtana • Draft

6

modify_load_save_model

#6626 opened Oct 15, 2024 by ssklzx

Loading…

3

[Bug Fix] Support threads_per_head < 64 for wavefront size of 64

#6622 opened Oct 11, 2024 by jagadish-amd

Loading…

Improve consistency of zero_grad

#6554 opened Sep 18, 2024 by tohtana • Draft

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

#6553 opened Sep 18, 2024 by gyou2021

Loading…

4

Set shuffle=True by default in data_sampler

#6531 opened Sep 13, 2024 by ranzhejiang

Loading…

1

Change compile for pipeline module torch.compile

#6478 opened Sep 2, 2024 by NirSonnenschein

Loading…

4

Adding the new feature of FPDT

#6462 opened Aug 29, 2024 by YJHMITWEB

Loading…

Unpin tests that previously used a pinned version of transformers

#6387 opened Aug 20, 2024 by loadams

Loading…

1

Add weights_only=True in torch.load

#6094 opened Aug 17, 2024 by terry-for-github

Loading…

8

[NaN check] Add NaN check to support bfloat16.

#5879 opened Aug 8, 2024 by ys950902

Loading…

2

Fix circular import in ds_transformer.py

#5804 opened Jul 28, 2024 by sznmelvin

Loading…

3

Add DataStates-LLM: Asynchronous Checkpointing Engine Support

#5763 opened Jul 10, 2024 by mauryaavinash95 • Draft

3

Switch what versions of python are supported

#5676 opened Jun 17, 2024 by loadams

Loading…

1

Hybrid Offloading for ZeRO3

#5625 opened Jun 7, 2024 by tohtana • Draft

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Updated in the last three days: updated:>2024-10-31.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.