DeepSpeed v0.16.0
What's Changed
- Update version.txt after 0.15.4 release by @loadams in #6731
- Update GH hosted workflows to 24.04 by @loadams in #6717
- Add COMMITTER file by @tjruwase in #6741
- Update AMD apex version by @loadams in #6739
- Fix Type Name Inconsistency & Typo in cpu_adam by @xylian86 in #6732
- Add Domino code by @zhangsmallshark in #6733
- Add data type check for bf16 by @hwchen2017 in #6742
- add zero3
module_granularity_threshold
to zero optimization. by @inkcherry in #6649 - AIO File Offsets by @jomayeri in #6641
- Update path for BingBertSquad from DeepSpeedExamples by @loadams in #6746
- Sanitize inputs to eval() by @loadams in #6745
- Adding the governance doc by @minjiazhang in #6748
- Add no_sync context manager by @tjruwase in #6675
- Gaudi2 Nightly job for daily check by @raza-sikander in #6753
- Disable failing python tests by @loadams in #6758
- A faster and more memory-efficient implementation of
zero_to_fp32
by @xu-song in #6658 - Pin transformers version to work around latest torch requirements by @loadams in #6759
- make xpu ops compatible with oneapi 2025.0 by @baodii in #6760
- Add explicit parameters for torch.load by @loadams in #6751
- Fix setup.py bash cmd generation to correctly extract git info by @nelyahu in #6762
- Use
json_schema_extra
instead of extra keyword inField
by @qgallouedec in #6764 - Fix potential memory issues when use deepspeed Z3 by @wenbinc-Bin in #6726
- Removes unnecessary cloning by @swigls in #6761
- Enable torch compile on _allgather_params by @deepcharm in #6769
- Unpin with latest transformers fixes by @loadams in #6763
- docs: fix HF links by @imba-tjd in #6780
- Fix Doc Error: ZeRO Stage 2 gradient partitioning by @yewentao256 in #6775
- Cleanup code docs warnings by @loadams in #6783
- Domino Blog by @GuanhuaWang in #6776
- Update version.txt before release by @loadams in #6784
- Revert release workflow by @loadams in #6785
New Contributors
- @zhangsmallshark made their first contribution in #6733
- @hwchen2017 made their first contribution in #6742
- @minjiazhang made their first contribution in #6748
- @qgallouedec made their first contribution in #6764
- @wenbinc-Bin made their first contribution in #6726
- @swigls made their first contribution in #6761
- @imba-tjd made their first contribution in #6780
- @yewentao256 made their first contribution in #6775
Full Changelog: v0.15.4...v0.16.0