Skip to content

Conversation

tjruwase
Copy link
Contributor

@tjruwase tjruwase commented Feb 17, 2025

  • Extend APIs for debugging and modifying ZeRO partitioned states to NVMe offload.
  • Add vectorized update API. This is performance-critical for NVMe offloading scenarios.

@tjruwase tjruwase requested review from stas00 and GuanhuaWang and removed request for jomayeri and loadams February 17, 2025 15:49
Copy link
Contributor

@GuanhuaWang GuanhuaWang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just some minor comments for code readability.

loadams and others added 21 commits March 10, 2025 11:02
Signed-off-by: Masahiro Tanaka <[email protected]>
Signed-off-by: Olatunji Ruwase <[email protected]>
…dai/DeepSpeed into olruwase/update_nvme_offload_states
Signed-off-by: Olatunji Ruwase <[email protected]>
…speedai/DeepSpeed into olruwase/update_nvme_offload_states
@sfc-gh-truwase sfc-gh-truwase added this pull request to the merge queue May 19, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks May 19, 2025
@loadams loadams enabled auto-merge May 19, 2025 22:55
@loadams loadams added this pull request to the merge queue May 20, 2025
Merged via the queue into master with commit 0e74171 May 20, 2025
13 checks passed
@loadams loadams deleted the olruwase/update_nvme_offload_states branch May 20, 2025 01:25
deepcharm pushed a commit to deepcharm/DeepSpeed that referenced this pull request Jun 16, 2025
- Extend APIs for
[debugging](https://deepspeed.readthedocs.io/en/latest/zero3.html#debugging)
and
[modifying](https://deepspeed.readthedocs.io/en/latest/zero3.html#modifying-partitioned-states)
ZeRO partitioned states to NVMe offload.
- Add vectorized update API. This is performance-critical for NVMe
offloading scenarios.

---------

Signed-off-by: Olatunji Ruwase <[email protected]>
Signed-off-by: Masahiro Tanaka <[email protected]>
Co-authored-by: Logan Adams <[email protected]>
Co-authored-by: Logan Adams <[email protected]>
Co-authored-by: Masahiro Tanaka <[email protected]>
Co-authored-by: Masahiro Tanaka <[email protected]>
Co-authored-by: Guanhua Wang <[email protected]>
Signed-off-by: Max Kovalenko <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants