Skip to content

Releases: THUDM/slime

v0.1.0

31 Aug 16:35
261ecee
Compare
Choose a tag to compare

Performance Optimizations

  • SGLang: FP8 + DeepEP + speculative decoding
  • Megatron: all parallel strategy supports (TP, PP, VPP, EP, CP, etc) + DeepEP + CPU Adam.
  • New Megatron offload strategy with better memory usage.
  • Faster weight updation.

New Algorithm Supports

  • GSPO
  • TIS
  • reinforce++ & reinforce++ base

Correctness

  • CI for E2E GLM4 9B adn Qwen3 30B-A3B training
  • CI for Build Conda environment