Releases: THUDM/slime
Releases · THUDM/slime
v0.1.0
Performance Optimizations
- SGLang: FP8 + DeepEP + speculative decoding
- Megatron: all parallel strategy supports (TP, PP, VPP, EP, CP, etc) + DeepEP + CPU Adam.
- New Megatron offload strategy with better memory usage.
- Faster weight updation.
New Algorithm Supports
- GSPO
- TIS
- reinforce++ & reinforce++ base
Correctness
- CI for E2E GLM4 9B adn Qwen3 30B-A3B training
- CI for Build Conda environment