THUDM/slime
by THUDM · Training Frameworks · updated today
slime is an LLM post-training framework for RL Scaling.
75 momentum · 6,107 stars · 891 forks · rank #10