The Fine-Tuning Index / Training Frameworks / #10

THUDM/slime

by THUDM · Training Frameworks · updated today

slime is an LLM post-training framework for RL Scaling.

75
momentum
6,107
stars
891
forks
#10
rank
View on GitHub →