FINE‑TUNING/INDEX

The Fine-Tuning Index / Training Frameworks / #10

THUDM/slime

by THUDM · Training Frameworks · updated today

slime is an LLM post-training framework for RL Scaling.

75

momentum

6,107

stars

891

forks

#10

rank

View on GitHub →