The Fine-Tuning Index / RLHF & Preference / #1
unslothai/unsloth
by unslothai · RLHF & Preference · updated today
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
87
momentum
66,406
stars
5,948
forks
#1
rank
agentdeepseekfine-tuninggemmagemma3gpt-ossllamallama3llmllmsmistralopenai
View on GitHub →