The Fine-Tuning Index / RLHF & Preference / #1

unslothai/unsloth

by unslothai · RLHF & Preference · updated today

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

87
momentum
66,406
stars
5,948
forks
#1
rank
agentdeepseekfine-tuninggemmagemma3gpt-ossllamallama3llmllmsmistralopenai
View on GitHub →