The Fine-Tuning Index / RLHF & Preference / #1

unslothai/unsloth

by unslothai · RLHF & Preference · updated today

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

momentum

66,406

stars

5,948

forks

rank

agentdeepseekfine-tuninggemmagemma3gpt-ossllamallama3llmllmsmistralopenai

More in RLHF & Preference