oumi-ai/oumi
by oumi-ai · RLHF & Preference · updated today
Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!
Momentum: 77 · Stars: 9,315 · Forks: 777 · Rank: #8
dpo · evaluation · fine-tuning · gpt-oss · gpt-oss-120b · gpt-oss-20b · inference · llama · llms · sft · slms · vlms