The Fine-Tuning Index / RLHF & Preference / #8

oumi-ai/oumi

by oumi-ai · RLHF & Preference · updated today

Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!

77
momentum
9,315
stars
777
forks
#8
rank
dpoevaluationfine-tuninggpt-ossgpt-oss-120bgpt-oss-20binferencellamallmssftslmsvlms
View on GitHub →