The Fine-Tuning Index / RLHF & Preference / #36
agentscope-ai/Trinity-RFT
by agentscope-ai · RLHF & Preference · updated 3d ago
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).
63
momentum
650
stars
71
forks
#36
rank
agentllmrlhf
View on GitHub →