agentscope-ai/Trinity-RFT

by agentscope-ai · RLHF & Preference · updated 3d ago

Trinity-RFT is a general-purpose, flexible, and scalable framework for reinforcement fine-tuning (RFT) of large language models (LLMs).

Momentum: 63 · Stars: 650 · Forks: 71 · Rank: #36
agent · llm · rlhf