The Fine-Tuning Index / RLHF & Preference / #94
hhnqqq/MyTransformers
by hhnqqq · RLHF & Preference · updated 2mo ago
This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel strategies and a rich collection of LoRA variants. It serves as a flexible and efficient model fine-tuning toolkit for researchers and developers. Please contact hehn@mail.ustc.edu.cn for detailed information.
38
momentum
62
stars
9
forks
#94
rank