The Fine-Tuning Index / RLHF & Preference / #94

hhnqqq/MyTransformers

by hhnqqq · RLHF & Preference · updated 2mo ago

This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel strategies and a rich collection of LoRA variants. It serves as a flexible and efficient model fine-tuning toolkit for researchers and developers. Please contact hehn@mail.ustc.edu.cn for detailed information.

momentum

stars

forks

#94

rank

View on GitHub →

hhnqqq/MyTransformers

More in RLHF & Preference