TUDB-Labs/mLoRA
by TUDB-Labs · RLHF & Preference · updated 1y ago
An Efficient "Factory" to Build Multiple LoRA Adapters
29 momentum · 379 stars · 67 forks · rank #122
Tags: baichuan · chatglm · dpo · finetune · gpu · llama · llama2 · llm · lora · mlora · peft · rlhf