FINE‑TUNING/INDEX

TUDB-Labs/mLoRA

by TUDB-Labs · RLHF & Preference · updated 1y ago

An Efficient "Factory" to Build Multiple LoRA Adapters

momentum: 29 · stars: 379 · forks: 67 · rank: #122

Tags: baichuan, chatglm, dpo, finetune, gpu, llama, llama2, llm, lora, mlora, peft, rlhf