The Fine-Tuning Index / RLHF & Preference / #75

LLMBook-zh/LLMBook-zh.github.io

by LLMBook-zh · RLHF & Preference · updated 9mo ago

《大语言模型》作者：赵鑫，李军毅，周昆，唐天一，文继荣

momentum

4,495

stars

339

forks

#75

rank

artificial-intelligencedeep-learningdeep-neural-networksdeep-reinforcement-learningfine-tuninglanguage-modellarge-language-modelsnatural-language-processingnlppretrained-models

View on GitHub →

LLMBook-zh/LLMBook-zh.github.io

More in RLHF & Preference