The Fine-Tuning Index / RLHF & Preference / #75
LLMBook-zh/LLMBook-zh.github.io
by LLMBook-zh · RLHF & Preference · updated 9mo ago
《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣
41
momentum
4,495
stars
339
forks
#75
rank
artificial-intelligencedeep-learningdeep-neural-networksdeep-reinforcement-learningfine-tuninglanguage-modellarge-language-modelsnatural-language-processingnlppretrained-models
View on GitHub →