The Fine-Tuning Index / RLHF & Preference / #60

bcefghj/learn-MedicalGPT

by bcefghj · RLHF & Preference · updated 2mo ago

🏥 从零基础到面试通关:20节课彻底搞懂MedicalGPT医疗大模型训练全流程 | PT/SFT/LoRA/RLHF/DPO/GRPO | 100+面试高频考点

48
momentum
152
stars
11
forks
#60
rank
View on GitHub →