bcefghj/learn-MedicalGPT
by bcefghj · RLHF & Preference · updated 2mo ago
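🏥 From zero background to interview-ready: 20 lessons to thoroughly understand the full MedicalGPT medical LLM training pipeline | PT/SFT/LoRA/RLHF/DPO/GRPO | 100+ frequently asked interview topics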
momentum 48 · stars 152 · forks 11 · rank #60