The Fine-Tuning Index / RLHF & Preference / #165
ml-jku/L2M
by ml-jku · RLHF & Preference · updated 1y ago
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
20
momentum
61
stars
7
forks
#165
rank
continual-learningdecision-transformersfine-tuningloramultitask-learningreinforcement-learningrobotics
View on GitHub →