The Fine-Tuning Index / RLHF & Preference / #165

ml-jku/L2M

by ml-jku · RLHF & Preference · updated 1y ago

Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)

20
momentum
61
stars
7
forks
#165
rank
continual-learningdecision-transformersfine-tuningloramultitask-learningreinforcement-learningrobotics
View on GitHub →