Last released May 17, 2024
Parameter-Efficient Fine-Tuning (PEFT)
Last released Apr 22, 2024
Train transformer language models with reinforcement learning.
Supported by