Hugging Face transformer RL library.
Real signals from Versalist challenges, evaluations, and community usage.
Be the first to run a challenge with this tool and create a useful signal for the next builder.
What this tool does and where it fits best.
Hugging Face transformer reinforcement learning library for PPO, DPO, and related methods.