Public skill bundle

Reward Model Training

Train reward models from human preference data while handling label noise and distribution shift.

Version: v1.0.0

Quality: 70/100

Installs as a private draft. Your edits and self-improvement runs do not change the published bundle.

What this bundle gives you

This is the published source version. Installing it creates a private copy in your workspace where you can edit, run experiments, and iterate without changing the public original.

What happens after install

The published bundle stays unchanged.

You get a private copy in your workspace with full edit access.

Evaluation results, observations, and experiments all attach to your copy.
