Public skill bundle

Implement RLHF Pipeline

End-to-end: collect preferences → train reward model → optimize policy

Version

v1.0.0

Quality

70/100

Installs

Files

Installs as a private draft. Your edits and self-improvement runs do not change the published bundle.

Open My Skills

What this bundle gives you

This is the published source version. Installing it creates a private copy in your workspace where you can edit, run experiments, and iterate without changing the public original.

What happens after install

The published bundle stays unchanged.

You get a private copy in your workspace with full edit access.

Evaluation results, observations, and experiments all attach to your copy.

Implement RLHF Pipeline

What this bundle gives you

What happens after install

Distribution summary