Skill Bundles

Read the source. Install what you trust.

Each skill bundle packages a reusable agent behavior — a prompt, supporting files, and evaluation criteria. Browse the public catalog, review the full source, then install a private copy you can edit and experiment with.

Published bundles

109

Total installs

Average quality

70/100

Browse bundles

109 published bundles ready to inspect and install

Skill bundle v1.0.0

Domain Specific Eval Design

Build evals for specialized verticals (legal, medical, finance, engineering)

0 installs

70/100 quality

Compatibility not listed

Skill bundle v1.0.0

Eval Contamination Prevention

Ensure training data and eval data don't overlap

0 installs

70/100 quality

Compatibility not listed

Skill bundle v1.0.0

Adversarial Eval Generation

Create evals specifically designed to find failure modes and edge cases

Browse bundles

Domain Specific Eval Design

Eval Contamination Prevention

Adversarial Eval Generation

Eval Saturation Detection

Eval Coverage Analysis

Build Fuzzy Eval

Build Deterministic Eval

Outcome vs. Process Reward Tradeoff

Reward Calibration

Human Feedback Collection

Reward Hacking Detection

Reward Shaping