ml-lab¶
A Claude Code plugin that runs structured ML hypothesis investigations using an adversarial critic-defender debate protocol or an independent ensemble of critics. It enforces rigor at every step — pre-specified metrics, confidence-tiered review findings, agreed experiments only — and produces a self-contained report with a production re-evaluation.
When to use ml-lab¶
Use ml-lab when you have a falsifiable ML hypothesis and want a structured process to test it. The plugin handles the full lifecycle: sharpening the hypothesis, building a minimal proof-of-concept, adversarial review (or ensemble sweep), pre-registered experiments, and synthesis of conclusions. It is designed for rigor over speed.
Quick start¶
# Install the plugin
/plugin marketplace add chris-santiago/ml-lab
/plugin install ml-lab@ml-lab
# Start an investigation
/ml-lab
Where to go next¶
-
Tutorials
Get started with ml-lab — from installation to running your first investigation.
-
How-to Guides
Step-by-step instructions for specific tasks: running investigations, logging decisions, querying the journal.
-
Reference
Complete reference for agents, skills, entry types, scripts, and configuration.
-
Explanation
Understand why ml-lab works the way it does — the debate protocol, evaluation methodology, and lessons learned.
-
Research
The experiments behind ml-lab — eight versions of self-evaluation, the working paper, and related work.