The Arcade
The arcade — Blind Rank, Beat the Baseline, and measured embedding proof tools.
Trigger full lifecycle runs while R&D continues background evaluation. ← fleet map
The arcade — Blind Rank, Beat the Baseline, and measured embedding proof tools.
Validation Lab - paid RAG certification with published scorecards and the promotion queue.
Applied Research - live arXiv retrieval assistant and research method registry.
Observatory - run telemetry, effective-rank monitoring, collapse alerts (internal).
POST /v1/admin/hill-climb (uses workspace for each siteId). Requires API_SECRET_KEY on Headquarters.After eval gates pass, the worker appends candidates to the Validation Queue. Issue a charter, then deploy the approved model version.
API_SECRET_KEY