Install

Terminal · npx

$npx skills add https://github.com/obra/superpowers --skill test-driven-development

Works with Paperclip

How Minimal Run And Audit fits into a Paperclip company.

Minimal Run And Audit drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

SaaS FactoryPaired

Pre-configured AI company — 18 agents, 18 skills, one-time purchase.

$27$59

Explore pack

Source file

SKILL.md47 linesmarkdown

Expand

1---2name: minimal-run-and-audit3description: Trusted-lane execution and reporting skill for README-first AI repo reproduction. Use when the task is specifically to capture or normalize evidence from the selected smoke test or documented inference or evaluation command and write standardized `repro_outputs/` files, including patch notes when repository files changed. Do not use for training execution, initial repo intake, generic environment setup, paper lookup, target selection, or end-to-end orchestration by itself.4---5 6# minimal-run-and-audit7 8## When to apply9 10- After a reproduction target and setup plan exist.11- When the main skill needs execution evidence and normalized outputs.12- When a smoke test, documented inference run, documented evaluation run, or other short non-training verification is appropriate.13- When the user already knows what command should be attempted and wants execution plus reporting only.14 15## When not to apply16 17- During initial repo scanning.18- When environment or assets are still undefined enough to make execution meaningless.19- When the task is a literature lookup rather than repository execution.20- When the user is still deciding which reproduction target should count as the main run.21 22## Clear boundaries23 24- This skill owns normalized reporting for an attempted command.25- It may receive execution evidence from the main skill or a thin helper.26- It does not choose the overall target on its own.27- It does not perform broad paper analysis.28- It does not own training startup, resume, or long-running training state.29- It should not normalize risky code edits into acceptable practice.30 31## Input expectations32 33- selected reproduction goal34- runnable commands or smoke commands35- environment and asset assumptions36- optional patch metadata37 38## Output expectations39 40- execution result summary41- standardized `repro_outputs/` files42- clear distinction between verified, partial, and blocked states43- `PATCHES.md` when repo files changed44 45## Notes46 47Use `references/reporting-policy.md`, `scripts/run_command.py`, and `scripts/write_outputs.py`.

Related skills

Env And Assets Bootstrap

When you're trying to reproduce an AI research repo and need to set up the environment before running anything, this handles the tedious bootstrap work. It gene

Paper Context Resolver

When you're reproducing an AI paper from a GitHub repo and hit a specific gap the README can't fill, this resolves narrow technical details from the original pa

Repo Intake And Plan

Takes a fresh repo and does the boring first pass: reads the README, scans for setup scripts and documented commands, then categorizes what looks like inference