Name: Gsd 2 Agent Framework
Author: Aradotso
Install
Terminal · npx
$npx skills add https://github.com/vercel-labs/agent-skills --skill vercel-react-best-practices
Works with Paperclip
How Gsd 2 Agent Framework fits into a Paperclip company.

Gsd 2 Agent Framework drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.
SaaS FactoryPaired
Pre-configured AI company — 18 agents, 18 skills, one-time purchase.
$27$59
Explore pack
Source file
SKILL.md510 linesmarkdown
Expand
1---2name: gsd-2-agent-framework3description: Meta-prompting, context engineering, and spec-driven development system for autonomous long-running coding agents4triggers:5  - gsd autonomous agent6  - spec-driven development7  - context engineering coding8  - long running agent task9  - gsd auto mode10  - milestone slice task hierarchy11  - gsd-pi cli agent12  - autonomous coding agent framework13---14 15# GSD 2 — Autonomous Spec-Driven Agent Framework16 17> Skill by [ara.so](https://ara.so) — Daily 2026 Skills collection18 19GSD 2 is a standalone CLI that turns a structured spec into running software autonomously. It controls the agent harness directly — managing fresh context windows per task, git worktree isolation, crash recovery, cost tracking, and stuck detection — rather than relying on LLM self-loops. One command, walk away, come back to a built project with clean git history.20 21---22 23## Installation24 25```bash26npm install -g gsd-pi27```28 29Requires Node.js 18+. Works with Claude (Anthropic) as the underlying model via the Pi SDK.30 31---32 33## Core Concepts34 35### Work Hierarchy36 37```38Milestone  →  a shippable version (4–10 slices)39  Slice    →  one demoable vertical capability (1–7 tasks)40    Task   →  one context-window-sized unit of work41```42 43**Iron rule:** A task must fit in one context window. If it can't, split it into two tasks.44 45### Directory Layout46 47```48project/49├── .gsd/50│   ├── STATE.md          # current auto-mode position51│   ├── DECISIONS.md      # architecture decisions register52│   ├── LOCK              # crash recovery lock file53│   ├── milestones/54│   │   └── M1/55│   │       ├── slices/56│   │       │   └── S1/57│   │       │       ├── PLAN.md        # task breakdown with must-haves58│   │       │       ├── RESEARCH.md    # codebase/doc scouting output59│   │       │       ├── SUMMARY.md     # completion summary60│   │       │       └── tasks/61│   │       │           └── T1/62│   │       │               ├── PLAN.md63│   │       │               └── SUMMARY.md64│   └── costs/65│       └── ledger.json   # per-unit token/cost tracking66├── ROADMAP.md            # milestone/slice structure67└── PROJECT.md            # project description and goals68```69 70---71 72## Commands73 74### `/gsd auto` — Primary Autonomous Mode75 76Run the full automation loop. Reads `.gsd/STATE.md`, dispatches each unit in a fresh session, handles recovery, and advances through the entire milestone without intervention.77 78```bash79/gsd auto80# or with options:81/gsd auto --budget 5.00        # pause if cost exceeds $582/gsd auto --milestone M1       # run only milestone 183/gsd auto --dry-run            # show dispatch plan without executing84```85 86### `/gsd init` — Initialize a Project87 88Scaffold the `.gsd/` directory from a `ROADMAP.md` and optional `PROJECT.md`.89 90```bash91/gsd init92```93 94Creates initial `STATE.md`, registers milestones and slices from your roadmap, sets up the cost ledger.95 96### `/gsd status` — Dashboard97 98Shows current position, per-slice costs, token usage, and what's queued next.99 100```bash101/gsd status102```103 104Output example:105```106Milestone 1: Auth System  [3/5 slices complete]107  ✓ S1: User model + migrations108  ✓ S2: Password auth endpoints109  ✓ S3: JWT session management110  → S4: OAuth integration  [PLANNING]111    S5: Role-based access control112 113Cost: $1.84 / $5.00 budget114Tokens: 142k input, 38k output115```116 117### `/gsd run` — Single Unit Dispatch118 119Execute one specific unit manually instead of running the full loop.120 121```bash122/gsd run --slice M1/S4            # run research + plan + execute for a slice123/gsd run --task M1/S4/T2          # run a single task124/gsd run --phase research M1/S4   # run just the research phase125/gsd run --phase plan M1/S4       # run just the planning phase126```127 128### `/gsd migrate` — Migrate from v1129 130Import old `.planning/` directories from the original Get Shit Done.131 132```bash133/gsd migrate                        # migrate current directory134/gsd migrate ~/projects/old-project # migrate specific path135```136 137### `/gsd costs` — Cost Report138 139Detailed cost breakdown with projections.140 141```bash142/gsd costs143/gsd costs --by-phase144/gsd costs --by-slice145/gsd costs --export costs.csv146```147 148---149 150## Project Setup151 152### 1. Write `ROADMAP.md`153 154```markdown155# My Project Roadmap156 157## Milestone 1: Core API158 159### S1: Database schema and migrations160Set up Postgres schema for users, posts, and comments.161 162### S2: REST endpoints163CRUD endpoints for all resources with validation.164 165### S3: Authentication166JWT-based auth with refresh tokens.167 168## Milestone 2: Frontend169 170### S1: React app scaffold171...172```173 174### 2. Write `PROJECT.md`175 176```markdown177# My Project178 179A REST API for a blogging platform built with Express + TypeScript + Postgres.180 181## Tech Stack182- Node.js 20, TypeScript 5183- Express 4184- PostgreSQL 15 via pg + kysely185- Jest for tests186 187## Conventions188- All endpoints return `{ data, error }` envelope189- Database migrations in `db/migrations/`190- Feature modules in `src/features/<name>/`191```192 193### 3. Initialize194 195```bash196/gsd init197```198 199### 4. Run200 201```bash202/gsd auto203```204 205---206 207## The Auto-Mode State Machine208 209```210Research → Plan → Execute (per task) → Complete → Reassess → Next Slice211```212 213Each phase runs in a **fresh session** with context pre-inlined into the dispatch prompt:214 215| Phase | What the LLM receives | What it produces |216|---|---|---|217| Research | PROJECT.md, ROADMAP.md, slice description, codebase index | RESEARCH.md with findings, gotchas, relevant files |218| Plan | Research output, slice description, must-haves | PLAN.md with task breakdown, verification steps |219| Execute (task N) | Task plan, prior task summaries, dependency summaries, DECISIONS.md | Working code committed to git |220| Complete | All task summaries, slice plan | SUMMARY.md, UAT script, updated ROADMAP.md |221| Reassess | Completed slice summary, full ROADMAP.md | Updated roadmap with any corrections |222 223---224 225## Must-Haves: Mechanically Verifiable Outcomes226 227Every task plan includes must-haves — explicit, checkable criteria the LLM uses to confirm completion. Write them as shell commands or file existence checks:228 229```markdown230## Must-Haves231 232- [ ] `npm test -- --testPathPattern=auth` passes with 0 failures233- [ ] File `src/features/auth/jwt.ts` exists and exports `signToken`, `verifyToken`234- [ ] `curl -X POST http://localhost:3000/auth/login` returns 200 with `{ data: { token } }`235- [ ] No TypeScript errors: `npx tsc --noEmit` exits 0236```237 238The execute phase ends only when the LLM can check off every must-have.239 240---241 242## Git Strategy243 244GSD manages git automatically in auto mode:245 246```247main248 └── milestone/M1          ← worktree branch created at start249      ├── commit: [M1/S1/T1] implement user model250      ├── commit: [M1/S1/T2] add migrations251      ├── commit: [M1/S1] slice complete252      ├── commit: [M1/S2/T1] POST /users endpoint253      └── ...254 255 After milestone complete:256main ← squash merge of milestone/M1 as "[M1] Auth system"257```258 259Each task commits with a structured message. Each slice commits a summary commit. The milestone squash-merges to main as one clean entry.260 261---262 263## Crash Recovery264 265GSD writes a lock file at `.gsd/LOCK` when a unit starts and removes it on clean completion. If the process dies:266 267```bash268# Next run detects the lock and auto-recovers:269/gsd auto270 271# Output:272# ⚠ Lock file found: M1/S3/T2 was interrupted273# Synthesizing recovery briefing from session artifacts...274# Resuming with full context275```276 277The recovery briefing is synthesized from every tool call that reached disk — file writes, shell output, partial completions — so the resumed session has context continuity.278 279---280 281## Cost Controls282 283Set a budget ceiling to pause auto mode before overspending:284 285```bash286/gsd auto --budget 10.00287```288 289The cost ledger at `.gsd/costs/ledger.json`:290 291```json292{293  "units": [294    {295      "id": "M1/S1/research",296      "model": "claude-opus-4",297      "inputTokens": 12400,298      "outputTokens": 3200,299      "costUsd": 0.21,300      "completedAt": "2025-01-15T10:23:44Z"301    }302  ],303  "totalCostUsd": 1.84,304  "budgetUsd": 10.00305}306```307 308---309 310## Decisions Register311 312`.gsd/DECISIONS.md` is auto-injected into every task dispatch. Record architectural decisions here and the LLM will respect them across all future sessions:313 314```markdown315# Decisions Register316 317## D1: Use kysely not prisma318**Date:** 2025-01-14319**Reason:** Better TypeScript inference, no code generation step needed.320**Impact:** All DB queries use kysely QueryBuilder syntax.321 322## D2: JWT in httpOnly cookie, not Authorization header323**Date:** 2025-01-14  324**Reason:** Better XSS protection for the web client.325**Impact:** Auth middleware reads `req.cookies.token`.326```327 328---329 330## Stuck Detection331 332If the same unit dispatches twice without producing its expected artifact, GSD:333 3341. Retries once with a deep diagnostic prompt that includes what was expected vs. what exists on disk3352. If the second attempt fails, **stops auto mode** and reports:336 337```338✗ Stuck on M1/S3/T1 after 2 attempts339Expected: src/features/auth/jwt.ts (not found)340Last session: .gsd/sessions/M1-S3-T1-attempt2.log341Run `/gsd run --task M1/S3/T1` to retry manually342```343 344---345 346## Skills Integration347 348GSD supports auto-detecting and installing relevant skills during the research phase. Create `SKILLS.md` in your project:349 350```markdown351# Project Skills352 353- name: postgres-kysely354- name: express-typescript  355- name: jest-testing356```357 358Skills are injected into the research and plan dispatch prompts, giving the LLM curated knowledge about your exact stack without burning context on irrelevant docs.359 360---361 362## Timeout Supervision363 364Three timeout tiers prevent runaway sessions:365 366| Timeout | Default | Behavior |367|---|---|---|368| Soft | 8 min | Sends "please wrap up" steering message |369| Idle | 3 min no tool calls | Sends "are you stuck?" recovery prompt |370| Hard | 15 min | Pauses auto mode, preserves all disk state |371 372Configure in `.gsd/config.json`:373 374```json375{376  "timeouts": {377    "softMinutes": 8,378    "idleMinutes": 3,379    "hardMinutes": 15380  },381  "defaultModel": "claude-opus-4",382  "researchModel": "claude-sonnet-4"383}384```385 386---387 388## TypeScript Integration (Pi SDK)389 390GSD is built on the [Pi SDK](https://github.com/badlogic/pi-mono). You can extend it programmatically:391 392```typescript393import { GSDProject, AutoRunner } from 'gsd-pi';394 395const project = await GSDProject.load('/path/to/project');396 397// Check current state398const state = await project.getState();399console.log(state.currentMilestone, state.currentSlice);400 401// Run a single slice programmatically402const runner = new AutoRunner(project, {403  budget: 5.00,404  onUnitComplete: (unit, cost) => {405    console.log(`Completed ${unit.id}, cost: $${cost.toFixed(3)}`);406  },407  onStuck: (unit, attempts) => {408    console.error(`Stuck on ${unit.id} after ${attempts} attempts`);409    process.exit(1);410  }411});412 413await runner.runSlice('M1/S4');414```415 416---417 418## Custom Dispatch Hooks419 420Inject custom context into any dispatch prompt:421 422```typescript423// .gsd/hooks.ts424import type { DispatchHook } from 'gsd-pi';425 426export const beforeTaskDispatch: DispatchHook = async (ctx) => {427  // Append custom context to every task dispatch428  return {429    ...ctx,430    extraContext: `431## Live API Docs432${await fetchInternalAPIDocs()}433    `434  };435};436```437 438Register in `.gsd/config.json`:439 440```json441{442  "hooks": "./hooks.ts"443}444```445 446---447 448## Roadmap Reassessment449 450After each slice completes, GSD runs a reassessment pass that may:451 452- Re-order upcoming slices based on discovered dependencies453- Split a slice that turned out larger than expected454- Mark a slice as no longer needed455- Add a new slice for discovered work456 457The LLM edits `ROADMAP.md` in place. You can review diffs with:458 459```bash460git diff ROADMAP.md461```462 463To disable reassessment:464 465```json466{467  "reassessment": false468}469```470 471---472 473## Troubleshooting474 475### Auto mode stops immediately with "no pending slices"476All slices in `ROADMAP.md` are marked `[x]`. Reset a slice: remove `[x]` from its entry and delete `.gsd/milestones/M1/slices/S3/SUMMARY.md`.477 478### LLM keeps failing must-haves479Check `.gsd/sessions/` for the last session log. Common causes: must-have references wrong file path, or test command needs environment variable. Adjust must-haves in the task's `PLAN.md` and re-run with `/gsd run --task M1/S3/T2`.480 481### Cost ceiling hit unexpectedly482The research phase on large codebases can be expensive. Set `researchModel` to a cheaper model in config, or reduce codebase index depth.483 484### Lock file left after clean exit485```bash486rm .gsd/LOCK487/gsd auto488```489 490### Git worktree conflicts491```bash492git worktree list          # see active worktrees493git worktree remove .gsd/worktrees/M1 --force494/gsd auto                  # recreates cleanly495```496 497### Session file too large for recovery498If `.gsd/sessions/` grows large, GSD compresses sessions older than 24h automatically. Manual cleanup:499```bash500/gsd cleanup --sessions --older-than 7d501```502 503---504 505## Links506 507- [GitHub: gsd-build/GSD-2](https://github.com/gsd-build/GSD-2)508- [npm: gsd-pi](https://www.npmjs.com/package/gsd-pi)509- [Pi SDK](https://github.com/badlogic/pi-mono)510- [Original GSD v1](https://github.com/gsd-build/get-shit-done)
Related skills
Agency Agents Ai Specialists

Install Agency Agents Ai Specialists skill for Claude Code from aradotso/trending-skills.
Agent Browser Automation

Install Agent Browser Automation skill for Claude Code from aradotso/trending-skills.
Antigravity Manager

Install Antigravity Manager skill for Claude Code from aradotso/trending-skills.