Name: Systematic Debugging
Author: Obra

Install

Terminal · npx

$npx skills add https://github.com/obra/superpowers --skill systematic-debugging

Works with Paperclip

How Systematic Debugging fits into a Paperclip company.

Systematic Debugging drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

SaaS FactoryPaired

Pre-configured AI company — 18 agents, 18 skills, one-time purchase.

$27$59

Explore pack

Source file

SKILL.md296 linesmarkdown

Expand

1---2name: systematic-debugging3description: Use when encountering any bug, test failure, or unexpected behavior, before proposing fixes4---5 6# Systematic Debugging7 8## Overview9 10Random fixes waste time and create new bugs. Quick patches mask underlying issues.11 12**Core principle:** ALWAYS find root cause before attempting fixes. Symptom fixes are failure.13 14**Violating the letter of this process is violating the spirit of debugging.**15 16## The Iron Law17 18```19NO FIXES WITHOUT ROOT CAUSE INVESTIGATION FIRST20```21 22If you haven't completed Phase 1, you cannot propose fixes.23 24## When to Use25 26Use for ANY technical issue:27- Test failures28- Bugs in production29- Unexpected behavior30- Performance problems31- Build failures32- Integration issues33 34**Use this ESPECIALLY when:**35- Under time pressure (emergencies make guessing tempting)36- "Just one quick fix" seems obvious37- You've already tried multiple fixes38- Previous fix didn't work39- You don't fully understand the issue40 41**Don't skip when:**42- Issue seems simple (simple bugs have root causes too)43- You're in a hurry (rushing guarantees rework)44- Manager wants it fixed NOW (systematic is faster than thrashing)45 46## The Four Phases47 48You MUST complete each phase before proceeding to the next.49 50### Phase 1: Root Cause Investigation51 52**BEFORE attempting ANY fix:**53 541. **Read Error Messages Carefully**55   - Don't skip past errors or warnings56   - They often contain the exact solution57   - Read stack traces completely58   - Note line numbers, file paths, error codes59 602. **Reproduce Consistently**61   - Can you trigger it reliably?62   - What are the exact steps?63   - Does it happen every time?64   - If not reproducible → gather more data, don't guess65 663. **Check Recent Changes**67   - What changed that could cause this?68   - Git diff, recent commits69   - New dependencies, config changes70   - Environmental differences71 724. **Gather Evidence in Multi-Component Systems**73 74   **WHEN system has multiple components (CI → build → signing, API → service → database):**75 76   **BEFORE proposing fixes, add diagnostic instrumentation:**77   ```78   For EACH component boundary:79     - Log what data enters component80     - Log what data exits component81     - Verify environment/config propagation82     - Check state at each layer83 84   Run once to gather evidence showing WHERE it breaks85   THEN analyze evidence to identify failing component86   THEN investigate that specific component87   ```88 89   **Example (multi-layer system):**90   ```bash91   # Layer 1: Workflow92   echo "=== Secrets available in workflow: ==="93   echo "IDENTITY: ${IDENTITY:+SET}${IDENTITY:-UNSET}"94 95   # Layer 2: Build script96   echo "=== Env vars in build script: ==="97   env | grep IDENTITY || echo "IDENTITY not in environment"98 99   # Layer 3: Signing script100   echo "=== Keychain state: ==="101   security list-keychains102   security find-identity -v103 104   # Layer 4: Actual signing105   codesign --sign "$IDENTITY" --verbose=4 "$APP"106   ```107 108   **This reveals:** Which layer fails (secrets → workflow ✓, workflow → build ✗)109 1105. **Trace Data Flow**111 112   **WHEN error is deep in call stack:**113 114   See `root-cause-tracing.md` in this directory for the complete backward tracing technique.115 116   **Quick version:**117   - Where does bad value originate?118   - What called this with bad value?119   - Keep tracing up until you find the source120   - Fix at source, not at symptom121 122### Phase 2: Pattern Analysis123 124**Find the pattern before fixing:**125 1261. **Find Working Examples**127   - Locate similar working code in same codebase128   - What works that's similar to what's broken?129 1302. **Compare Against References**131   - If implementing pattern, read reference implementation COMPLETELY132   - Don't skim - read every line133   - Understand the pattern fully before applying134 1353. **Identify Differences**136   - What's different between working and broken?137   - List every difference, however small138   - Don't assume "that can't matter"139 1404. **Understand Dependencies**141   - What other components does this need?142   - What settings, config, environment?143   - What assumptions does it make?144 145### Phase 3: Hypothesis and Testing146 147**Scientific method:**148 1491. **Form Single Hypothesis**150   - State clearly: "I think X is the root cause because Y"151   - Write it down152   - Be specific, not vague153 1542. **Test Minimally**155   - Make the SMALLEST possible change to test hypothesis156   - One variable at a time157   - Don't fix multiple things at once158 1593. **Verify Before Continuing**160   - Did it work? Yes → Phase 4161   - Didn't work? Form NEW hypothesis162   - DON'T add more fixes on top163 1644. **When You Don't Know**165   - Say "I don't understand X"166   - Don't pretend to know167   - Ask for help168   - Research more169 170### Phase 4: Implementation171 172**Fix the root cause, not the symptom:**173 1741. **Create Failing Test Case**175   - Simplest possible reproduction176   - Automated test if possible177   - One-off test script if no framework178   - MUST have before fixing179   - Use the `superpowers:test-driven-development` skill for writing proper failing tests180 1812. **Implement Single Fix**182   - Address the root cause identified183   - ONE change at a time184   - No "while I'm here" improvements185   - No bundled refactoring186 1873. **Verify Fix**188   - Test passes now?189   - No other tests broken?190   - Issue actually resolved?191 1924. **If Fix Doesn't Work**193   - STOP194   - Count: How many fixes have you tried?195   - If < 3: Return to Phase 1, re-analyze with new information196   - **If ≥ 3: STOP and question the architecture (step 5 below)**197   - DON'T attempt Fix #4 without architectural discussion198 1995. **If 3+ Fixes Failed: Question Architecture**200 201   **Pattern indicating architectural problem:**202   - Each fix reveals new shared state/coupling/problem in different place203   - Fixes require "massive refactoring" to implement204   - Each fix creates new symptoms elsewhere205 206   **STOP and question fundamentals:**207   - Is this pattern fundamentally sound?208   - Are we "sticking with it through sheer inertia"?209   - Should we refactor architecture vs. continue fixing symptoms?210 211   **Discuss with your human partner before attempting more fixes**212 213   This is NOT a failed hypothesis - this is a wrong architecture.214 215## Red Flags - STOP and Follow Process216 217If you catch yourself thinking:218- "Quick fix for now, investigate later"219- "Just try changing X and see if it works"220- "Add multiple changes, run tests"221- "Skip the test, I'll manually verify"222- "It's probably X, let me fix that"223- "I don't fully understand but this might work"224- "Pattern says X but I'll adapt it differently"225- "Here are the main problems: [lists fixes without investigation]"226- Proposing solutions before tracing data flow227- **"One more fix attempt" (when already tried 2+)**228- **Each fix reveals new problem in different place**229 230**ALL of these mean: STOP. Return to Phase 1.**231 232**If 3+ fixes failed:** Question the architecture (see Phase 4.5)233 234## your human partner's Signals You're Doing It Wrong235 236**Watch for these redirections:**237- "Is that not happening?" - You assumed without verifying238- "Will it show us...?" - You should have added evidence gathering239- "Stop guessing" - You're proposing fixes without understanding240- "Ultrathink this" - Question fundamentals, not just symptoms241- "We're stuck?" (frustrated) - Your approach isn't working242 243**When you see these:** STOP. Return to Phase 1.244 245## Common Rationalizations246 247| Excuse | Reality |248|--------|---------|249| "Issue is simple, don't need process" | Simple issues have root causes too. Process is fast for simple bugs. |250| "Emergency, no time for process" | Systematic debugging is FASTER than guess-and-check thrashing. |251| "Just try this first, then investigate" | First fix sets the pattern. Do it right from the start. |252| "I'll write test after confirming fix works" | Untested fixes don't stick. Test first proves it. |253| "Multiple fixes at once saves time" | Can't isolate what worked. Causes new bugs. |254| "Reference too long, I'll adapt the pattern" | Partial understanding guarantees bugs. Read it completely. |255| "I see the problem, let me fix it" | Seeing symptoms ≠ understanding root cause. |256| "One more fix attempt" (after 2+ failures) | 3+ failures = architectural problem. Question pattern, don't fix again. |257 258## Quick Reference259 260| Phase | Key Activities | Success Criteria |261|-------|---------------|------------------|262| **1. Root Cause** | Read errors, reproduce, check changes, gather evidence | Understand WHAT and WHY |263| **2. Pattern** | Find working examples, compare | Identify differences |264| **3. Hypothesis** | Form theory, test minimally | Confirmed or new hypothesis |265| **4. Implementation** | Create test, fix, verify | Bug resolved, tests pass |266 267## When Process Reveals "No Root Cause"268 269If systematic investigation reveals issue is truly environmental, timing-dependent, or external:270 2711. You've completed the process2722. Document what you investigated2733. Implement appropriate handling (retry, timeout, error message)2744. Add monitoring/logging for future investigation275 276**But:** 95% of "no root cause" cases are incomplete investigation.277 278## Supporting Techniques279 280These techniques are part of systematic debugging and available in this directory:281 282- **`root-cause-tracing.md`** - Trace bugs backward through call stack to find original trigger283- **`defense-in-depth.md`** - Add validation at multiple layers after finding root cause284- **`condition-based-waiting.md`** - Replace arbitrary timeouts with condition polling285 286**Related skills:**287- **superpowers:test-driven-development** - For creating failing test case (Phase 4, Step 1)288- **superpowers:verification-before-completion** - Verify fix worked before claiming success289 290## Real-World Impact291 292From debugging sessions:293- Systematic approach: 15-30 minutes to fix294- Random fixes approach: 2-3 hours of thrashing295- First-time fix rate: 95% vs 40%296- New bugs introduced: Near zero vs common

Related skills

Brainstorming

Forces Claude to actually think before coding, which frankly should be the default but isn't. Blocks all implementation until you've walked through context expl

Dispatching Parallel Agents

When you've got multiple independent failures across different test files or subsystems, this cuts debugging time by running separate agents in parallel instead

Executing Plans

Takes a written implementation plan and executes it step by step with proper checkpoints. Loads the plan file, reviews it critically for gaps or unclear instruc