Install
Terminal · npx

npx skills add https://github.com/obra/superpowers --skill brainstorming

Works with Paperclip
How Validate Data fits into a Paperclip company.
Validate Data drops into any Paperclip agent that handles data-analysis work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat: no prompt engineering, no tool wiring.
SaaS Factory · Paired
Pre-configured AI company — 18 agents, 18 skills, one-time purchase.
$27 (was $59)
Explore pack · Source file

SKILL.md (383 lines)
---
name: validate-data
description: QA an analysis before sharing -- methodology, accuracy, and bias checks. Use when reviewing an analysis before a stakeholder presentation, spot-checking calculations and aggregation logic, verifying a SQL query's results look right, or assessing whether conclusions are actually supported by the data.
argument-hint: "<analysis to review>"
---

# /validate-data - Validate Analysis Before Sharing

> If you see unfamiliar placeholders or need to check which tools are connected, see [CONNECTORS.md](../../CONNECTORS.md).

Review an analysis for accuracy, methodology, and potential biases before sharing with stakeholders. Generates a confidence assessment and improvement suggestions.

## Usage

```
/validate-data <analysis to review>
```

The analysis can be:

- A document or report in the conversation
- A file (markdown, notebook, spreadsheet)
- SQL queries and their results
- Charts and their underlying data
- A description of methodology and findings

## Workflow

### 1. Review Methodology and Assumptions

Examine:

- **Question framing**: Is the analysis answering the right question? Could the question be interpreted differently?
- **Data selection**: Are the right tables/datasets being used? Is the time range appropriate?
- **Population definition**: Is the analysis population correctly defined? Are there unintended exclusions?
- **Metric definitions**: Are metrics defined clearly and consistently? Do they match how stakeholders understand them?
- **Baseline and comparison**: Is the comparison fair? Are time periods, cohort sizes, and contexts comparable?

### 2. Run the Pre-Delivery QA Checklist

Work through the checklist below -- data quality, calculation, reasonableness, and presentation checks.

### 3. Check for Common Analytical Pitfalls

Systematically review against the detailed pitfall catalog below (join explosion, survivorship bias, incomplete period comparison, denominator shifting, average of averages, timezone mismatches, selection bias).

### 4. Verify Calculations and Aggregations

Where possible, spot-check:

- Recalculate a few key numbers independently
- Verify that subtotals sum to totals
- Check that percentages sum to 100% (or close to it) where expected
- Confirm that YoY/MoM comparisons use the correct base periods
- Validate that filters are applied consistently across all metrics

Apply the result sanity-checking techniques below (magnitude checks, cross-validation, red-flag detection).
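As an illustration, the first two spot-checks can be mechanized. A minimal pandas sketch, where the DataFrame, column names, and the quoted total are hypothetical assumptions rather than part of the skill:

```python
import pandas as pd

# Hypothetical revenue-by-segment extract; column names and the
# reported total are illustrative assumptions.
df = pd.DataFrame({
    "segment": ["A", "A", "B", "B", "C"],
    "revenue": [120.0, 80.0, 300.0, 100.0, 400.0],
})

# Subtotals should reconcile to the total quoted in the analysis.
subtotals = df.groupby("segment")["revenue"].sum()
reported_total = 1_000.0
assert abs(subtotals.sum() - reported_total) < 1e-6, (
    f"subtotals sum to {subtotals.sum()}, analysis claims {reported_total}"
)

# Segment shares should sum to ~100% when segments are exhaustive
# and non-overlapping.
shares = subtotals / subtotals.sum() * 100
assert abs(shares.sum() - 100) < 0.01, f"shares sum to {shares.sum():.2f}%"
print("spot-checks passed")
```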
### 5. Assess Visualizations

If the analysis includes charts:

- Do axes start at appropriate values (zero for bar charts)?
- Are scales consistent across comparison charts?
- Do chart titles accurately describe what's shown?
- Could the visualization mislead a quick reader?
- Are there truncated axes, inconsistent intervals, or 3D effects that distort perception?

### 6. Evaluate Narrative and Conclusions

Review whether:

- Conclusions are supported by the data shown
- Alternative explanations are acknowledged
- Uncertainty is communicated appropriately
- Recommendations follow logically from findings
- The level of confidence matches the strength of evidence

### 7. Suggest Improvements

Provide specific, actionable suggestions:

- Additional analyses that would strengthen the conclusions
- Caveats or limitations that should be noted
- Better visualizations or framings for key points
- Missing context that stakeholders would want

### 8. Generate Confidence Assessment

Rate the analysis on a 3-level scale:

**Ready to share** -- Analysis is methodologically sound, calculations verified, caveats noted. Minor suggestions for improvement but nothing blocking.

**Share with noted caveats** -- Analysis is largely correct but has specific limitations or assumptions that must be communicated to stakeholders. List the required caveats.

**Needs revision** -- Found specific errors, methodological issues, or missing analyses that should be addressed before sharing. List the required changes in priority order.

## Output Format

```
## Validation Report

### Overall Assessment: [Ready to share | Share with caveats | Needs revision]

### Methodology Review
[Findings about approach, data selection, definitions]

### Issues Found
1. [Severity: High/Medium/Low] [Issue description and impact]
2. ...

### Calculation Spot-Checks
- [Metric]: [Verified / Discrepancy found]
- ...

### Visualization Review
[Any issues with charts or visual presentation]

### Suggested Improvements
1. [Improvement and why it matters]
2. ...

### Required Caveats for Stakeholders
- [Caveat that must be communicated]
- ...
```

---

## Pre-Delivery QA Checklist

Run through this checklist before sharing any analysis with stakeholders.

### Data Quality Checks

- [ ] **Source verification**: Confirmed which tables/data sources were used. Are they the right ones for this question?
- [ ] **Freshness**: Data is current enough for the analysis. Noted the "as of" date.
- [ ] **Completeness**: No unexpected gaps in time series or missing segments.
- [ ] **Null handling**: Checked null rates in key columns. Nulls are handled appropriately (excluded, imputed, or flagged).
- [ ] **Deduplication**: Confirmed no double-counting from bad joins or duplicate source records.
- [ ] **Filter verification**: All WHERE clauses and filters are correct. No unintended exclusions.

### Calculation Checks

- [ ] **Aggregation logic**: GROUP BY includes all non-aggregated columns. Aggregation level matches the analysis grain.
- [ ] **Denominator correctness**: Rate and percentage calculations use the right denominator. Denominators are non-zero.
- [ ] **Date alignment**: Comparisons use the same time period length. Partial periods are excluded or noted.
- [ ] **Join correctness**: JOIN types are appropriate (INNER vs LEFT). Many-to-many joins haven't inflated counts.
- [ ] **Metric definitions**: Metrics match how stakeholders define them. Any deviations are noted.
- [ ] **Subtotals sum**: Parts add up to the whole where expected. If they don't, explain why (e.g., overlap).

### Reasonableness Checks

- [ ] **Magnitude**: Numbers are in a plausible range. Revenue isn't negative. Percentages are between 0-100%.
- [ ] **Trend continuity**: No unexplained jumps or drops in time series.
- [ ] **Cross-reference**: Key numbers match other known sources (dashboards, previous reports, finance data).
- [ ] **Order of magnitude**: Total revenue is in the right ballpark. User counts match known figures.
- [ ] **Edge cases**: What happens at the boundaries? Empty segments, zero-activity periods, new entities.

### Presentation Checks

- [ ] **Chart accuracy**: Bar charts start at zero. Axes are labeled. Scales are consistent across panels.
- [ ] **Number formatting**: Appropriate precision. Consistent currency/percentage formatting. Thousands separators where needed.
- [ ] **Title clarity**: Titles state the insight, not just the metric. Date ranges are specified.
- [ ] **Caveat transparency**: Known limitations and assumptions are stated explicitly.
- [ ] **Reproducibility**: Someone else could recreate this analysis from the documentation provided.
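Parts of this checklist lend themselves to automation. A minimal pandas sketch covering the mechanical items (null handling, deduplication, magnitude); the schema and column names (`user_id`, `conversion_pct`) are hypothetical:

```python
import pandas as pd

def qa_report(df: pd.DataFrame, key: str, rate_cols: list[str]) -> dict:
    """Mechanical checks from the checklist: null rates, duplicate
    keys, and percentage columns expected to lie in [0, 100]."""
    return {
        "null_rates": df.isna().mean().round(3).to_dict(),
        "duplicate_keys": int(df[key].duplicated().sum()),
        "rates_out_of_range": {
            col: int(((df[col] < 0) | (df[col] > 100)).sum())
            for col in rate_cols
        },
    }

# Hypothetical extract: user 2 is duplicated and has a rate above
# 100%, and user 3 has a null -- all three should surface.
users = pd.DataFrame({
    "user_id": [1, 2, 2, 3],
    "conversion_pct": [4.2, 105.0, 3.1, None],
})
print(qa_report(users, key="user_id", rate_cols=["conversion_pct"]))
```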
## Common Data Analysis Pitfalls

### Join Explosion

**The problem**: A many-to-many join silently multiplies rows, inflating counts and sums.

**How to detect**:

```sql
-- Check row count before and after join
SELECT COUNT(*) FROM table_a; -- 1,000
SELECT COUNT(*) FROM table_a a JOIN table_b b ON a.id = b.a_id; -- 3,500 (uh oh)
```

**How to prevent**:

- Always check row counts after joins
- If counts increase, investigate the join relationship (is it really 1:1 or 1:many?)
- Use `COUNT(DISTINCT a.id)` instead of `COUNT(*)` when counting entities through joins

### Survivorship Bias

**The problem**: Analyzing only entities that exist today, ignoring those that were deleted, churned, or failed.

**Examples**:

- Analyzing user behavior of "current users" misses churned users
- Looking at "companies using our product" ignores those who evaluated and left
- Studying properties of "successful" outcomes without "unsuccessful" ones

**How to prevent**: Ask "who is NOT in this dataset?" before drawing conclusions.

### Incomplete Period Comparison

**The problem**: Comparing a partial period to a full period.

**Examples**:

- "January revenue is $500K vs. December's $800K" -- but January isn't over yet
- "This week's signups are down" -- checked on Wednesday, comparing to a full prior week

**How to prevent**: Always filter to complete periods, or compare same-day-of-month / same-number-of-days.

### Denominator Shifting

**The problem**: The denominator changes between periods, making rates incomparable.

**Examples**:

- Conversion rate improves because you changed how you count "eligible" users
- Churn rate changes because the definition of "active" was updated

**How to prevent**: Use consistent definitions across all compared periods. Note any definition changes.

### Average of Averages

**The problem**: Averaging pre-computed averages gives wrong results when group sizes differ.

**Example**:

- Group A: 100 users, average revenue $50
- Group B: 10 users, average revenue $200
- Wrong: Average of averages = ($50 + $200) / 2 = $125
- Right: Weighted average = (100*$50 + 10*$200) / 110 = $63.64

**How to prevent**: Always aggregate from raw data. Never average pre-aggregated averages.
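The same example in a few lines of Python, using the numbers above; only the shape of the `groups` structure is an illustrative assumption:

```python
# The pitfall's own numbers: Group A has 100 users averaging $50,
# Group B has 10 users averaging $200.
groups = [
    {"users": 100, "avg_revenue": 50.0},
    {"users": 10, "avg_revenue": 200.0},
]

# Wrong: unweighted mean of pre-computed averages.
wrong = sum(g["avg_revenue"] for g in groups) / len(groups)

# Right: weight each average by its group size, which is equivalent
# to re-aggregating from the raw data.
total_users = sum(g["users"] for g in groups)
right = sum(g["users"] * g["avg_revenue"] for g in groups) / total_users

print(f"average of averages: ${wrong:.2f}")  # $125.00 -- inflated
print(f"weighted average:    ${right:.2f}")  # $63.64
```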
### Timezone Mismatches

**The problem**: Different data sources use different timezones, causing misalignment.

**Examples**:

- Event timestamps in UTC vs. user-facing dates in local time
- Daily rollups that use different cutoff times

**How to prevent**: Standardize all timestamps to a single timezone (UTC recommended) before analysis. Document the timezone used.

### Selection Bias in Segmentation

**The problem**: Segments are defined by the outcome you're measuring, creating circular logic.

**Examples**:

- "Users who completed onboarding have higher retention" -- obviously, they self-selected
- "Power users generate more revenue" -- they became power users BY generating revenue

**How to prevent**: Define segments based on pre-treatment characteristics, not outcomes.

### Other Statistical Traps

- **Simpson's paradox**: Trend reverses when data is aggregated vs. segmented
- **Correlation presented as causation** without supporting evidence
- **Small sample sizes** leading to unreliable conclusions
- **Outliers disproportionately affecting averages** (should medians be used instead?)
- **Multiple testing / cherry-picking** significant results
- **Look-ahead bias**: Using future information to explain past events
- **Cherry-picked time ranges** that favor a particular narrative

## Result Sanity Checking

### Magnitude Checks

For any key number in your analysis, verify it passes the "smell test":

| Metric Type | Sanity Check |
|---|---|
| User counts | Does this match known MAU/DAU figures? |
| Revenue | Is this in the right order of magnitude vs. known ARR? |
| Conversion rates | Is this between 0% and 100%? Does it match dashboard figures? |
| Growth rates | Is 50%+ MoM growth realistic, or is there a data issue? |
| Averages | Is the average reasonable given what you know about the distribution? |
| Percentages | Do segment percentages sum to ~100%? |

### Cross-Validation Techniques

1. **Calculate the same metric two different ways** and verify they match
2. **Spot-check individual records** -- pick a few specific entities and trace their data manually
3. **Compare to known benchmarks** -- match against published dashboards, finance reports, or prior analyses
4. **Reverse engineer** -- if total revenue is X, does per-user revenue times user count approximately equal X?
5. **Boundary checks** -- what happens when you filter to a single day, a single user, or a single category? Are those micro-results sensible?

### Red Flags That Warrant Investigation

- Any metric that changed by more than 50% period-over-period without an obvious cause
- Counts or sums that are exact round numbers (suggests a filter or default value issue)
- Rates exactly at 0% or 100% (may indicate incomplete data)
- Results that perfectly confirm the hypothesis (reality is usually messier)
- Identical values across time periods or segments (suggests the query is ignoring a dimension)
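A minimal pandas sketch of scanning a time series for a few of these red flags; the series, thresholds, and round-number heuristic are illustrative assumptions:

```python
import pandas as pd

def red_flags(monthly: pd.Series) -> list[str]:
    """Scan a monthly metric (indexed by period) for the red flags above."""
    flags = []
    for period, change in monthly.pct_change().abs().dropna().items():
        if change > 0.5:  # >50% period-over-period move
            flags.append(f"{period}: moved {change:.0%} vs. prior period")
    if (monthly % 1_000 == 0).all():  # suspiciously round everywhere
        flags.append("every value is a round thousand (check filters/defaults)")
    if monthly.nunique() == 1:  # identical values across periods
        flags.append("identical values every period (query may ignore a dimension)")
    return flags

# Hypothetical monthly revenue extract.
revenue = pd.Series(
    [820_512, 791_304, 811_776, 1_903_400],
    index=["2024-01", "2024-02", "2024-03", "2024-04"],
)
print(red_flags(revenue))  # flags April's ~134% jump for investigation
```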
## Documentation Standards for Reproducibility

### Analysis Documentation Template

Every non-trivial analysis should include:

```markdown
## Analysis: [Title]

### Question
[The specific question being answered]

### Data Sources
- Table: [schema.table_name] (as of [date])
- Table: [schema.other_table] (as of [date])
- File: [filename] (source: [where it came from])

### Definitions
- [Metric A]: [Exactly how it's calculated]
- [Segment X]: [Exactly how membership is determined]
- [Time period]: [Start date] to [end date], [timezone]

### Methodology
1. [Step 1 of the analysis approach]
2. [Step 2]
3. [Step 3]

### Assumptions and Limitations
- [Assumption 1 and why it's reasonable]
- [Limitation 1 and its potential impact on conclusions]

### Key Findings
1. [Finding 1 with supporting evidence]
2. [Finding 2 with supporting evidence]

### SQL Queries
[All queries used, with comments]

### Caveats
- [Things the reader should know before acting on this]
```

### Code Documentation

For any code (SQL, Python) that may be reused:

```python
"""
Analysis: Monthly Cohort Retention
Author: [Name]
Date: [Date]
Data Source: events table, users table
Last Validated: [Date] -- results matched dashboard within 2%

Purpose: Calculate monthly user retention cohorts based on first
activity date.

Assumptions:
- "Active" means at least one event in the month
- Excludes test/internal accounts (user_type != 'internal')
- Uses UTC dates throughout

Output: Cohort retention matrix with cohort_month rows and
months_since_signup columns. Values are retention rates (0-100%).
"""
```

### Version Control for Analyses

- Save queries and code in version control (git) or a shared docs system
- Note the date of the data snapshot used
- If an analysis is re-run with updated data, document what changed and why
- Link to prior versions of recurring analyses for trend comparison

## Examples

```
/validate-data Review this quarterly revenue analysis before I send it to the exec team: [analysis]
```

```
/validate-data Check my churn analysis -- I'm comparing Q4 churn rates to Q3 but Q4 has a shorter measurement window
```

```
/validate-data Here's a SQL query and its results for our conversion funnel. Does the logic look right? [query + results]
```

## Tips

- Run /validate-data before any high-stakes presentation or decision
- Even quick analyses benefit from a sanity check -- it takes a minute and can save your credibility
- If the validation finds issues, fix them and re-validate
- Share the validation output alongside your analysis to build stakeholder confidence

Related skills
Accessibility Review
Install Accessibility Review skill for Claude Code from anthropics/knowledge-work-plugins.
Account Research
Install Account Research skill for Claude Code from anthropics/knowledge-work-plugins.
Algorithmic Art
When you want to create generative art that's actually algorithmic rather than just randomized shapes, this skill follows a two-step process that works surprisingly well.