How Grafana Dashboards fits into a Paperclip company.

Grafana Dashboards drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

SaaS FactoryPaired

Pre-configured AI company — 18 agents, 18 skills, one-time purchase.

$27$59

Explore pack

Source file

SKILL.md382 linesmarkdown

Expand

1---2name: grafana-dashboards3description: Create and manage production Grafana dashboards for real-time visualization of system and application metrics. Use when building monitoring dashboards, visualizing metrics, or creating operational observability interfaces.4---5 6# Grafana Dashboards7 8Create and manage production-ready Grafana dashboards for comprehensive system observability.9 10## Purpose11 12Design effective Grafana dashboards for monitoring applications, infrastructure, and business metrics.13 14## When to Use15 16- Visualize Prometheus metrics17- Create custom dashboards18- Implement SLO dashboards19- Monitor infrastructure20- Track business KPIs21 22## Dashboard Design Principles23 24### 1. Hierarchy of Information25 26```27┌─────────────────────────────────────┐28│  Critical Metrics (Big Numbers)     │29├─────────────────────────────────────┤30│  Key Trends (Time Series)           │31├─────────────────────────────────────┤32│  Detailed Metrics (Tables/Heatmaps) │33└─────────────────────────────────────┘34```35 36### 2. RED Method (Services)37 38- **Rate** - Requests per second39- **Errors** - Error rate40- **Duration** - Latency/response time41 42### 3. USE Method (Resources)43 44- **Utilization** - % time resource is busy45- **Saturation** - Queue length/wait time46- **Errors** - Error count47 48## Dashboard Structure49 50### API Monitoring Dashboard51 52```json53{54  "dashboard": {55    "title": "API Monitoring",56    "tags": ["api", "production"],57    "timezone": "browser",58    "refresh": "30s",59    "panels": [60      {61        "title": "Request Rate",62        "type": "graph",63        "targets": [64          {65            "expr": "sum(rate(http_requests_total[5m])) by (service)",66            "legendFormat": "{{service}}"67          }68        ],69        "gridPos": { "x": 0, "y": 0, "w": 12, "h": 8 }70      },71      {72        "title": "Error Rate %",73        "type": "graph",74        "targets": [75          {76            "expr": "(sum(rate(http_requests_total{status=~\"5..\"}[5m])) / sum(rate(http_requests_total[5m]))) * 100",77            "legendFormat": "Error Rate"78          }79        ],80        "alert": {81          "conditions": [82            {83              "evaluator": { "params": [5], "type": "gt" },84              "operator": { "type": "and" },85              "query": { "params": ["A", "5m", "now"] },86              "type": "query"87            }88          ]89        },90        "gridPos": { "x": 12, "y": 0, "w": 12, "h": 8 }91      },92      {93        "title": "P95 Latency",94        "type": "graph",95        "targets": [96          {97            "expr": "histogram_quantile(0.95, sum(rate(http_request_duration_seconds_bucket[5m])) by (le, service))",98            "legendFormat": "{{service}}"99          }100        ],101        "gridPos": { "x": 0, "y": 8, "w": 24, "h": 8 }102      }103    ]104  }105}106```107 108**Reference:** See `assets/api-dashboard.json`109 110## Panel Types111 112### 1. Stat Panel (Single Value)113 114```json115{116  "type": "stat",117  "title": "Total Requests",118  "targets": [119    {120      "expr": "sum(http_requests_total)"121    }122  ],123  "options": {124    "reduceOptions": {125      "values": false,126      "calcs": ["lastNotNull"]127    },128    "orientation": "auto",129    "textMode": "auto",130    "colorMode": "value"131  },132  "fieldConfig": {133    "defaults": {134      "thresholds": {135        "mode": "absolute",136        "steps": [137          { "value": 0, "color": "green" },138          { "value": 80, "color": "yellow" },139          { "value": 90, "color": "red" }140        ]141      }142    }143  }144}145```146 147### 2. Time Series Graph148 149```json150{151  "type": "graph",152  "title": "CPU Usage",153  "targets": [154    {155      "expr": "100 - (avg by (instance) (rate(node_cpu_seconds_total{mode=\"idle\"}[5m])) * 100)"156    }157  ],158  "yaxes": [159    { "format": "percent", "max": 100, "min": 0 },160    { "format": "short" }161  ]162}163```164 165### 3. Table Panel166 167```json168{169  "type": "table",170  "title": "Service Status",171  "targets": [172    {173      "expr": "up",174      "format": "table",175      "instant": true176    }177  ],178  "transformations": [179    {180      "id": "organize",181      "options": {182        "excludeByName": { "Time": true },183        "indexByName": {},184        "renameByName": {185          "instance": "Instance",186          "job": "Service",187          "Value": "Status"188        }189      }190    }191  ]192}193```194 195### 4. Heatmap196 197```json198{199  "type": "heatmap",200  "title": "Latency Heatmap",201  "targets": [202    {203      "expr": "sum(rate(http_request_duration_seconds_bucket[5m])) by (le)",204      "format": "heatmap"205    }206  ],207  "dataFormat": "tsbuckets",208  "yAxis": {209    "format": "s"210  }211}212```213 214## Variables215 216### Query Variables217 218```json219{220  "templating": {221    "list": [222      {223        "name": "namespace",224        "type": "query",225        "datasource": "Prometheus",226        "query": "label_values(kube_pod_info, namespace)",227        "refresh": 1,228        "multi": false229      },230      {231        "name": "service",232        "type": "query",233        "datasource": "Prometheus",234        "query": "label_values(kube_service_info{namespace=\"$namespace\"}, service)",235        "refresh": 1,236        "multi": true237      }238    ]239  }240}241```242 243### Use Variables in Queries244 245```246sum(rate(http_requests_total{namespace="$namespace", service=~"$service"}[5m]))247```248 249## Alerts in Dashboards250 251```json252{253  "alert": {254    "name": "High Error Rate",255    "conditions": [256      {257        "evaluator": {258          "params": [5],259          "type": "gt"260        },261        "operator": { "type": "and" },262        "query": {263          "params": ["A", "5m", "now"]264        },265        "reducer": { "type": "avg" },266        "type": "query"267      }268    ],269    "executionErrorState": "alerting",270    "for": "5m",271    "frequency": "1m",272    "message": "Error rate is above 5%",273    "noDataState": "no_data",274    "notifications": [{ "uid": "slack-channel" }]275  }276}277```278 279## Dashboard Provisioning280 281**dashboards.yml:**282 283```yaml284apiVersion: 1285 286providers:287  - name: "default"288    orgId: 1289    folder: "General"290    type: file291    disableDeletion: false292    updateIntervalSeconds: 10293    allowUiUpdates: true294    options:295      path: /etc/grafana/dashboards296```297 298## Common Dashboard Patterns299 300### Infrastructure Dashboard301 302**Key Panels:**303 304- CPU utilization per node305- Memory usage per node306- Disk I/O307- Network traffic308- Pod count by namespace309- Node status310 311**Reference:** See `assets/infrastructure-dashboard.json`312 313### Database Dashboard314 315**Key Panels:**316 317- Queries per second318- Connection pool usage319- Query latency (P50, P95, P99)320- Active connections321- Database size322- Replication lag323- Slow queries324 325**Reference:** See `assets/database-dashboard.json`326 327### Application Dashboard328 329**Key Panels:**330 331- Request rate332- Error rate333- Response time (percentiles)334- Active users/sessions335- Cache hit rate336- Queue length337 338## Best Practices339 3401. **Start with templates** (Grafana community dashboards)3412. **Use consistent naming** for panels and variables3423. **Group related metrics** in rows3434. **Set appropriate time ranges** (default: Last 6 hours)3445. **Use variables** for flexibility3456. **Add panel descriptions** for context3467. **Configure units** correctly3478. **Set meaningful thresholds** for colors3489. **Use consistent colors** across dashboards34910. **Test with different time ranges**350 351## Dashboard as Code352 353### Terraform Provisioning354 355```hcl356resource "grafana_dashboard" "api_monitoring" {357  config_json = file("${path.module}/dashboards/api-monitoring.json")358  folder      = grafana_folder.monitoring.id359}360 361resource "grafana_folder" "monitoring" {362  title = "Production Monitoring"363}364```365 366### Ansible Provisioning367 368```yaml369- name: Deploy Grafana dashboards370  copy:371    src: "{{ item }}"372    dest: /etc/grafana/dashboards/373  with_fileglob:374    - "dashboards/*.json"375  notify: restart grafana376```377 378 379## Related Skills380 381- `prometheus-configuration` - For metric collection382- `slo-implementation` - For SLO dashboards

Related skills

Accessibility Compliance

This walks you through implementing proper WCAG 2.2 compliance with real code patterns for screen readers, keyboard navigation, and mobile accessibility. It cov

Airflow Dag Patterns

If you're building data pipelines with Airflow, this skill gives you production-ready DAG patterns that actually work in the real world. It covers TaskFlow API

Angular Migration

Migrating from AngularJS to Angular is notoriously painful, and this skill tackles the practical stuff that makes or breaks these projects. It covers hybrid app