Claude Agent Skill · by Affaan M

Videodb

Install the Videodb skill for Claude Code from affaan-m/everything-claude-code.

Works with Paperclip

How Videodb fits into a Paperclip company.

Videodb drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

SaaS Factory (Paired)

Pre-configured AI company — 18 agents, 18 skills, one-time purchase.

$27 (was $59)
Source file: SKILL.md (374 lines)
---
name: videodb
description: See, Understand, Act on video and audio. See- ingest from local files, URLs, RTSP/live feeds, or live record desktop; return realtime context and playable stream links. Understand- extract frames, build visual/semantic/temporal indexes, and search moments with timestamps and auto-clips. Act- transcode and normalize (codec, fps, resolution, aspect ratio), perform timeline edits (subtitles, text/image overlays, branding, audio overlays, dubbing, translation), generate media assets (image, audio, video), and create real time alerts for events from live streams or desktop capture.
origin: ECC
allowed-tools: Read Grep Glob Bash(python:*)
argument-hint: "[task description]"
---

# VideoDB Skill

**Perception + memory + actions for video, live streams, and desktop sessions.**

## When to use

### Desktop Perception

- Start/stop a **desktop session** capturing **screen, mic, and system audio**
- Stream **live context** and store **episodic session memory**
- Run **real-time alerts/triggers** on what's spoken and what's happening on screen
- Produce **session summaries**, a searchable timeline, and **playable evidence links**

### Video ingest + stream

- Ingest a **file or URL** and return a **playable web stream link**
- Transcode/normalize: **codec, bitrate, fps, resolution, aspect ratio**

### Index + search (timestamps + evidence)

- Build **visual**, **spoken**, and **keyword** indexes
- Search and return exact moments with **timestamps** and **playable evidence**
- Auto-create **clips** from search results

### Timeline editing + generation

- Subtitles: **generate**, **translate**, **burn-in**
- Overlays: **text/image/branding**, motion captions
- Audio: **background music**, **voiceover**, **dubbing**
- Programmatic composition and exports via **timeline operations**

### Live streams (RTSP) + monitoring

- Connect **RTSP/live feeds**
- Run **real-time visual and spoken understanding** and emit **events/alerts** for monitoring workflows

## How it works

### Common inputs

- Local **file path**, public **URL**, or **RTSP URL**
- Desktop capture request: **start / stop / summarize session**
- Desired operations: get context for understanding, transcode spec, index spec, search query, clip ranges, timeline edits, alert rules

### Common outputs

- **Stream URL**
- Search results with **timestamps** and **evidence links**
- Generated assets: subtitles, audio, images, clips
- **Event/alert payloads** for live streams
- Desktop **session summaries** and memory entries

### Running Python code

Before running any VideoDB code, change to the project directory and load environment variables:

```python
from dotenv import load_dotenv
load_dotenv(".env")

import videodb
conn = videodb.connect()
```

This reads `VIDEO_DB_API_KEY` from:

1. Environment (if already exported)
2. Project's `.env` file in current directory

If the key is missing, `videodb.connect()` raises `AuthenticationError` automatically.

Do NOT write a script file when a short inline command works. When writing inline Python (`python -c "..."`), always use properly formatted code — use semicolons to separate statements and keep it readable. For anything longer than ~3 statements, use a heredoc instead:

```bash
python << 'EOF'
from dotenv import load_dotenv
load_dotenv(".env")

import videodb
conn = videodb.connect()
coll = conn.get_collection()
print(f"Videos: {len(coll.get_videos())}")
EOF
```

### Setup

When the user asks to "setup videodb" or similar:

### 1. Install SDK

```bash
pip install "videodb[capture]" python-dotenv
```

If `videodb[capture]` fails on Linux, install without the capture extra:

```bash
pip install videodb python-dotenv
```

### 2. Configure API key

The user must set `VIDEO_DB_API_KEY` using **either** method:

- **Export in terminal** (before starting Claude): `export VIDEO_DB_API_KEY=your-key`
- **Project `.env` file**: Save `VIDEO_DB_API_KEY=your-key` in the project's `.env` file

Get a free API key at [console.videodb.io](https://console.videodb.io) (50 free uploads, no credit card).
**Do NOT** read, write, or handle the API key yourself. Always let the user set it.

### Quick Reference

### Upload media

```python
# URL
video = coll.upload(url="https://example.com/video.mp4")

# YouTube
video = coll.upload(url="https://www.youtube.com/watch?v=VIDEO_ID")

# Local file
video = coll.upload(file_path="/path/to/video.mp4")
```

### Transcript + subtitle

```python
# force=True skips the error if the video is already indexed
video.index_spoken_words(force=True)
text = video.get_transcript_text()
stream_url = video.add_subtitle()
```

### Search inside videos

```python
from videodb.exceptions import InvalidRequestError

video.index_spoken_words(force=True)

# search() raises InvalidRequestError when no results are found.
# Always wrap in try/except and treat "No results found" as empty.
try:
    results = video.search("product demo")
    shots = results.get_shots()
    stream_url = results.compile()
except InvalidRequestError as e:
    if "No results found" in str(e):
        shots = []
    else:
        raise
```

### Scene search

```python
import re
from videodb import SearchType, IndexType, SceneExtractionType
from videodb.exceptions import InvalidRequestError

# index_scenes() has no force parameter — it raises an error if a scene
# index already exists. Extract the existing index ID from the error.
try:
    scene_index_id = video.index_scenes(
        extraction_type=SceneExtractionType.shot_based,
        prompt="Describe the visual content in this scene.",
    )
except Exception as e:
    match = re.search(r"id\s+([a-f0-9]+)", str(e))
    if match:
        scene_index_id = match.group(1)
    else:
        raise

# Use score_threshold to filter low-relevance noise (recommended: 0.3+)
try:
    results = video.search(
        query="person writing on a whiteboard",
        search_type=SearchType.semantic,
        index_type=IndexType.scene,
        scene_index_id=scene_index_id,
        score_threshold=0.3,
    )
    shots = results.get_shots()
    stream_url = results.compile()
except InvalidRequestError as e:
    if "No results found" in str(e):
        shots = []
    else:
        raise
```

### Timeline editing

**Important:** Always validate timestamps before building a timeline:

- `start` must be >= 0 (negative values are silently accepted but produce broken output)
- `start` must be < `end`
- `end` must be <= `video.length`

```python
from videodb.timeline import Timeline
from videodb.asset import VideoAsset, TextAsset, TextStyle

timeline = Timeline(conn)
timeline.add_inline(VideoAsset(asset_id=video.id, start=10, end=30))
timeline.add_overlay(0, TextAsset(text="The End", duration=3, style=TextStyle(fontsize=36)))
stream_url = timeline.generate_stream()
```

### Transcode video (resolution / quality change)

```python
from videodb import TranscodeMode, VideoConfig, AudioConfig

# Change resolution, quality, or aspect ratio server-side
job_id = conn.transcode(
    source="https://example.com/video.mp4",
    callback_url="https://example.com/webhook",
    mode=TranscodeMode.economy,
    video_config=VideoConfig(resolution=720, quality=23, aspect_ratio="16:9"),
    audio_config=AudioConfig(mute=False),
)
```

### Reframe aspect ratio (for social platforms)

**Warning:** `reframe()` is a slow server-side operation.
For long videos it can take several minutes and may time out. Best practices:

- Always limit to a short segment using `start`/`end` when possible
- For full-length videos, use `callback_url` for async processing
- Trim the video on a `Timeline` first, then reframe the shorter result

```python
from videodb import ReframeMode

# Always prefer reframing a short segment:
reframed = video.reframe(start=0, end=60, target="vertical", mode=ReframeMode.smart)

# Async reframe for full-length videos (returns None, result via webhook):
video.reframe(target="vertical", callback_url="https://example.com/webhook")

# Presets: "vertical" (9:16), "square" (1:1), "landscape" (16:9)
reframed = video.reframe(start=0, end=60, target="square")

# Custom dimensions
reframed = video.reframe(start=0, end=60, target={"width": 1280, "height": 720})
```

### Generative media

```python
image = coll.generate_image(
    prompt="a sunset over mountains",
    aspect_ratio="16:9",
)
```

## Error handling

```python
from videodb.exceptions import AuthenticationError, InvalidRequestError

try:
    conn = videodb.connect()
except AuthenticationError:
    print("Check your VIDEO_DB_API_KEY")

try:
    video = coll.upload(url="https://example.com/video.mp4")
except InvalidRequestError as e:
    print(f"Upload failed: {e}")
```

### Common pitfalls

| Scenario | Error message | Solution |
|----------|--------------|----------|
| Indexing an already-indexed video | `Spoken word index for video already exists` | Use `video.index_spoken_words(force=True)` to skip if already indexed |
| Scene index already exists | `Scene index with id XXXX already exists` | Extract the existing `scene_index_id` from the error with `re.search(r"id\s+([a-f0-9]+)", str(e))` |
| Search finds no matches | `InvalidRequestError: No results found` | Catch the exception and treat as empty results (`shots = []`) |
| Reframe times out | Blocks indefinitely on long videos | Use `start`/`end` to limit segment, or pass `callback_url` for async |
| Negative timestamps on Timeline | Silently produces broken stream | Always validate `start >= 0` before creating `VideoAsset` |
| `generate_video()` / `create_collection()` fails | `Operation not allowed` or `maximum limit` | Plan-gated features — inform the user about plan limits |

## Examples

### Canonical prompts

- "Start desktop capture and alert when a password field appears."
- "Record my session and produce an actionable summary when it ends."
- "Ingest this file and return a playable stream link."
- "Index this folder and find every scene with people, return timestamps."
- "Generate subtitles, burn them in, and add light background music."
- "Connect this RTSP URL and alert when a person enters the zone."

### Screen Recording (Desktop Capture)

Use `ws_listener.py` to capture WebSocket events during recording sessions. Desktop capture supports **macOS** only.

#### Quick Start

1. **Choose state dir**: `STATE_DIR="${VIDEODB_EVENTS_DIR:-$HOME/.local/state/videodb}"`
2. **Start listener**: `VIDEODB_EVENTS_DIR="$STATE_DIR" python scripts/ws_listener.py --clear "$STATE_DIR" &`
3. **Get WebSocket ID**: `cat "$STATE_DIR/videodb_ws_id"`
4. **Run capture code** (see reference/capture.md for the full workflow)
5. **Events written to**: `$STATE_DIR/videodb_events.jsonl`

Use `--clear` whenever you start a fresh capture run so stale transcript and visual events do not leak into the new session.
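For alert-style workflows it helps to pick up only the events appended since the last poll, rather than re-reading the whole file each time. A minimal sketch of an offset-based incremental reader (the `read_new_events` helper is illustrative, not part of the SDK or the listener script):

```python
import json
from pathlib import Path


def read_new_events(path, offset=0):
    """Read events appended to a JSONL file since byte `offset`.

    Returns (events, new_offset); pass new_offset back in on the
    next poll so already-seen lines are not re-parsed.
    """
    path = Path(path)
    if not path.exists():
        return [], offset
    events = []
    # Binary mode keeps offsets as plain byte positions.
    with path.open("rb") as fh:
        fh.seek(offset)
        for raw in fh:
            try:
                events.append(json.loads(raw))
            except json.JSONDecodeError:
                continue  # skip partial or corrupt lines
        new_offset = fh.tell()
    return events, new_offset
```

Calling this in a polling loop with the returned offset avoids re-parsing old events and tolerates half-written lines at the tail of the file.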
#### Query Events

```python
import json
import os
import time
from pathlib import Path

events_dir = Path(os.environ.get("VIDEODB_EVENTS_DIR", Path.home() / ".local" / "state" / "videodb"))
events_file = events_dir / "videodb_events.jsonl"
events = []

if events_file.exists():
    with events_file.open(encoding="utf-8") as handle:
        for line in handle:
            try:
                events.append(json.loads(line))
            except json.JSONDecodeError:
                continue

transcripts = [e["data"]["text"] for e in events if e.get("channel") == "transcript"]
cutoff = time.time() - 300
recent_visual = [
    e for e in events
    if e.get("channel") == "visual_index" and e["unix_ts"] > cutoff
]
```

## Additional docs

Reference documentation is in the `reference/` directory adjacent to this SKILL.md file. Use the Glob tool to locate it if needed.

- [reference/api-reference.md](reference/api-reference.md) - Complete VideoDB Python SDK API reference
- [reference/search.md](reference/search.md) - In-depth guide to video search (spoken word and scene-based)
- [reference/editor.md](reference/editor.md) - Timeline editing, assets, and composition
- [reference/streaming.md](reference/streaming.md) - HLS streaming and instant playback
- [reference/generative.md](reference/generative.md) - AI-powered media generation (images, video, audio)
- [reference/rtstream.md](reference/rtstream.md) - Live stream ingestion workflow (RTSP/RTMP)
- [reference/rtstream-reference.md](reference/rtstream-reference.md) - RTStream SDK methods and AI pipelines
- [reference/capture.md](reference/capture.md) - Desktop capture workflow
- [reference/capture-reference.md](reference/capture-reference.md) - Capture SDK and WebSocket events
- [reference/use-cases.md](reference/use-cases.md) - Common video processing patterns and examples

**Do not use ffmpeg, moviepy, or local encoding tools** when VideoDB supports the operation.
The following are all handled server-side by VideoDB — trimming, combining clips, overlaying audio or music, adding subtitles, text/image overlays, transcoding, resolution changes, aspect-ratio conversion, resizing for platform requirements, transcription, and media generation. Only fall back to local tools for operations listed under Limitations in reference/editor.md (transitions, speed changes, crop/zoom, colour grading, volume mixing).

### When to use what

| Problem | VideoDB solution |
|---------|-----------------|
| Platform rejects video aspect ratio or resolution | `video.reframe()` or `conn.transcode()` with `VideoConfig` |
| Need to resize video for Twitter/Instagram/TikTok | `video.reframe(target="vertical")` or `target="square"` |
| Need to change resolution (e.g. 1080p → 720p) | `conn.transcode()` with `VideoConfig(resolution=720)` |
| Need to overlay audio/music on video | `AudioAsset` on a `Timeline` |
| Need to add subtitles | `video.add_subtitle()` or `CaptionAsset` |
| Need to combine/trim clips | `VideoAsset` on a `Timeline` |
| Need to generate voiceover, music, or SFX | `coll.generate_voice()`, `generate_music()`, `generate_sound_effect()` |

## Provenance

Reference material for this skill is vendored locally under `skills/videodb/reference/`. Use the local copies above instead of following external repository links at runtime.
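The timestamp rules listed under Timeline editing above can be enforced locally before any server round-trip. A minimal sketch (the `validate_clip` helper is a hypothetical name, not part of the VideoDB SDK):

```python
def validate_clip(start, end, video_length):
    """Check Timeline timestamp rules before constructing a VideoAsset.

    Raises ValueError on bad values, since the API silently accepts
    e.g. negative starts and then produces broken output.
    """
    if start < 0:
        raise ValueError(f"start must be >= 0, got {start}")
    if start >= end:
        raise ValueError(f"start ({start}) must be < end ({end})")
    if end > video_length:
        raise ValueError(f"end ({end}) exceeds video length ({video_length})")
    return start, end
```

Call it with `video.length` right before building each clip, e.g. `validate_clip(10, 30, video.length)`, so a bad range fails fast instead of generating a broken stream.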