Name: Videoagent Video Studio
Author: Pexoai

Install

$ npx skills add https://github.com/pexoai/pexo-skills --skill videoagent-video-studio

Works with Paperclip

How Videoagent Video Studio fits into a Paperclip company.

Videoagent Video Studio drops into any Paperclip agent that handles video work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat, with no prompt engineering and no tool wiring.

Source file

SKILL.md (214 lines, markdown)

---
name: videoagent-video-studio
version: 2.1.0
author: pexoai
emoji: "🎬"
tags:
  - video
  - video-generation
  - text-to-video
  - image-to-video
  - veo
  - grok
  - kling
  - seedance
  - minimax
  - hunyuan
  - pixverse
description: >
  Generate short AI videos from text or images — text-to-video, image-to-video, and reference-based generation — with zero API key setup. Use when the user wants to create a video clip, animate an image, or generate video from a description.
metadata:
  openclaw:
    emoji: "🎬"
    install:
      - id: node
        kind: node
        label: "No dependencies needed — all calls go through the hosted proxy"
---

# 🎬 VideoAgent Video Studio

**Use when:** User asks to generate a video, create a video from text, animate an image, make a short clip, or produce AI video.

Generate short AI videos with 7 backends. This skill picks the right mode (text-to-video or image-to-video), enhances the prompt for best results, and returns the video URL.

---

## Quick Reference

| User Intent | Mode | Typical Duration |
|-------------|------|------------------|
| "Make a video of..." (no image) | `text-to-video` | 4–10 s |
| "Animate this image" / "Make this move" | `image-to-video` | 4–6 s |
| "Turn this into a video with..." | `image-to-video` | 4–6 s |
| Cinematic, story, ad | Prefer `text-to-video` with detailed prompt | 5–10 s |

### Generation Modes

| Mode | Description | Models |
|------|-------------|--------|
| **text-to-video** | Text prompt only → video | minimax, kling, veo, hunyuan, grok, seedance |
| **image-to-video** | Single image + prompt → animated clip | minimax, kling, veo, pixverse, grok, seedance |
| **reference-based** | Reference images/video → consistent output | minimax, kling, veo, hunyuan, grok, seedance |

### Models (use `--model <id>`)

| Model ID | T2V | I2V | Reference | Notes |
|----------|-----|-----|-----------|-------|
| `minimax` | ✅ | ✅ | ✅ | Subject reference image, character consistency |
| `kling` | ✅ | ✅ | ✅ | Multi-element / character / keyframe (O3) |
| `veo` | ✅ | ✅ | ✅ | Google Veo 3.1, multiple reference images |
| `hunyuan` | ✅ | — | ✅ | Video-to-video style transfer |
| `pixverse` | — | ✅ | — | Stylized image-to-video |
| `grok` | ✅ | ✅ | ✅ | Video editing via reference video |
| `seedance` | ✅ | ✅ | ✅ | Seedance 1.5 Pro, synchronized audio, 4–12 s |

Full model details and endpoint reference: [references/models.md](references/models.md).

---

## How to Generate a Video

### Step 1 — Choose mode and enhance the prompt

- **Text-to-video**: Expand with subject, action, camera movement, lighting, and style. Be specific about motion (e.g. "camera slowly zooms in", "character walks left to right").
- **Image-to-video**: Describe the motion to apply to the image (e.g. "gentle breeze in the hair", "camera pans across the scene"). See [references/prompt_guide.md](references/prompt_guide.md) for patterns.

### Step 2 — Run the script

**Text-to-video:**
```bash
node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "<enhanced prompt>" \
  --duration <seconds> \
  --aspect-ratio <ratio>
```

**Image-to-video:**
```bash
node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --prompt "<motion description>" \
  --image-url "<public image URL>" \
  --duration <seconds> \
  --aspect-ratio <ratio>
```

**Parameters:**

| Parameter | Default | Description |
|-----------|---------|-------------|
| `--mode` | `text-to-video` | `text-to-video` or `image-to-video` |
| `--prompt` | *(required)* | Scene or motion description |
| `--image-url` | — | Required for `image-to-video`; public image URL |
| `--duration` | `5` | Length in seconds (typically 4–10) |
| `--aspect-ratio` | `16:9` | `16:9`, `9:16`, `1:1`, `4:3`, `3:4` |
| `--model` | `auto` | Model ID (e.g. `kling`, `veo`, `grok`, `seedance`); `auto` = proxy picks |

**Other commands:**

| Command | Description |
|---------|-------------|
| `node tools/generate.js --list-models` | List available models from the proxy |
| `node tools/generate.js --status --job-id <id>` | Check async job status |

### Step 3 — Return the result

The script returns JSON:

```json
{
  "success": true,
  "mode": "text-to-video",
  "videoUrl": "https://...",
  "duration": 5,
  "aspectRatio": "16:9"
}
```

Send `videoUrl` to the user.

---

## Example Conversations

**User:** "Generate a short video of a cat walking in the rain, cinematic."

```bash
node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "A cat walking through rain, wet streets, neon reflections, cinematic lighting, slow motion, 4K" \
  --duration 5 \
  --aspect-ratio 16:9
```

---

**User:** "Animate this photo" *(user uploads a landscape)*

```bash
node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --prompt "Gentle clouds moving across the sky, subtle grass movement, cinematic atmosphere" \
  --image-url "https://..." \
  --duration 5 \
  --aspect-ratio 16:9
```

---

**User:** "Make a 10-second vertical video of a coffee pour, slow motion."

```bash
node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "Close-up of coffee pouring into a white cup, slow motion, steam rising, soft lighting, product shot" \
  --duration 10 \
  --aspect-ratio 9:16
```

---

**User:** "Use Google Veo for a cinematic shot."

```bash
node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --model veo \
  --prompt "A dragon flying through cloudy skies, cinematic lighting, 8s" \
  --duration 8 \
  --aspect-ratio 16:9
```

---

**User:** "Animate this portrait."

```bash
node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --model grok \
  --prompt "Gentle smile, subtle head turn" \
  --image-url "https://..." \
  --duration 5
```

---

## Setup

**Zero API keys by default.** Requests go through a hosted proxy. Set these for a custom proxy or token:

| Variable | Required | Description |
|----------|----------|-------------|
| `VIDEO_STUDIO_PROXY_URL` | No | Proxy base URL |
| `VIDEO_STUDIO_TOKEN` | No | Auth token if the proxy requires it |

---

## Knowledge Base

- **[references/prompt_guide.md](references/prompt_guide.md)** — Prompt patterns for text-to-video and image-to-video.
- **[references/models.md](references/models.md)** — Model list, capabilities, and selection guide.
- **[references/calling_guide.md](references/calling_guide.md)** — Per-model endpoint details, input parameters, and special handling.
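
The three steps under "How to Generate a Video" could be wrapped in a pair of small Node helpers. This is a minimal sketch, not part of the skill itself: the function names `buildGenerateArgs` and `extractVideoUrl` and the option keys are hypothetical, and only the command-line flags, their defaults, and the JSON fields `success` and `videoUrl` are taken from the tables and sample output above. How the script reports a failure is not documented here, so the sketch assumes that anything other than `"success": true` with a string `videoUrl` should be treated as an error.

```javascript
// Hypothetical helpers; only the CLI flags and JSON field names come from SKILL.md.
const MODES = ["text-to-video", "image-to-video"];
const ASPECT_RATIOS = ["16:9", "9:16", "1:1", "4:3", "3:4"];

// Build the argument list for `node <args...>` from a plain options object.
function buildGenerateArgs(baseDir, opts) {
  const {
    mode = "text-to-video", // documented default
    prompt,
    imageUrl,
    duration = 5,           // documented default, in seconds
    aspectRatio = "16:9",   // documented default
    model,                  // omitted: the script falls back to its own default
  } = opts;

  if (!MODES.includes(mode)) throw new Error(`Unsupported mode: ${mode}`);
  if (!prompt) throw new Error("--prompt is required");
  if (mode === "image-to-video" && !imageUrl) {
    throw new Error("--image-url is required for image-to-video");
  }
  if (!ASPECT_RATIOS.includes(aspectRatio)) {
    throw new Error(`Unsupported aspect ratio: ${aspectRatio}`);
  }

  const args = [
    `${baseDir}/tools/generate.js`,
    "--mode", mode,
    "--prompt", prompt,
    "--duration", String(duration),
    "--aspect-ratio", aspectRatio,
  ];
  // Only image-to-video takes an image URL.
  if (mode === "image-to-video") args.push("--image-url", imageUrl);
  // Pass --model only when the caller picked one explicitly.
  if (model) args.push("--model", model);
  return args;
}

// Parse the script's JSON output and return the URL to send to the user.
// Assumption: a failed run does not set success to true.
function extractVideoUrl(stdout) {
  const result = JSON.parse(stdout);
  if (result.success !== true || typeof result.videoUrl !== "string") {
    throw new Error("Generation did not return a video URL");
  }
  return result.videoUrl;
}
```

The returned array is meant to be handed to Node's `child_process.execFile("node", args, callback)`, with the captured stdout passed to `extractVideoUrl`. Passing arguments as an array rather than a single shell string avoids quoting problems when a prompt contains quotes or other shell metacharacters.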

Related skills

Pexo Agent

A solid integration for generating short videos through Pexo's AI platform without leaving your Claude workflow. Handles the full pipeline from uploading assets

Seedance 2.0 Prompter

Install Seedance 2.0 Prompter skill for Claude Code from pexoai/pexo-skills.

Videoagent Audio Studio

This is essentially an audio API router that saves you from juggling multiple services. Point it at text and it'll generate speech via ElevenLabs, ask for backg