Claude Agent Skill · by Affaan M

Fal Ai Media

Install Fal Ai Media skill for Claude Code from affaan-m/everything-claude-code.

Install
Terminal · npx
$ npx skills add https://github.com/affaan-m/everything-claude-code --skill fal-ai-media
Works with Paperclip

How Fal Ai Media fits into a Paperclip company.

Fal Ai Media drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

Paired pack: SaaS Factory, a pre-configured AI company with 18 agents and 18 skills. One-time purchase: $27 (regularly $59).
Source file: SKILL.md (284 lines)
---
name: fal-ai-media
description: Unified media generation via fal.ai MCP — image, video, and audio. Covers text-to-image (Nano Banana), text/image-to-video (Seedance, Kling, Veo 3), text-to-speech (CSM-1B), and video-to-audio (ThinkSound). Use when the user wants to generate images, videos, or audio with AI.
origin: ECC
---

# fal.ai Media Generation

Generate images, videos, and audio using fal.ai models via MCP.

## When to Activate

- User wants to generate images from text prompts
- Creating videos from text or images
- Generating speech, music, or sound effects
- Any media generation task
- User says "generate image", "create video", "text to speech", "make a thumbnail", or similar

## MCP Requirement

The fal.ai MCP server must be configured. Add to `~/.claude.json`:

```json
"fal-ai": {
  "command": "npx",
  "args": ["-y", "fal-ai-mcp-server"],
  "env": { "FAL_KEY": "YOUR_FAL_KEY_HERE" }
}
```

Get an API key at [fal.ai](https://fal.ai).

## MCP Tools

The fal.ai MCP provides these tools:

- `search` — Find available models by keyword
- `find` — Get model details and parameters
- `generate` — Run a model with parameters
- `result` — Retrieve the output of an async generation
- `status` — Check job status
- `cancel` — Cancel a running job
- `estimate_cost` — Estimate generation cost
- `models` — List popular models
- `upload` — Upload files for use as inputs

---

## Image Generation

### Nano Banana 2 (Fast)

Best for: quick iterations, drafts, text-to-image, image editing.

```
generate(
  app_id: "fal-ai/nano-banana-2",
  input_data: {
    "prompt": "a futuristic cityscape at sunset, cyberpunk style",
    "image_size": "landscape_16_9",
    "num_images": 1,
    "seed": 42
  }
)
```

### Nano Banana Pro (High Fidelity)

Best for: production images, realism, typography, detailed prompts.
```
generate(
  app_id: "fal-ai/nano-banana-pro",
  input_data: {
    "prompt": "professional product photo of wireless headphones on marble surface, studio lighting",
    "image_size": "square",
    "num_images": 1,
    "guidance_scale": 7.5
  }
)
```

### Common Image Parameters

| Param | Type | Options | Notes |
|-------|------|---------|-------|
| `prompt` | string | required | Describe what you want |
| `image_size` | string | `square`, `portrait_4_3`, `landscape_16_9`, `portrait_16_9`, `landscape_4_3` | Aspect ratio |
| `num_images` | number | 1-4 | How many to generate |
| `seed` | number | any integer | Reproducibility |
| `guidance_scale` | number | 1-20 | How closely to follow the prompt (higher = more literal) |

### Image Editing

Use Nano Banana 2 with an input image for inpainting, outpainting, or style transfer:

```
# First upload the source image
upload(file_path: "/path/to/image.png")

# Then generate with image input
generate(
  app_id: "fal-ai/nano-banana-2",
  input_data: {
    "prompt": "same scene but in watercolor style",
    "image_url": "<uploaded_url>",
    "image_size": "landscape_16_9"
  }
)
```

---

## Video Generation

### Seedance 1.0 Pro (ByteDance)

Best for: text-to-video, image-to-video with high motion quality.

```
generate(
  app_id: "fal-ai/seedance-1-0-pro",
  input_data: {
    "prompt": "a drone flyover of a mountain lake at golden hour, cinematic",
    "duration": "5s",
    "aspect_ratio": "16:9",
    "seed": 42
  }
)
```

### Kling Video v3 Pro

Best for: text/image-to-video with native audio generation.

```
generate(
  app_id: "fal-ai/kling-video/v3/pro",
  input_data: {
    "prompt": "ocean waves crashing on a rocky coast, dramatic clouds",
    "duration": "5s",
    "aspect_ratio": "16:9"
  }
)
```

### Veo 3 (Google DeepMind)

Best for: video with generated sound, high visual quality.
```
generate(
  app_id: "fal-ai/veo-3",
  input_data: {
    "prompt": "a bustling Tokyo street market at night, neon signs, crowd noise",
    "aspect_ratio": "16:9"
  }
)
```

### Image-to-Video

Start from an existing image:

```
generate(
  app_id: "fal-ai/seedance-1-0-pro",
  input_data: {
    "prompt": "camera slowly zooms out, gentle wind moves the trees",
    "image_url": "<uploaded_image_url>",
    "duration": "5s"
  }
)
```

### Video Parameters

| Param | Type | Options | Notes |
|-------|------|---------|-------|
| `prompt` | string | required | Describe the video |
| `duration` | string | `"5s"`, `"10s"` | Video length |
| `aspect_ratio` | string | `"16:9"`, `"9:16"`, `"1:1"` | Frame ratio |
| `seed` | number | any integer | Reproducibility |
| `image_url` | string | URL | Source image for image-to-video |

---

## Audio Generation

### CSM-1B (Conversational Speech)

Text-to-speech with natural, conversational quality.

```
generate(
  app_id: "fal-ai/csm-1b",
  input_data: {
    "text": "Hello, welcome to the demo. Let me show you how this works.",
    "speaker_id": 0
  }
)
```

### ThinkSound (Video-to-Audio)

Generate matching audio from video content.
```
generate(
  app_id: "fal-ai/thinksound",
  input_data: {
    "video_url": "<video_url>",
    "prompt": "ambient forest sounds with birds chirping"
  }
)
```

### ElevenLabs (via API, no MCP)

For professional voice synthesis, use ElevenLabs directly:

```python
import os

import requests

resp = requests.post(
    "https://api.elevenlabs.io/v1/text-to-speech/<voice_id>",
    headers={
        "xi-api-key": os.environ["ELEVENLABS_API_KEY"],
        "Content-Type": "application/json"
    },
    json={
        "text": "Your text here",
        "model_id": "eleven_turbo_v2_5",
        "voice_settings": {"stability": 0.5, "similarity_boost": 0.75}
    }
)
with open("output.mp3", "wb") as f:
    f.write(resp.content)
```

### VideoDB Generative Audio

If VideoDB is configured, use its generative audio:

```python
# Voice generation
audio = coll.generate_voice(text="Your narration here", voice="alloy")

# Music generation
music = coll.generate_music(prompt="upbeat electronic background music", duration=30)

# Sound effects
sfx = coll.generate_sound_effect(prompt="thunder crack followed by rain")
```

---

## Cost Estimation

Before generating, check estimated cost:

```
estimate_cost(
  estimate_type: "unit_price",
  endpoints: {
    "fal-ai/nano-banana-pro": {
      "unit_quantity": 1
    }
  }
)
```

## Model Discovery

Find models for specific tasks:

```
search(query: "text to video")
find(endpoint_ids: ["fal-ai/seedance-1-0-pro"])
models()
```

## Tips

- Use `seed` for reproducible results when iterating on prompts
- Start with lower-cost models (Nano Banana 2) for prompt iteration, then switch to Pro for finals
- For video, keep prompts descriptive but concise — focus on motion and scene
- Image-to-video produces more controlled results than pure text-to-video
- Check `estimate_cost` before running expensive video generations

## Related Skills

- `videodb` — Video processing, editing, and streaming
- `video-editing` — AI-powered video editing workflows
- `content-engine` — Content creation for social platforms
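For long-running video or audio jobs, the individual tools above compose into a submit-and-poll loop. A minimal sketch in the same tool-call style as the examples in SKILL.md; note the `request_id` parameter name and the shape of the `generate` response are assumptions here, so check the actual response from your MCP server version before relying on them:

```
# 1. Check the price before committing
estimate_cost(
  estimate_type: "unit_price",
  endpoints: { "fal-ai/seedance-1-0-pro": { "unit_quantity": 1 } }
)

# 2. Submit the job; the response should include an id for the async request
generate(
  app_id: "fal-ai/seedance-1-0-pro",
  input_data: {
    "prompt": "a drone flyover of a mountain lake at golden hour, cinematic",
    "duration": "5s"
  }
)

# 3. Poll until the job finishes, then fetch the output
status(request_id: "<request_id>")   # repeat until the job reports completion
result(request_id: "<request_id>")   # returns the generated media
```

Polling with `status` rather than blocking on `generate` keeps the agent responsive during multi-minute video renders, and `cancel` can abort a job whose cost estimate was misjudged.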