How Image Gen fits into a Paperclip company.

Image Gen drops into any Paperclip agent that handles image generation. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat, with no prompt engineering or tool wiring.
Source file
SKILL.md (314 lines, markdown)
---
name: image-gen
description: |
  Generate AI images from text prompts. Triggers on: "生成图片", "画一张",
  "AI图", "generate image", "配图", "create picture", "draw", "visualize",
  "generate an image".
metadata:
  openclaw:
    emoji: "🖼️"
    requires:
      bin: ["listenhub"]
    primaryBin: "listenhub"
---

## When to Use

- User wants to generate an AI image from a text description
- User says "generate image", "draw", "create picture", "配图"
- User says "生成图片", "画一张", "AI图"
- User needs a cover image, illustration, or concept art

## When NOT to Use

- User wants to create audio content (use `/podcast`, `/speech`)
- User wants to create a video (use `/explainer`)
- User wants to edit an existing image (not supported)
- User wants to extract content from a URL (use `/content-parser`)

## Purpose

Generate AI images using the ListenHub CLI. Supports text prompts with optional reference images (local files or URLs), multiple resolutions, and aspect ratios. Images are saved as local files.

## Hard Constraints

- Always check CLI auth following `shared/cli-authentication.md`
- Follow `shared/cli-patterns.md` for command execution and error handling
- Always read config following `shared/config-pattern.md` before any interaction
- Output saved to `.listenhub/image-gen/YYYY-MM-DD-{jobId}/` — never `~/Downloads/`

<HARD-GATE>
Use the AskUserQuestion tool for every multiple-choice step — do NOT print options as plain text. Ask one question at a time. Wait for the user's answer before proceeding to the next step. After all parameters are collected, summarize the choices and ask the user to confirm. Do NOT call the image generation command until the user has explicitly confirmed.
</HARD-GATE>

## Step -1: CLI Auth Check

Follow `shared/cli-authentication.md` § Auth Check. If CLI is not installed or not logged in, auto-install and auto-login — never ask the user to run commands manually.

## Step 0: Config Setup

Follow `shared/config-pattern.md` Step 0 (Zero-Question Boot).

**If file doesn't exist** — silently create with defaults and proceed:
```bash
mkdir -p ".listenhub/image-gen"
echo '{"outputDir":".listenhub","outputMode":"inline"}' > ".listenhub/image-gen/config.json"
CONFIG_PATH=".listenhub/image-gen/config.json"
CONFIG=$(cat "$CONFIG_PATH")
```
**Do NOT ask any setup questions.** Proceed directly to the Interaction Flow.

**If file exists** — read config silently and proceed:
```bash
CONFIG_PATH=".listenhub/image-gen/config.json"
[ ! -f "$CONFIG_PATH" ] && CONFIG_PATH="$HOME/.listenhub/image-gen/config.json"
CONFIG=$(cat "$CONFIG_PATH")
```

### Setup Flow (user-initiated reconfigure only)

Only run when the user explicitly asks to reconfigure. Display current settings:
```
当前配置 (image-gen)：
  输出方式：{inline / download / both}
```

Then ask:

1. **outputMode**: Follow `shared/output-mode.md` § Setup Flow Question.

Save immediately:
```bash
NEW_CONFIG=$(echo "$CONFIG" | jq --arg m "$OUTPUT_MODE" '. + {"outputMode": $m}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"
CONFIG=$(cat "$CONFIG_PATH")
```

## Interaction Flow

### Step 1: Image Description

Free text input. Ask the user:

> Describe the image you want to generate.

If the prompt is very short (< 10 words) and the user hasn't asked for verbatim generation, offer to help enrich the prompt. Otherwise, use as-is.

### Step 2: Model

Ask:

```
Question: "Which model?"
Options:
  - "pro (recommended)" — gemini-3-pro-image-preview, higher quality
  - "flash" — gemini-3.1-flash-image-preview, faster and cheaper, unlocks extreme aspect ratios (1:4, 4:1, 1:8, 8:1)
```

### Step 3: Resolution and Aspect Ratio

Ask both together (independent parameters):

```
Question: "What resolution?"
Options:
  - "1K" — Standard quality
  - "2K (recommended)" — High quality, good balance
  - "4K" — Ultra high quality, slower generation
```

```
Question: "What aspect ratio?"
Options (all models):
  - "16:9" — Landscape, widescreen
  - "1:1" — Square
  - "9:16" — Portrait, phone screen
  - "Other" — 2:3, 3:2, 3:4, 4:3, 21:9
```

If flash model was selected, also offer: `1:4` (narrow portrait), `4:1` (wide landscape), `1:8` (extreme portrait), `8:1` (panoramic)

### Step 4: Reference Images (optional)

```
Question: "Any reference images for style guidance?"
Options:
  - "Yes" — Provide file paths or URLs
  - "No references" — Generate from prompt only
```

**If yes**: Collect reference image paths or URLs (comma-separated). The CLI handles both local files and URLs natively — no need to distinguish between them.

- Max 5 references
- Supported formats: jpg, png, webp, gif
- Max 10MB per file

Each reference will be passed as a `--reference` flag to the CLI.

### Step 5: Confirm & Generate

Summarize all choices:

```
Ready to generate image:

  Prompt: {prompt text}
  Model: {pro / flash}
  Resolution: {1K / 2K / 4K}
  Aspect ratio: {ratio}
  References: {yes — N image(s) / no}

  Proceed?
```

Wait for explicit confirmation before running the CLI command.

## Workflow

1. **Build CLI command**: Construct the `listenhub image create` command with all collected parameters.

2. **Execute**: Run the command with `run_in_background: true` and `timeout: 180000`:

   ```bash
   listenhub image create \
     --prompt "{description}" \
     --model "{model}" \
     --lang "{lang}" \
     --aspect-ratio {16:9|9:16|1:1} \
     --size {1K|2K|4K} \
     --json
   ```

   If reference images were provided, add `--reference` for each:
   ```bash
   listenhub image create \
     --prompt "{description}" \
     --model "{model}" \
     --lang "{lang}" \
     --aspect-ratio 16:9 \
     --size 2K \
     --reference ./sketch.png \
     --reference ./photo.jpg \
     --json
   ```

   The `--lang` flag provides a language hint for the prompt. Detect from the user's prompt language (e.g., Chinese prompt → `zh`, English prompt → `en`).

3. **Parse result and present**

   Read `OUTPUT_MODE` from config. Follow `shared/output-mode.md` for behavior.

   Parse the CLI JSON output to extract the image URL:
   ```bash
   IMAGE_URL=$(echo "$RESULT" | jq -r '.imageUrl')
   ```

   **`inline` or `both`**: Download to a temp file, then use the Read tool.

   ```bash
   JOB_ID=$(date +%s)
   listenhub download "$IMAGE_URL" -o /tmp/image-gen-${JOB_ID}.jpg
   ```
   Then use the Read tool on `/tmp/image-gen-{jobId}.jpg`. The image displays inline in the conversation.

   Present:
   ```
   图片已生成！
   ```

   **`download` or `both`**: Save to the artifact directory.

   ```bash
   JOB_ID=$(date +%s)
   DATE=$(date +%Y-%m-%d)
   JOB_DIR=".listenhub/image-gen/${DATE}-${JOB_ID}"
   mkdir -p "$JOB_DIR"
   listenhub download "$IMAGE_URL" -o "${JOB_DIR}/${JOB_ID}.jpg"
   ```

   Present:
   ```
   图片已生成！

   已保存到 .listenhub/image-gen/{YYYY-MM-DD}-{jobId}/：
     {jobId}.jpg
   ```

## Prompt Handling

**Default**: Pass the user's prompt directly without modification.

**When to offer optimization**:
- Prompt is very short (a few words) AND user hasn't requested verbatim
- Ask: "Would you like help enriching the prompt with style/lighting/composition details?"

**When to never modify**:
- Long, detailed, or structured prompts — treat the user as experienced
- User says "use this prompt exactly"

**Optimization techniques** (if user agrees):
- Style: "cyberpunk" → add "neon lights, futuristic, dystopian"
- Scene: time of day, lighting, weather
- Quality: "highly detailed", "8K quality", "cinematic composition"
- Always use English keywords (models trained on English)
- Show optimized prompt before submitting

## API Reference

- CLI authentication: `shared/cli-authentication.md`
- CLI execution patterns: `shared/cli-patterns.md`
- Config pattern: `shared/config-pattern.md`
- Output mode: `shared/output-mode.md`

## Composability

- **Invokes**: nothing (direct CLI call)
- **Invoked by**: platform skills for cover images (Phase 2)

## Example

**User**: "Generate an image: cyberpunk city at night"

**Agent workflow**:
1. Prompt is short → offer enrichment → user declines
2. Ask model → "pro"
3. Ask resolution → "2K"
4. Ask ratio → "16:9"
5. No references

```bash
listenhub image create \
  --prompt "cyberpunk city at night" \
  --model "gemini-3-pro-image-preview" \
  --lang en \
  --aspect-ratio 16:9 \
  --size 2K \
  --json
```

Parse CLI JSON output per `outputMode` (see `shared/output-mode.md`).

### Example 2 — With Reference Images

**User**: "Generate an image in this style" (provides local files and a URL)

**Agent workflow**:
1. Ask prompt → "a serene mountain lake at dawn"
2. Ask model → "pro"
3. Ask resolution → "2K"
4. Ask ratio → "16:9"
5. References → `/path/to/style-reference.png`, `https://example.com/photo.jpg`

```bash
listenhub image create \
  --prompt "a serene mountain lake at dawn" \
  --model "gemini-3-pro-image-preview" \
  --lang en \
  --aspect-ratio 16:9 \
  --size 2K \
  --reference /path/to/style-reference.png \
  --reference https://example.com/photo.jpg \
  --json
```

Parse CLI JSON output per `outputMode` (see `shared/output-mode.md`).
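
The "Build CLI command" step of the Workflow could be sketched as a small shell helper that assembles the arguments into an array, one `--reference` pair per image. This is a minimal sketch, not part of the skill or the ListenHub CLI: the function name `build_image_cmd` and the global array `CMD` are hypothetical, and the two checks (at most 5 references, extreme ratios only with the flash model) simply mirror the limits stated in Steps 2 to 4. It is an assumption that checking them before the call is useful; the CLI may well enforce them itself. Only flags that appear in this skill are used, and the sketch prints the command without running it.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the ListenHub CLI): fills the global
# array CMD with the arguments for `listenhub image create`.
# Usage: build_image_cmd PROMPT MODEL LANG RATIO SIZE [REFERENCE...]
build_image_cmd() {
  CMD=()   # reset so a failed call never leaves a stale command behind
  local prompt="$1" model="$2" lang="$3" ratio="$4" size="$5"
  shift 5
  local refs=("$@")   # remaining arguments: reference paths or URLs

  # Step 4 of the skill allows at most 5 reference images.
  if [ "${#refs[@]}" -gt 5 ]; then
    echo "error: at most 5 reference images are allowed" >&2
    return 1
  fi

  # Steps 2 and 3 offer the extreme ratios only with the flash model.
  case "$ratio" in
    1:4|4:1|1:8|8:1)
      case "$model" in
        *flash*) ;;
        *) echo "error: ratio $ratio requires the flash model" >&2
           return 1 ;;
      esac
      ;;
  esac

  CMD=(listenhub image create
       --prompt "$prompt"
       --model "$model"
       --lang "$lang"
       --aspect-ratio "$ratio"
       --size "$size")

  # One --reference flag per image; local files and URLs are passed alike.
  local ref
  for ref in "${refs[@]}"; do
    CMD+=(--reference "$ref")
  done
  CMD+=(--json)
}

# Print (do not run) the command from Example 2, shell-quoted.
build_image_cmd "a serene mountain lake at dawn" "gemini-3-pro-image-preview" \
  en 16:9 2K /path/to/style-reference.png https://example.com/photo.jpg
printf '%q ' "${CMD[@]}"
echo
```

Keeping the arguments in an array rather than one concatenated string means prompts with spaces or quotes survive intact. Once the user has confirmed, the command could be run with `RESULT=$("${CMD[@]}")`.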