Install
Terminal · npx
$npx skills add https://github.com/coreyhaines31/marketingskills --skill analytics-tracking
Works with Paperclip
How Baoyu Imagine fits into a Paperclip company.

Baoyu Imagine drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.
SaaS FactoryPaired
Pre-configured AI company — 18 agents, 18 skills, one-time purchase.
$27$59
Explore pack
Source file
SKILL.md490 linesmarkdown
Expand
1---2name: baoyu-imagine3description: AI image generation with OpenAI, Azure OpenAI, Google, OpenRouter, DashScope, Z.AI GLM-Image, MiniMax, Jimeng, Seedream and Replicate APIs. Supports text-to-image, reference images, aspect ratios, and batch generation from saved prompt files. Sequential by default; use batch parallel generation when the user already has multiple prompts or wants stable multi-image throughput. Use when user asks to generate, create, or draw images.4version: 1.57.05metadata:6  openclaw:7    homepage: https://github.com/JimLiu/baoyu-skills#baoyu-imagine8    requires:9      anyBins:10        - bun11        - npx12---13 14# Image Generation (AI SDK)15 16Official API-based image generation. Supports OpenAI, Azure OpenAI, Google, OpenRouter, DashScope (阿里通义万象), Z.AI GLM-Image, MiniMax, Jimeng (即梦), Seedream (豆包) and Replicate providers.17 18## Script Directory19 20**Agent Execution**:211. `{baseDir}` = this SKILL.md file's directory222. Script path = `{baseDir}/scripts/main.ts`233. Resolve `${BUN_X}` runtime: if `bun` installed → `bun`; if `npx` available → `npx -y bun`; else suggest installing bun24 25## Step 0: Load Preferences ⛔ BLOCKING26 27**CRITICAL**: This step MUST complete BEFORE any image generation. Do NOT skip or defer.28 29Check EXTEND.md existence (priority: project → user):30 31```bash32# macOS, Linux, WSL, Git Bash33test -f .baoyu-skills/baoyu-imagine/EXTEND.md && echo "project"34test -f "${XDG_CONFIG_HOME:-$HOME/.config}/baoyu-skills/baoyu-imagine/EXTEND.md" && echo "xdg"35test -f "$HOME/.baoyu-skills/baoyu-imagine/EXTEND.md" && echo "user"36```37 38```powershell39# PowerShell (Windows)40if (Test-Path .baoyu-skills/baoyu-imagine/EXTEND.md) { "project" }41$xdg = if ($env:XDG_CONFIG_HOME) { $env:XDG_CONFIG_HOME } else { "$HOME/.config" }42if (Test-Path "$xdg/baoyu-skills/baoyu-imagine/EXTEND.md") { "xdg" }43if (Test-Path "$HOME/.baoyu-skills/baoyu-imagine/EXTEND.md") { "user" }44```45 46| Result | Action |47|--------|--------|48| Found | Load, parse, apply settings. If `default_model.[provider]` is null → ask model only (Flow 2) |49| Not found | ⛔ Run first-time setup ([references/config/first-time-setup.md](references/config/first-time-setup.md)) → Save EXTEND.md → Then continue |50 51**CRITICAL**: If not found, complete the full setup (provider + model + quality + save location) using AskUserQuestion BEFORE generating any images. Generation is BLOCKED until EXTEND.md is created.52 53| Path | Location |54|------|----------|55| `.baoyu-skills/baoyu-imagine/EXTEND.md` | Project directory |56| `$HOME/.baoyu-skills/baoyu-imagine/EXTEND.md` | User home |57 58Legacy compatibility: if `.baoyu-skills/baoyu-image-gen/EXTEND.md` exists and the new path does not, runtime renames it to `baoyu-imagine`. If both files exist, runtime leaves them unchanged and uses the new path.59 60**EXTEND.md Supports**: Default provider | Default quality | Default aspect ratio | Default image size | OpenAI image API dialect | Default models | Batch worker cap | Provider-specific batch limits61 62Schema: `references/config/preferences-schema.md`63 64## Usage65 66```bash67# Basic68${BUN_X} {baseDir}/scripts/main.ts --prompt "A cat" --image cat.png69 70# With aspect ratio71${BUN_X} {baseDir}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:972 73# High quality74${BUN_X} {baseDir}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k75 76# From prompt files77${BUN_X} {baseDir}/scripts/main.ts --promptfiles system.md content.md --image out.png78 79# With reference images (Google, OpenAI, Azure OpenAI, OpenRouter, Replicate supported families, MiniMax, or Seedream 4.0/4.5/5.0)80${BUN_X} {baseDir}/scripts/main.ts --prompt "Make blue" --image out.png --ref source.png81 82# With reference images (explicit provider/model)83${BUN_X} {baseDir}/scripts/main.ts --prompt "Make blue" --image out.png --provider google --model gemini-3-pro-image-preview --ref source.png84 85# Azure OpenAI (model means deployment name)86${BUN_X} {baseDir}/scripts/main.ts --prompt "A cat" --image out.png --provider azure --model gpt-image-1.587 88# OpenRouter (recommended default model)89${BUN_X} {baseDir}/scripts/main.ts --prompt "A cat" --image out.png --provider openrouter90 91# OpenRouter with reference images92${BUN_X} {baseDir}/scripts/main.ts --prompt "Make blue" --image out.png --provider openrouter --model google/gemini-3.1-flash-image-preview --ref source.png93 94# Specific provider95${BUN_X} {baseDir}/scripts/main.ts --prompt "A cat" --image out.png --provider openai96 97# DashScope (阿里通义万象)98${BUN_X} {baseDir}/scripts/main.ts --prompt "一只可爱的猫" --image out.png --provider dashscope99 100# DashScope Qwen-Image 2.0 Pro (recommended for custom sizes and text rendering)101${BUN_X} {baseDir}/scripts/main.ts --prompt "为咖啡品牌设计一张 21:9 横幅海报，包含清晰中文标题" --image out.png --provider dashscope --model qwen-image-2.0-pro --size 2048x872102 103# DashScope legacy Qwen fixed-size model104${BUN_X} {baseDir}/scripts/main.ts --prompt "一张电影感海报" --image out.png --provider dashscope --model qwen-image-max --size 1664x928105 106# Z.AI GLM-image107${BUN_X} {baseDir}/scripts/main.ts --prompt "一张带清晰中文标题的科技海报" --image out.png --provider zai108 109# Z.AI GLM-image with explicit custom size110${BUN_X} {baseDir}/scripts/main.ts --prompt "A science illustration with labels" --image out.png --provider zai --model glm-image --size 1472x1088111 112# MiniMax113${BUN_X} {baseDir}/scripts/main.ts --prompt "A fashion editorial portrait by a bright studio window" --image out.jpg --provider minimax114 115# MiniMax with subject reference (best for character/portrait consistency)116${BUN_X} {baseDir}/scripts/main.ts --prompt "A girl stands by the library window, cinematic lighting" --image out.jpg --provider minimax --model image-01 --ref portrait.png --ar 16:9117 118# MiniMax with custom size (documented for image-01)119${BUN_X} {baseDir}/scripts/main.ts --prompt "A cinematic poster" --image out.jpg --provider minimax --model image-01 --size 1536x1024120 121# Replicate (default: google/nano-banana-2)122${BUN_X} {baseDir}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate123 124# Replicate Seedream 4.5125${BUN_X} {baseDir}/scripts/main.ts --prompt "A cinematic portrait" --image out.png --provider replicate --model bytedance/seedream-4.5 --ar 3:2126 127# Replicate Wan 2.7 Image Pro128${BUN_X} {baseDir}/scripts/main.ts --prompt "A concept frame" --image out.png --provider replicate --model wan-video/wan-2.7-image-pro --size 2048x1152129 130# Batch mode with saved prompt files131${BUN_X} {baseDir}/scripts/main.ts --batchfile batch.json132 133# Batch mode with explicit worker count134${BUN_X} {baseDir}/scripts/main.ts --batchfile batch.json --jobs 4 --json135```136 137### Batch File Format138 139```json140{141  "jobs": 4,142  "tasks": [143    {144      "id": "hero",145      "promptFiles": ["prompts/hero.md"],146      "image": "out/hero.png",147      "provider": "replicate",148      "model": "google/nano-banana-2",149      "ar": "16:9",150      "quality": "2k"151    },152    {153      "id": "diagram",154      "promptFiles": ["prompts/diagram.md"],155      "image": "out/diagram.png",156      "ref": ["references/original.png"]157    }158  ]159}160```161 162Paths in `promptFiles`, `image`, and `ref` are resolved relative to the batch file's directory. `jobs` is optional (overridden by CLI `--jobs`). Top-level array format (without `jobs` wrapper) is also accepted.163 164## Options165 166| Option | Description |167|--------|-------------|168| `--prompt <text>`, `-p` | Prompt text |169| `--promptfiles <files...>` | Read prompt from files (concatenated) |170| `--image <path>` | Output image path (required in single-image mode) |171| `--batchfile <path>` | JSON batch file for multi-image generation |172| `--jobs <count>` | Worker count for batch mode (default: auto, max from config, built-in default 10) |173| `--provider google\|openai\|azure\|openrouter\|dashscope\|zai\|minimax\|jimeng\|seedream\|replicate` | Force provider (default: auto-detect) |174| `--model <id>`, `-m` | Model ID (Google: `gemini-3-pro-image-preview`; OpenAI: `gpt-image-1.5`; Azure: deployment name such as `gpt-image-1.5` or `image-prod`; OpenRouter: `google/gemini-3.1-flash-image-preview`; DashScope: `qwen-image-2.0-pro`; Z.AI: `glm-image`; MiniMax: `image-01`) |175| `--ar <ratio>` | Aspect ratio (e.g., `16:9`, `1:1`, `4:3`) |176| `--size <WxH>` | Size (e.g., `1024x1024`) |177| `--quality normal\|2k` | Quality preset (default: `2k`) |178| `--imageSize 1K\|2K\|4K` | Image size for Google/OpenRouter (default: from quality) |179| `--imageApiDialect openai-native\|ratio-metadata` | OpenAI-compatible image API dialect. Use `ratio-metadata` when the endpoint is OpenAI-compatible but expects aspect-ratio `size` plus `metadata.resolution` instead of pixel `size` |180| `--ref <files...>` | Reference images. Supported by Google multimodal, OpenAI GPT Image edits, Azure OpenAI edits (PNG/JPG only), OpenRouter multimodal models, Replicate supported families, MiniMax subject-reference, and Seedream 5.0/4.5/4.0. Not supported by Jimeng, Seedream 3.0, or removed SeedEdit 3.0 |181| `--n <count>` | Number of images. Replicate currently supports only `--n 1` because this path saves exactly one output image |182| `--json` | JSON output |183 184## Environment Variables185 186| Variable | Description |187|----------|-------------|188| `OPENAI_API_KEY` | OpenAI API key |189| `AZURE_OPENAI_API_KEY` | Azure OpenAI API key |190| `OPENROUTER_API_KEY` | OpenRouter API key |191| `GOOGLE_API_KEY` | Google API key |192| `DASHSCOPE_API_KEY` | DashScope API key (阿里云) |193| `ZAI_API_KEY` | Z.AI API key |194| `BIGMODEL_API_KEY` | Backward-compatible alias for Z.AI API key |195| `MINIMAX_API_KEY` | MiniMax API key |196| `REPLICATE_API_TOKEN` | Replicate API token |197| `JIMENG_ACCESS_KEY_ID` | Jimeng (即梦) Volcengine access key |198| `JIMENG_SECRET_ACCESS_KEY` | Jimeng (即梦) Volcengine secret key |199| `ARK_API_KEY` | Seedream (豆包) Volcengine ARK API key |200| `OPENAI_IMAGE_MODEL` | OpenAI model override |201| `AZURE_OPENAI_DEPLOYMENT` | Azure default deployment name |202| `AZURE_OPENAI_IMAGE_MODEL` | Backward-compatible alias for Azure default deployment/model name |203| `OPENROUTER_IMAGE_MODEL` | OpenRouter model override (default: `google/gemini-3.1-flash-image-preview`) |204| `GOOGLE_IMAGE_MODEL` | Google model override |205| `DASHSCOPE_IMAGE_MODEL` | DashScope model override (default: `qwen-image-2.0-pro`) |206| `ZAI_IMAGE_MODEL` | Z.AI model override (default: `glm-image`) |207| `BIGMODEL_IMAGE_MODEL` | Backward-compatible alias for Z.AI model override |208| `MINIMAX_IMAGE_MODEL` | MiniMax model override (default: `image-01`) |209| `REPLICATE_IMAGE_MODEL` | Replicate model override (default: google/nano-banana-2) |210| `JIMENG_IMAGE_MODEL` | Jimeng model override (default: jimeng_t2i_v40) |211| `SEEDREAM_IMAGE_MODEL` | Seedream model override (default: doubao-seedream-5-0-260128) |212| `OPENAI_BASE_URL` | Custom OpenAI endpoint |213| `OPENAI_IMAGE_API_DIALECT` | OpenAI-compatible image API dialect override (`openai-native` or `ratio-metadata`) |214| `AZURE_OPENAI_BASE_URL` | Azure resource endpoint or deployment endpoint |215| `AZURE_API_VERSION` | Azure image API version (default: `2025-04-01-preview`) |216| `OPENROUTER_BASE_URL` | Custom OpenRouter endpoint (default: `https://openrouter.ai/api/v1`) |217| `OPENROUTER_HTTP_REFERER` | Optional app/site URL for OpenRouter attribution |218| `OPENROUTER_TITLE` | Optional app name for OpenRouter attribution |219| `GOOGLE_BASE_URL` | Custom Google endpoint |220| `DASHSCOPE_BASE_URL` | Custom DashScope endpoint |221| `ZAI_BASE_URL` | Custom Z.AI endpoint (default: `https://api.z.ai/api/paas/v4`) |222| `BIGMODEL_BASE_URL` | Backward-compatible alias for Z.AI endpoint |223| `MINIMAX_BASE_URL` | Custom MiniMax endpoint (default: `https://api.minimax.io`) |224| `REPLICATE_BASE_URL` | Custom Replicate endpoint |225| `JIMENG_BASE_URL` | Custom Jimeng endpoint (default: `https://visual.volcengineapi.com`) |226| `JIMENG_REGION` | Jimeng region (default: `cn-north-1`) |227| `SEEDREAM_BASE_URL` | Custom Seedream endpoint (default: `https://ark.cn-beijing.volces.com/api/v3`) |228| `BAOYU_IMAGE_GEN_MAX_WORKERS` | Override batch worker cap |229| `BAOYU_IMAGE_GEN_<PROVIDER>_CONCURRENCY` | Override provider concurrency, e.g. `BAOYU_IMAGE_GEN_REPLICATE_CONCURRENCY` |230| `BAOYU_IMAGE_GEN_<PROVIDER>_START_INTERVAL_MS` | Override provider start gap, e.g. `BAOYU_IMAGE_GEN_REPLICATE_START_INTERVAL_MS` |231 232**Load Priority**: CLI args > EXTEND.md > env vars > `<cwd>/.baoyu-skills/.env` > `~/.baoyu-skills/.env`233 234## Model Resolution235 236Model priority (highest → lowest), applies to all providers:237 2381. CLI flag: `--model <id>`2392. EXTEND.md: `default_model.[provider]`2403. Env var: `<PROVIDER>_IMAGE_MODEL` (e.g., `GOOGLE_IMAGE_MODEL`)2414. Built-in default242 243For Azure, `--model` / `default_model.azure` should be the Azure deployment name. `AZURE_OPENAI_DEPLOYMENT` is the preferred env var, and `AZURE_OPENAI_IMAGE_MODEL` remains as a backward-compatible alias.244 245**EXTEND.md overrides env vars**. If both EXTEND.md `default_model.google: "gemini-3-pro-image-preview"` and env var `GOOGLE_IMAGE_MODEL=gemini-3.1-flash-image-preview` exist, EXTEND.md wins.246 247### OpenAI-Compatible Gateway Dialects248 249`provider=openai` means the auth and routing entrypoint is OpenAI-compatible. It does **not** guarantee that the upstream image API uses OpenAI native image-request semantics.250 251Use `default_image_api_dialect` in `EXTEND.md`, `OPENAI_IMAGE_API_DIALECT`, or `--imageApiDialect` when the endpoint expects a different wire format:252 253- `openai-native`: Sends pixel `size` such as `1536x1024` and native OpenAI quality fields when supported254- `ratio-metadata`: Sends aspect-ratio `size` such as `16:9` and maps quality/size intent into `metadata.resolution` (`1K|2K|4K`) plus `metadata.orientation`255 256Recommended use:257 258- OpenAI native Images API or strict clones: keep `openai-native`259- OpenAI-compatible gateways in front of Gemini or similar models: try `ratio-metadata`260 261Current limitation: `ratio-metadata` only applies to text-to-image generation. Reference-image edit flows still require `openai-native` or another provider with first-class edit support.262 263**Agent MUST display model info** before each generation:264- Show: `Using [provider] / [model]`265- Show switch hint: `Switch model: --model <id> | EXTEND.md default_model.[provider] | env <PROVIDER>_IMAGE_MODEL`266 267### DashScope Models268 269Use `--model qwen-image-2.0-pro` or set `default_model.dashscope` / `DASHSCOPE_IMAGE_MODEL` when the user wants official Qwen-Image behavior.270 271Official DashScope model families:272 273- `qwen-image-2.0-pro`, `qwen-image-2.0-pro-2026-03-03`, `qwen-image-2.0`, `qwen-image-2.0-2026-03-03`274  - Free-form `size` in `宽*高` format275  - Total pixels must stay between `512*512` and `2048*2048`276  - Default size is approximately `1024*1024`277  - Best choice for custom ratios such as `21:9` and text-heavy Chinese/English layouts278- `qwen-image-max`, `qwen-image-max-2025-12-30`, `qwen-image-plus`, `qwen-image-plus-2026-01-09`, `qwen-image`279  - Fixed sizes only: `1664*928`, `1472*1104`, `1328*1328`, `1104*1472`, `928*1664`280  - Default size is `1664*928`281  - `qwen-image` currently has the same capability as `qwen-image-plus`282- Legacy DashScope models such as `z-image-turbo`, `z-image-ultra`, `wanx-v1`283  - Keep using them only when the user explicitly asks for legacy behavior or compatibility284 285When translating CLI args into DashScope behavior:286 287- `--size` wins over `--ar`288- For `qwen-image-2.0*`, prefer explicit `--size`; otherwise infer from `--ar` and use the official recommended resolutions below289- For `qwen-image-max/plus/image`, only use the five official fixed sizes; if the requested ratio is not covered, switch to `qwen-image-2.0-pro`290- `--quality` is a baoyu-imagine compatibility preset, not a native DashScope API field. Mapping `normal` / `2k` onto the `qwen-image-2.0*` table below is an implementation inference, not an official API guarantee291 292Recommended `qwen-image-2.0*` sizes for common aspect ratios:293 294| Ratio | `normal` | `2k` |295|-------|----------|------|296| `1:1` | `1024*1024` | `1536*1536` |297| `2:3` | `768*1152` | `1024*1536` |298| `3:2` | `1152*768` | `1536*1024` |299| `3:4` | `960*1280` | `1080*1440` |300| `4:3` | `1280*960` | `1440*1080` |301| `9:16` | `720*1280` | `1080*1920` |302| `16:9` | `1280*720` | `1920*1080` |303| `21:9` | `1344*576` | `2048*872` |304 305DashScope official APIs also expose `negative_prompt`, `prompt_extend`, and `watermark`, but `baoyu-imagine` does not expose them as dedicated CLI flags today.306 307Official references:308 309- [Qwen-Image API](https://help.aliyun.com/zh/model-studio/qwen-image-api)310- [Text-to-image guide](https://help.aliyun.com/zh/model-studio/text-to-image)311- [Qwen-Image Edit API](https://help.aliyun.com/zh/model-studio/qwen-image-edit-api)312 313### Z.AI Models314 315Use `--model glm-image` or set `default_model.zai` / `ZAI_IMAGE_MODEL` when the user wants GLM-image output.316 317Official Z.AI image model options currently documented in the sync image API:318 319- `glm-image` (recommended default)320  - Text-to-image only in `baoyu-imagine`321  - Native `quality` options are `hd` and `standard`; this skill maps `2k -> hd` and `normal -> standard`322  - Recommended sizes: `1280x1280`, `1568x1056`, `1056x1568`, `1472x1088`, `1088x1472`, `1728x960`, `960x1728`323  - Custom `--size` requires width and height between `1024` and `2048`, divisible by `32`, with total pixels <= `2^22`324- `cogview-4-250304`325  - Legacy Z.AI image model family exposed by the same endpoint326  - Custom `--size` requires width and height between `512` and `2048`, divisible by `16`, with total pixels <= `2^21`327 328Notes:329 330- The official sync API returns a temporary image URL; `baoyu-imagine` downloads that URL and writes the image locally331- `--ref` is not supported for Z.AI in this skill yet332- The sync API currently returns a single image, so `--n > 1` is rejected333 334Official references:335 336- [GLM-Image Guide](https://docs.z.ai/guides/image/glm-image)337- [Generate Image API](https://docs.z.ai/api-reference/image/generate-image)338 339### MiniMax Models340 341Use `--model image-01` or set `default_model.minimax` / `MINIMAX_IMAGE_MODEL` when the user wants MiniMax image generation.342 343Official MiniMax image model options currently documented in the API reference:344 345- `image-01` (recommended default)346  - Supports text-to-image and subject-reference image generation347  - Supports official `aspect_ratio` values: `1:1`, `16:9`, `4:3`, `3:2`, `2:3`, `3:4`, `9:16`, `21:9`348  - Supports documented custom `width` / `height` output sizes when using `--size <WxH>`349  - `width` and `height` must both be between `512` and `2048`, and both must be divisible by `8`350- `image-01-live`351  - Lower-latency variant352  - Use `--ar` for sizing; MiniMax documents custom `width` / `height` as only effective for `image-01`353 354MiniMax subject reference notes:355 356- `--ref` files are sent as MiniMax `subject_reference`357- MiniMax docs currently describe `subject_reference[].type` as `character`358- Official docs say `image_file` supports public URLs or Base64 Data URLs; `baoyu-imagine` sends local refs as Data URLs359- Official docs recommend front-facing portrait references in JPG/JPEG/PNG under 10MB360 361Official references:362 363- [MiniMax Image Generation Guide](https://platform.minimax.io/docs/guides/image-generation)364- [MiniMax Text-to-Image API](https://platform.minimax.io/docs/api-reference/image-generation-t2i)365- [MiniMax Image-to-Image API](https://platform.minimax.io/docs/api-reference/image-generation-i2i)366 367### OpenRouter Models368 369Use full OpenRouter model IDs, e.g.:370 371- `google/gemini-3.1-flash-image-preview` (recommended, supports image output and reference-image workflows)372- `google/gemini-2.5-flash-image-preview`373- `black-forest-labs/flux.2-pro`374- Other OpenRouter image-capable model IDs375 376Notes:377 378- OpenRouter image generation uses `/chat/completions`, not the OpenAI `/images` endpoints379- If `--ref` is used, choose a multimodal model that supports image input and image output380- `--imageSize` maps to OpenRouter `imageGenerationOptions.size`; `--size <WxH>` is converted to the nearest OpenRouter size and inferred aspect ratio when possible381 382### Replicate Models383 384Replicate support in `baoyu-imagine` is intentionally scoped to the model families that the tool can validate locally and save without dropping outputs:385 386- `google/nano-banana*` (default: `google/nano-banana-2`)387  - Supports prompt-only and reference-image generation388  - Uses Replicate `aspect_ratio`, `resolution`, and `output_format`389  - `--size <WxH>` is accepted only as a shorthand for a documented aspect ratio plus `1K` / `2K`390- `bytedance/seedream-4.5`391  - Supports prompt-only and reference-image generation392  - Uses Replicate `size`, `aspect_ratio`, and `image_input`393  - Local validation blocks unsupported `1K` requests before the API call394- `bytedance/seedream-5-lite`395  - Supports prompt-only and reference-image generation396  - Uses Replicate `size`, `aspect_ratio`, and `image_input`397  - Local validation currently accepts `2K` / `3K` only398- `wan-video/wan-2.7-image`399  - Supports prompt-only and reference-image generation400  - Uses Replicate `size` and `images`401  - Max output size is 2K402- `wan-video/wan-2.7-image-pro`403  - Supports prompt-only and reference-image generation404  - Uses Replicate `size` and `images`405  - 4K is allowed only for text-to-image; local validation blocks `4K + --ref`406 407Guardrails:408 409- Replicate currently supports only single-output save semantics in this tool. Keep `--n 1`.410- If a Replicate model is outside the compatibility list above, `baoyu-imagine` only treats it as prompt-only and rejects advanced local options instead of guessing a nano-banana-style schema.411 412Examples:413 414```bash415# Use Replicate default model416${BUN_X} {baseDir}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate417 418# Override model explicitly419${BUN_X} {baseDir}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate --model google/nano-banana420```421 422## Provider Selection423 4241. `--ref` provided + no `--provider` → auto-select Google first, then OpenAI, then Azure, then OpenRouter, then Replicate, then Seedream, then MiniMax (MiniMax subject reference is more specialized toward character/portrait consistency)4252. `--provider` specified → use it (if `--ref`, must be `google`, `openai`, `azure`, `openrouter`, `replicate`, `seedream`, or `minimax`)4263. Only one API key available → use that provider4274. Multiple available → default to Google, then OpenAI, Azure, OpenRouter, DashScope, Z.AI, MiniMax, Replicate, Jimeng, Seedream428 429## Quality Presets430 431| Preset | Google imageSize | OpenAI Size | OpenRouter size | Replicate resolution | Use Case |432|--------|------------------|-------------|-----------------|----------------------|----------|433| `normal` | 1K | 1024px | 1K | 1K | Quick previews |434| `2k` (default) | 2K | 2048px | 2K | 2K | Covers, illustrations, infographics |435 436**Google/OpenRouter imageSize**: Can be overridden with `--imageSize 1K|2K|4K`437 438## Aspect Ratios439 440Supported: `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `2.35:1`441 442- Google multimodal: uses `imageConfig.aspectRatio`443- OpenAI: maps to closest supported size444- OpenRouter: sends `imageGenerationOptions.aspect_ratio`; if only `--size <WxH>` is given, aspect ratio is inferred automatically445- Replicate: behavior is model-family-specific. `google/nano-banana*` uses `aspect_ratio`; `bytedance/seedream-*` uses documented Replicate aspect ratios; Wan 2.7 maps `--ar` to a concrete `size`446- MiniMax: sends official `aspect_ratio` values directly; if `--size <WxH>` is given without `--ar`, `width` / `height` are sent for `image-01`447 448## Generation Mode449 450**Default**: Sequential generation.451 452**Batch Parallel Generation**: When `--batchfile` contains 2 or more pending tasks, the script automatically enables parallel generation.453 454| Mode | When to Use |455|------|-------------|456| Sequential (default) | Normal usage, single images, small batches |457| Parallel batch | Batch mode with 2+ tasks |458 459Execution choice:460 461| Situation | Preferred approach | Why |462|-----------|--------------------|-----|463| One image, or 1-2 simple images | Sequential | Lower coordination overhead and easier debugging |464| Multiple images already have saved prompt files | Batch (`--batchfile`) | Reuses finalized prompts, applies shared throttling/retries, and gives predictable throughput |465| Each image still needs separate reasoning, prompt writing, or style exploration | Subagents | The work is still exploratory, so each image may need independent analysis before generation |466| Output comes from `baoyu-article-illustrator` with `outline.md` + `prompts/` | Batch (`build-batch.ts` -> `--batchfile`) | That workflow already produces prompt files, so direct batch execution is the intended path |467 468Rule of thumb:469 470- Prefer batch over subagents once prompt files are already saved and the task is "generate all of these"471- Use subagents only when generation is coupled with per-image thinking, rewriting, or divergent creative exploration472 473Parallel behavior:474 475- Default worker count is automatic, capped by config, built-in default 10476- Provider-specific throttling is applied only in batch mode, and the built-in defaults are tuned for faster throughput while still avoiding obvious RPM bursts477- You can override worker count with `--jobs <count>`478- Each image retries automatically up to 3 attempts479- Final output includes success count, failure count, and per-image failure reasons480 481## Error Handling482 483- Missing API key → error with setup instructions484- Generation failure → auto-retry up to 3 attempts per image485- Invalid aspect ratio → warning, proceed with default486- Reference images with unsupported provider/model → error with fix hint487 488## Extension Support489 490Custom configurations via EXTEND.md. See **Preferences** section for paths and supported options.
Related skills
Baoyu Article Illustrator

Baoyu-article-illustrator analyzes article content and automatically identifies positions where visual aids would enhance understanding, then generates illustra
Baoyu Comic

baoyu-comic generates original educational comics from markdown content with customizable art styles (ligne-claire, manga, realistic, ink-brush, chalk, minimali
Baoyu Compress Image

Baoyu-compress-image compresses images to WebP or PNG format using the best available system tool (sips, cwebp, ImageMagick, or Sharp) selected based on what's