How Gemini Image fits into a Paperclip company.

Gemini Image drops into any Paperclip agent that handles image analysis, such as text extraction from screenshots or visual content understanding. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat, with no prompt engineering or tool wiring.


Source file

SKILL.md (212 lines, markdown)


---
name: gemini-image
description: Analyze images using Gemini's vision capabilities. Use for image analysis, text extraction from screenshots, and visual content understanding.
---

# Gemini Image Analysis

Analyze images using Gemini Pro's vision capabilities.

## Prerequisites

```bash
pip install google-generativeai
export GEMINI_API_KEY=your_api_key
```

## CLI Reference

### Basic Image Analysis

```bash
# Analyze an image
gemini -m pro -f /path/to/image.png "Describe this image in detail"

# With specific question
gemini -m pro -f screenshot.png "What error message is shown?"

# Multiple images
gemini -m pro -f image1.png -f image2.png "Compare these two images"
```

## Analysis Operations

### General Description

```bash
gemini -m pro -f image.png "Describe this image comprehensively:
1. Main subject/content
2. Colors and composition
3. Text visible (if any)
4. Context and purpose
5. Notable details"
```

### Extract Text (OCR)

```bash
gemini -m pro -f screenshot.png "Extract all text from this image.
Format as plain text, preserving layout where possible.
Include any text in buttons, labels, or UI elements."
```

### Code from Screenshot

```bash
gemini -m pro -f code-screenshot.png "Extract the code from this screenshot.
Provide as properly formatted code with correct indentation.
Note any parts that are unclear or partially visible."
```

### UI Analysis

```bash
gemini -m pro -f ui-screenshot.png "Analyze this UI:
1. What application/website is this?
2. What page/screen is shown?
3. Main UI elements and their purpose
4. User flow/actions available
5. Any UX issues or suggestions"
```

### Error Analysis

```bash
gemini -m pro -f error-screenshot.png "Analyze this error:
1. What error is shown?
2. What is the likely cause?
3. How to fix it?
4. Any related information visible?"
```

### Diagram Understanding

```bash
gemini -m pro -f diagram.png "Explain this diagram:
1. What type of diagram is this?
2. Main components and their relationships
3. Data/process flow
4. Key takeaways"
```

## Specific Use Cases

### Debug Screenshot

```bash
gemini -m pro -f debug-screen.png "I'm debugging an issue. From this screenshot:
1. What is the current state?
2. What errors or warnings are visible?
3. What should I look at?
4. Suggested next steps"
```

### Compare Before/After

```bash
gemini -m pro -f before.png -f after.png "Compare these before and after images:
1. What changed?
2. Is this an improvement?
3. Any issues in the 'after' version?
4. Anything missing?"
```

### Design Feedback

```bash
gemini -m pro -f design.png "Provide design feedback:
1. Visual hierarchy
2. Color usage
3. Typography
4. Spacing and alignment
5. Accessibility concerns
6. Suggestions for improvement"
```

### Data Extraction

```bash
gemini -m pro -f chart.png "Extract data from this chart:
1. Chart type
2. Data series and values
3. Axes labels and ranges
4. Key trends or insights
5. Output as structured data if possible"
```

### Form Analysis

```bash
gemini -m pro -f form.png "Analyze this form:
1. Form purpose
2. Fields and their types
3. Required vs optional
4. Validation rules visible
5. UX suggestions"
```

## Workflow Patterns

### Screenshot to Issue

```bash
# Capture screenshot (macOS)
screencapture -i /tmp/bug.png

# Analyze and format as issue
gemini -m pro -f /tmp/bug.png "Create a bug report from this screenshot:

## Summary
[One-line description]

## Steps to Reproduce
[Inferred from screenshot]

## Expected Behavior
[What should happen]

## Actual Behavior
[What the screenshot shows]

## Environment
[Any visible system info]"
```

### UI to Code

```bash
gemini -m pro -f ui-design.png "Generate React component code that recreates this UI:
- Use Tailwind CSS for styling
- Make it responsive
- Include proper TypeScript types
- Add appropriate accessibility attributes"
```

### Documentation

```bash
gemini -m pro -f app-screen.png "Write user documentation for this screen:
- What this screen is for
- How to use each feature
- Common tasks
- Tips and notes"
```

## Image Types Supported

- PNG, JPEG, GIF, WebP
- Screenshots
- Photos
- Diagrams and charts
- UI mockups
- Code snippets
- Documents

## Best Practices

1. **Use clear images** - Higher quality = better analysis
2. **Crop to relevant area** - Remove unnecessary context
3. **Ask specific questions** - Vague prompts get vague answers
4. **Provide context** - Tell Gemini what you're looking for
5. **Verify extracted text** - OCR isn't perfect
6. **Multiple angles** - Use multiple images for complex subjects
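
The formats listed under Image Types Supported can be checked before a file is handed to the CLI. The following is a minimal sketch and not part of the skill itself: `is_supported_image` and `analyze_image` are hypothetical helper names, the check looks only at the file extension (an assumption; it does not inspect file contents), and the `gemini -m pro -f` invocation is copied from the examples in SKILL.md.

```bash
#!/usr/bin/env bash

# Hypothetical helper: succeed (exit 0) when the file extension matches one of
# the formats the skill lists (PNG, JPEG, GIF, WebP), fail (exit 1) otherwise.
# Assumption: the extension alone is trusted; file contents are not inspected.
is_supported_image() {
  local name ext
  name="$(basename -- "$1")"
  ext="${name##*.}"
  # A name without a dot has no extension, so the expansion returns it unchanged.
  if [ "$ext" = "$name" ]; then
    return 1
  fi
  # Lowercase with tr so the check also works on older bash versions.
  ext="$(printf '%s' "$ext" | tr '[:upper:]' '[:lower:]')"
  case "$ext" in
    png|jpg|jpeg|gif|webp) return 0 ;;
    *) return 1 ;;
  esac
}

# Hypothetical wrapper: run the CLI call from SKILL.md only for supported files.
# Returns 2 without calling gemini when the extension is not recognized.
analyze_image() {
  local image="$1" prompt="$2"
  if ! is_supported_image "$image"; then
    printf 'unsupported image type: %s\n' "$image" >&2
    return 2
  fi
  gemini -m pro -f "$image" "$prompt"
}
```

With these functions sourced into a shell, `analyze_image screenshot.png "What error message is shown?"` would behave like the plain CLI call, while `analyze_image notes.txt "Describe this"` would stop early with an error message.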
