Install

Terminal · npx

$npx skills add https://github.com/am-will/codex-skills --skill gemini-computer-use

Works with Paperclip

How Gemini Computer Use fits into a Paperclip company.

Gemini Computer Use drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

SaaS FactoryPaired

Pre-configured AI company — 18 agents, 18 skills, one-time purchase.

$27$59

Explore pack

Source file

SKILL.md63 linesmarkdown

Expand

1---2name: gemini-computer-use3description: Build and run Gemini 2.5 Computer Use browser-control agents with Playwright. Use when a user wants to automate web browser tasks via the Gemini Computer Use model, needs an agent loop (screenshot → function_call → action → function_response), or asks to integrate safety confirmation for risky UI actions.4---5 6# Gemini Computer Use7 8## Quick start9 101. Source the env file and set your API key:11 12   ```bash13   cp env.example env.sh14   $EDITOR env.sh15   source env.sh16   ```17 182. Create a virtual environment and install dependencies:19 20   ```bash21   python -m venv .venv22   source .venv/bin/activate23   pip install google-genai playwright24   playwright install chromium25   ```26 273. Run the agent script with a prompt:28 29   ```bash30   python scripts/computer_use_agent.py \31     --prompt "Find the latest blog post title on example.com" \32     --start-url "https://example.com" \33     --turn-limit 634   ```35 36## Browser selection37 38- Default: Playwright's bundled Chromium (no env vars required).39- Choose a channel (Chrome/Edge) with `COMPUTER_USE_BROWSER_CHANNEL`.40- Use a custom Chromium-based executable (e.g., Brave) with `COMPUTER_USE_BROWSER_EXECUTABLE`.41 42If both are set, `COMPUTER_USE_BROWSER_EXECUTABLE` takes precedence.43 44## Core workflow (agent loop)45 461. Capture a screenshot and send the user goal + screenshot to the model.472. Parse `function_call` actions in the response.483. Execute each action in Playwright.494. If a `safety_decision` is `require_confirmation`, prompt the user before executing.505. Send `function_response` objects containing the latest URL + screenshot.516. Repeat until the model returns only text (no actions) or you hit the turn limit.52 53## Operational guidance54 55- Run in a sandboxed browser profile or container.56- Use `--exclude` to block risky actions you do not want the model to take.57- Keep the viewport at 1440x900 unless you have a reason to change it.58 59## Resources60 61- Script: `scripts/computer_use_agent.py`62- Reference notes: `references/google-computer-use.md`63- Env template: `env.example`

Related skills

Llm Council

Install Llm Council skill for Claude Code from am-will/codex-skills.

Markdown Url

Install Markdown Url skill for Claude Code from am-will/codex-skills.

Openai Docs Skill

The openai-docs-skill enables Claude to query OpenAI's official developer documentation through an MCP server via CLI commands (search, fetch, list) to retrieve