Claude Agent Skill · by Aradotso

MetaClaw Evolving Agent

Install the MetaClaw Evolving Agent skill for Claude Code from aradotso/trending-skills.

Install
Terminal · npx
$ npx skills add https://github.com/aradotso/trending-skills --skill metaclaw-evolving-agent
Works with Paperclip

How MetaClaw Evolving Agent fits into a Paperclip company.

MetaClaw Evolving Agent drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

Source file
SKILL.md · 410 lines
---
name: metaclaw-evolving-agent
description: Deploy and configure MetaClaw — an agent that meta-learns and evolves from live conversations using skills injection, RL training, and smart scheduling.
triggers:
  - set up metaclaw agent
  - configure evolving agent
  - metaclaw skills mode
  - metaclaw rl training
  - metaclaw madmax scheduler
  - agent meta-learning setup
  - tinker rl backend configuration
  - metaclaw proxy deployment
---

# MetaClaw Evolving Agent

> Skill by [ara.so](https://ara.so) — Daily 2026 Skills collection

MetaClaw is an OpenAI-compatible proxy agent that intercepts conversations, injects learned skills, and continuously improves itself through real-world interactions. It supports three modes: lightweight skills injection, immediate RL training, and a smart "madmax" scheduler that defers weight updates to idle/sleep windows.

---

## Installation

```bash
# Minimal — skills injection only, no GPU required
pip install -e .

# Full RL training support (torch, transformers, tinker)
pip install -e ".[rl]"

# Skill evolution via LLM summarization
pip install -e ".[evolve]"

# Google Calendar scheduler for madmax mode
pip install -e ".[scheduler]"

# Recommended: everything
pip install -e ".[rl,evolve,scheduler]"
```

---

## Quick Start

```bash
# One-time interactive config wizard
metaclaw setup

# Start in default madmax mode (skills + RL + smart scheduler)
metaclaw start

# Skills only — no GPU, no Tinker needed
metaclaw start --mode skills_only

# RL mode — trains immediately when batch is full
metaclaw start --mode rl
```

After `metaclaw start`, a local OpenAI-compatible proxy is running. Point your client (OpenClaw or any OpenAI SDK consumer) at `http://localhost:<port>` instead of the upstream LLM endpoint.

---

## Configuration

`metaclaw setup` writes a config file (default: `~/.metaclaw/config.yaml`).
You can also edit it directly:

```yaml
# ~/.metaclaw/config.yaml

proxy:
  host: 0.0.0.0
  port: 8080

llm:
  provider: kimi          # kimi | qwen | claude | minimax | openai | gemini
  base_url: https://api.moonshot.cn/v1
  model: moonshot-v1-8k
  # api_key loaded from env: METACLAW_LLM_API_KEY

skills:
  enabled: true
  max_injected: 5         # max skills injected per turn
  summarize_after_session: true

rl:
  enabled: true
  backend: auto           # auto | tinker | mint
  batch_size: 32
  algorithm: grpo
  opd_teacher: false      # optional teacher distillation

scheduler:                # madmax mode only
  enabled: true
  sleep_hours: [22, 7]    # local 22:00–07:00
  idle_timeout_minutes: 15
  google_calendar: false  # set true + configure OAuth for meeting detection

logging:
  level: info
  log_dir: ~/.metaclaw/logs
```

### Environment Variables

```bash
export METACLAW_LLM_API_KEY="your-llm-api-key"
export METACLAW_TINKER_API_KEY="your-tinker-api-key"    # rl mode
export METACLAW_MINT_API_KEY="your-mint-api-key"        # if backend=mint
export GOOGLE_CALENDAR_CREDENTIALS_PATH="path/to/creds.json"  # scheduler
```

---

## Operating Modes

| Mode | Command | GPU Required | Description |
|------|---------|--------------|-------------|
| `skills_only` | `metaclaw start --mode skills_only` | No | Proxy + skills injection + auto-summarization |
| `rl` | `metaclaw start --mode rl` | Via API | Skills + GRPO training when batch fills |
| `madmax` | `metaclaw start` | Via API | Skills + RL + scheduler (trains only during idle/sleep/meetings) |

---

## Python API

### Programmatic startup

```python
import asyncio

from metaclaw import MetaClawAgent, AgentConfig, Mode

async def main():
    config = AgentConfig.from_yaml("~/.metaclaw/config.yaml")
    agent = MetaClawAgent(config, mode=Mode.MADMAX)
    await agent.start()

asyncio.run(main())
```

### Manual skill injection

```python
from metaclaw.skills import SkillStore, SkillInjector

store = SkillStore(path="~/.metaclaw/skills")

# Add a skill manually
store.add(
    name="code-review-checklist",
    content="Always check for: 1) error handling, 2) type hints, 3) docstrings.",
    tags=["code", "review"]
)

# Retrieve top-k relevant skills for a query
injector = SkillInjector(store)
relevant = injector.retrieve(query="review my Python function", top_k=3)
for skill in relevant:
    print(skill.name, skill.score)
```

### Intercepting and recording conversations

```python
from metaclaw.proxy import ConversationInterceptor
from metaclaw.memory import ExperienceBuffer

buffer = ExperienceBuffer(max_size=1000)

interceptor = ConversationInterceptor(
    upstream_url="https://api.moonshot.cn/v1",
    on_complete=buffer.record   # called after each turn with (messages, response)
)

# buffer.record signature:
async def on_complete(messages: list[dict], response: dict) -> None:
    ...
```

### Triggering RL training manually

```python
from metaclaw.training import RLTrainer, TrainingConfig

trainer = RLTrainer(
    config=TrainingConfig(
        backend="tinker",       # or "mint"
        algorithm="grpo",
        batch_size=32,
        lora_rank=16,
    )
)

# Collect a batch from the experience buffer and train
async def run_training(buffer):
    batch = buffer.sample(n=32, split="support")   # support/query separation
    result = await trainer.train(batch)
    print(f"Training complete. Loss: {result.loss:.4f}, Steps: {result.steps}")
```

### Reward modeling

```python
from metaclaw.rewards import RewardModel

reward_model = RewardModel(provider="llm")  # uses configured LLM for scoring

async def score_turn(prompt: str, response: str) -> float:
    score = await reward_model.score(prompt=prompt, response=response)
    return score  # float in [-1.0, 1.0]
```

---

## Skills Lifecycle

```
Conversation turn
       │
SkillInjector.retrieve()   ← vector search over SkillStore
       │  injects top-k skills into system prompt
LLM responds
       │
ExperienceBuffer.record()  ← stores (context, response, metadata)
       ▼ (end of session)
SkillSummarizer.run()      ← LLM extracts reusable patterns
       │
SkillStore.upsert()        ← new/updated skills persisted to disk
```

---

## Integration: OpenAI SDK as Client

Point any OpenAI SDK client at the MetaClaw proxy:

```python
from openai import OpenAI

# MetaClaw proxy is running on localhost:8080
client = OpenAI(
    base_url="http://localhost:8080/v1",
    api_key="not-used-but-required-by-sdk"
)

response = client.chat.completions.create(
    model="moonshot-v1-8k",   # passed through to upstream
    messages=[
        {"role": "user", "content": "Review my pull request strategy."}
    ]
)
print(response.choices[0].message.content)
```

Skills are injected transparently — the client code does not change.

---

## Scheduler (MadMax Mode)

The scheduler ensures RL weight updates never interrupt active use:

```python
from metaclaw.scheduler import MadMaxScheduler, SchedulerConfig

scheduler = MadMaxScheduler(
    config=SchedulerConfig(
        sleep_hours=(22, 7),          # train between 22:00–07:00 local time
        idle_timeout_minutes=15,      # train after 15 min of no conversations
        google_calendar=True,         # also train during calendar meetings
        credentials_path="creds.json"
    )
)

# Check if it's safe to train right now
if await scheduler.is_training_window():
    await trainer.train(batch)
```

### Google Calendar Setup

```bash
# 1. Enable Google Calendar API in Google Cloud Console
# 2. Download OAuth2 credentials as creds.json
# 3. Set path in config or env
export GOOGLE_CALENDAR_CREDENTIALS_PATH="/path/to/creds.json"

# 4. First run will open browser for OAuth consent
metaclaw start
```

---

## Support/Query Set Separation

MetaClaw separates experience into support and query sets to prevent stale rewards from polluting updates:

```python
from metaclaw.memory import ExperienceBuffer

buffer = ExperienceBuffer(
    max_size=2000,
    support_ratio=0.5   # 50% support, 50% query
)

# During training:
support_batch = buffer.sample(n=16, split="support")  # used to compute reward signal
query_batch   = buffer.sample(n=16, split="query")    # used for gradient update

await trainer.train_meta(support=support_batch, query=query_batch)
```

---

## RL Backends

### Tinker (default)

```yaml
rl:
  backend: tinker
  tinker_project: my-metaclaw-project
  lora_rank: 16
  learning_rate: 1e-4
```

### MinT

```bash
# Install MinT compatibility layer separately
pip install metaclaw-mint
```

```yaml
rl:
  backend: mint
  mint_endpoint: https://your-mint-endpoint
```

### Auto-detection

```yaml
rl:
  backend: auto   # tries tinker first, falls back to mint, errors if neither available
```

---

## Troubleshooting

**Proxy not reachable after `metaclaw start`**
- Check port conflicts: `lsof -i :8080`
- Change `proxy.port` in config and restart

**`rl` mode: "No training backend available"**
- Ensure `pip install -e ".[rl]"` completed successfully
- Verify `METACLAW_TINKER_API_KEY` or `METACLAW_MINT_API_KEY` is set
- Try `rl.backend: tinker` explicitly instead of `auto`

**Skills not persisting between sessions**
- Confirm `skills.summarize_after_session: true` in config
- Check write permissions on `~/.metaclaw/skills/`
- Run `metaclaw skills list` to inspect stored skills

**Madmax mode never trains**
- Verify `scheduler.sleep_hours` covers your timezone's night
- Lower `scheduler.idle_timeout_minutes` for testing (e.g., `1`)
- Check scheduler logs:
  `~/.metaclaw/logs/scheduler.log`

**Google Calendar integration fails**
- Re-run OAuth flow: delete `~/.metaclaw/token.json` and restart
- Ensure Calendar API is enabled in your Google Cloud project

**OPD teacher distillation errors**
- Only supported with `rl.backend: tinker`
- Requires a separate teacher model endpoint in config:

  ```yaml
  rl:
    opd_teacher: true
    teacher_base_url: https://api.openai.com/v1
    teacher_model: gpt-4o
  ```

---

## CLI Reference

```bash
metaclaw setup                    # interactive config wizard

metaclaw start                    # start in madmax mode
metaclaw start --mode skills_only
metaclaw start --mode rl
metaclaw start --config path/to/config.yaml

metaclaw skills list              # show all stored skills
metaclaw skills delete <name>     # remove a skill
metaclaw skills export skills.json

metaclaw status                   # show proxy, scheduler, training status
metaclaw logs                     # tail all logs
metaclaw logs --component scheduler
```
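For intuition, the gating rule described in the Scheduler (MadMax Mode) section — train only during the configured sleep window or after an idle timeout — can be sketched in standalone Python. This is a toy illustration of the decision logic, not the real `MadMaxScheduler`; the function bodies below are invented for the example, though `sleep_hours` and `idle_timeout_minutes` mirror the documented config keys.

```python
from datetime import datetime, timedelta

def in_sleep_window(now: datetime, sleep_hours: tuple[int, int]) -> bool:
    """True if `now` falls inside an hour range that may wrap past midnight."""
    start, end = sleep_hours
    if start <= end:
        return start <= now.hour < end
    # Wrapping range, e.g. (22, 7) covers 22:00–23:59 and 00:00–06:59
    return now.hour >= start or now.hour < end

def is_training_window(now: datetime,
                       last_activity: datetime,
                       sleep_hours: tuple[int, int] = (22, 7),
                       idle_timeout_minutes: int = 15) -> bool:
    """Safe to train if the user is asleep or has been idle long enough."""
    idle = now - last_activity >= timedelta(minutes=idle_timeout_minutes)
    return in_sleep_window(now, sleep_hours) or idle
```

With `sleep_hours=(22, 7)`, both 23:30 and 03:00 count as sleep time, while at noon training only proceeds once the idle timeout has elapsed — the wrap-around branch is why "Madmax mode never trains" is usually a timezone or sleep-window misconfiguration.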
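The `SkillInjector.retrieve()` step from the Skills Lifecycle section can be illustrated with a toy bag-of-words similarity search. This is only a stand-in for intuition — MetaClaw's store does vector search over embeddings, and none of the helper names below are part of its API.

```python
from collections import Counter
from math import sqrt

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(skills: dict[str, str], query: str, top_k: int = 3) -> list[tuple[str, float]]:
    """Rank stored skills by similarity to the query and keep the top k."""
    q = Counter(query.lower().split())
    scored = [(name, cosine(Counter(text.lower().split()), q))
              for name, text in skills.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]

skills = {
    "code-review-checklist": "review python code for error handling and type hints",
    "sql-tuning": "optimize slow sql queries with indexes",
}
print(retrieve(skills, "review my python function", top_k=1)[0][0])
# → code-review-checklist
```

The top-k results are what would be injected into the system prompt before the request is forwarded upstream.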