Install

Terminal · npx

$npx skills add https://github.com/affaan-m/everything-claude-code --skill foundation-models-on-device

Works with Paperclip

How Foundation Models On Device fits into a Paperclip company.

Foundation Models On Device drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

SaaS FactoryPaired

Pre-configured AI company — 18 agents, 18 skills, one-time purchase.

$27$59

Explore pack

Source file

SKILL.md243 linesmarkdown

Expand

1---2name: foundation-models-on-device3description: Apple FoundationModels framework for on-device LLM — text generation, guided generation with @Generable, tool calling, and snapshot streaming in iOS 26+.4---5 6# FoundationModels: On-Device LLM (iOS 26)7 8Patterns for integrating Apple's on-device language model into apps using the FoundationModels framework. Covers text generation, structured output with `@Generable`, custom tool calling, and snapshot streaming — all running on-device for privacy and offline support.9 10## When to Activate11 12- Building AI-powered features using Apple Intelligence on-device13- Generating or summarizing text without cloud dependency14- Extracting structured data from natural language input15- Implementing custom tool calling for domain-specific AI actions16- Streaming structured responses for real-time UI updates17- Need privacy-preserving AI (no data leaves the device)18 19## Core Pattern — Availability Check20 21Always check model availability before creating a session:22 23```swift24struct GenerativeView: View {25    private var model = SystemLanguageModel.default26 27    var body: some View {28        switch model.availability {29        case .available:30            ContentView()31        case .unavailable(.deviceNotEligible):32            Text("Device not eligible for Apple Intelligence")33        case .unavailable(.appleIntelligenceNotEnabled):34            Text("Please enable Apple Intelligence in Settings")35        case .unavailable(.modelNotReady):36            Text("Model is downloading or not ready")37        case .unavailable(let other):38            Text("Model unavailable: \(other)")39        }40    }41}42```43 44## Core Pattern — Basic Session45 46```swift47// Single-turn: create a new session each time48let session = LanguageModelSession()49let response = try await session.respond(to: "What's a good month to visit Paris?")50print(response.content)51 52// Multi-turn: reuse session for conversation context53let session = LanguageModelSession(instructions: """54    You are a cooking assistant.55    Provide recipe suggestions based on ingredients.56    Keep suggestions brief and practical.57    """)58 59let first = try await session.respond(to: "I have chicken and rice")60let followUp = try await session.respond(to: "What about a vegetarian option?")61```62 63Key points for instructions:64- Define the model's role ("You are a mentor")65- Specify what to do ("Help extract calendar events")66- Set style preferences ("Respond as briefly as possible")67- Add safety measures ("Respond with 'I can't help with that' for dangerous requests")68 69## Core Pattern — Guided Generation with @Generable70 71Generate structured Swift types instead of raw strings:72 73### 1. Define a Generable Type74 75```swift76@Generable(description: "Basic profile information about a cat")77struct CatProfile {78    var name: String79 80    @Guide(description: "The age of the cat", .range(0...20))81    var age: Int82 83    @Guide(description: "A one sentence profile about the cat's personality")84    var profile: String85}86```87 88### 2. Request Structured Output89 90```swift91let response = try await session.respond(92    to: "Generate a cute rescue cat",93    generating: CatProfile.self94)95 96// Access structured fields directly97print("Name: \(response.content.name)")98print("Age: \(response.content.age)")99print("Profile: \(response.content.profile)")100```101 102### Supported @Guide Constraints103 104- `.range(0...20)` — numeric range105- `.count(3)` — array element count106- `description:` — semantic guidance for generation107 108## Core Pattern — Tool Calling109 110Let the model invoke custom code for domain-specific tasks:111 112### 1. Define a Tool113 114```swift115struct RecipeSearchTool: Tool {116    let name = "recipe_search"117    let description = "Search for recipes matching a given term and return a list of results."118 119    @Generable120    struct Arguments {121        var searchTerm: String122        var numberOfResults: Int123    }124 125    func call(arguments: Arguments) async throws -> ToolOutput {126        let recipes = await searchRecipes(127            term: arguments.searchTerm,128            limit: arguments.numberOfResults129        )130        return .string(recipes.map { "- \($0.name): \($0.description)" }.joined(separator: "\n"))131    }132}133```134 135### 2. Create Session with Tools136 137```swift138let session = LanguageModelSession(tools: [RecipeSearchTool()])139let response = try await session.respond(to: "Find me some pasta recipes")140```141 142### 3. Handle Tool Errors143 144```swift145do {146    let answer = try await session.respond(to: "Find a recipe for tomato soup.")147} catch let error as LanguageModelSession.ToolCallError {148    print(error.tool.name)149    if case .databaseIsEmpty = error.underlyingError as? RecipeSearchToolError {150        // Handle specific tool error151    }152}153```154 155## Core Pattern — Snapshot Streaming156 157Stream structured responses for real-time UI with `PartiallyGenerated` types:158 159```swift160@Generable161struct TripIdeas {162    @Guide(description: "Ideas for upcoming trips")163    var ideas: [String]164}165 166let stream = session.streamResponse(167    to: "What are some exciting trip ideas?",168    generating: TripIdeas.self169)170 171for try await partial in stream {172    // partial: TripIdeas.PartiallyGenerated (all properties Optional)173    print(partial)174}175```176 177### SwiftUI Integration178 179```swift180@State private var partialResult: TripIdeas.PartiallyGenerated?181@State private var errorMessage: String?182 183var body: some View {184    List {185        ForEach(partialResult?.ideas ?? [], id: \.self) { idea in186            Text(idea)187        }188    }189    .overlay {190        if let errorMessage { Text(errorMessage).foregroundStyle(.red) }191    }192    .task {193        do {194            let stream = session.streamResponse(to: prompt, generating: TripIdeas.self)195            for try await partial in stream {196                partialResult = partial197            }198        } catch {199            errorMessage = error.localizedDescription200        }201    }202}203```204 205## Key Design Decisions206 207| Decision | Rationale |208|----------|-----------|209| On-device execution | Privacy — no data leaves the device; works offline |210| 4,096 token limit | On-device model constraint; chunk large data across sessions |211| Snapshot streaming (not deltas) | Structured output friendly; each snapshot is a complete partial state |212| `@Generable` macro | Compile-time safety for structured generation; auto-generates `PartiallyGenerated` type |213| Single request per session | `isResponding` prevents concurrent requests; create multiple sessions if needed |214| `response.content` (not `.output`) | Correct API — always access results via `.content` property |215 216## Best Practices217 218- **Always check `model.availability`** before creating a session — handle all unavailability cases219- **Use `instructions`** to guide model behavior — they take priority over prompts220- **Check `isResponding`** before sending a new request — sessions handle one request at a time221- **Access `response.content`** for results — not `.output`222- **Break large inputs into chunks** — 4,096 token limit applies to instructions + prompt + output combined223- **Use `@Generable`** for structured output — stronger guarantees than parsing raw strings224- **Use `GenerationOptions(temperature:)`** to tune creativity (higher = more creative)225- **Monitor with Instruments** — use Xcode Instruments to profile request performance226 227## Anti-Patterns to Avoid228 229- Creating sessions without checking `model.availability` first230- Sending inputs exceeding the 4,096 token context window231- Attempting concurrent requests on a single session232- Using `.output` instead of `.content` to access response data233- Parsing raw string responses when `@Generable` structured output would work234- Building complex multi-step logic in a single prompt — break into multiple focused prompts235- Assuming the model is always available — device eligibility and settings vary236 237## When to Use238 239- On-device text generation for privacy-sensitive apps240- Structured data extraction from user input (forms, natural language commands)241- AI-assisted features that must work offline242- Streaming UI that progressively shows generated content243- Domain-specific AI actions via tool calling (search, compute, lookup)

Related skills

Agent Eval

Install Agent Eval skill for Claude Code from affaan-m/everything-claude-code.

Agent Harness Construction

Install Agent Harness Construction skill for Claude Code from affaan-m/everything-claude-code.

Agent Payment X402

Install Agent Payment X402 skill for Claude Code from affaan-m/everything-claude-code.