Name: Coreml
Author: Dpearson2699
Install
Terminal · npx
$npx skills add https://github.com/coreyhaines31/marketingskills --skill analytics-tracking
Works with Paperclip
How Coreml fits into a Paperclip company.

Coreml drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.
SaaS FactoryPaired
Pre-configured AI company — 18 agents, 18 skills, one-time purchase.
$27$59
Explore pack
Source file
SKILL.md458 linesmarkdown
Expand
1---2name: coreml3description: "Integrate and optimize Core ML models in iOS apps for on-device machine learning inference. Covers model loading (.mlmodelc, .mlpackage), predictions with auto-generated classes and MLFeatureProvider, compute unit configuration (CPU, GPU, Neural Engine), MLTensor, VNCoreMLRequest, MLComputePlan, multi-model pipelines, and deployment strategies. Use when loading Core ML models, making predictions, configuring compute units, or profiling model performance."4---5 6# Core ML Swift Integration7 8Load, configure, and run Core ML models in iOS apps. This skill covers the9Swift side: model loading, prediction, MLTensor, profiling, and deployment.10Target iOS 26+ with Swift 6.3, backward-compatible to iOS 14 unless noted.11 12> **Scope boundary:** Python-side model conversion, optimization (quantization,13> palettization, pruning), and framework selection live in the `apple-on-device-ai`14> skill. This skill owns Swift integration only.15 16See [references/coreml-swift-integration.md](references/coreml-swift-integration.md) for complete code patterns including17actor-based caching, batch inference, image preprocessing, and testing.18 19## Contents20 21- [Loading Models](#loading-models)22- [Model Configuration](#model-configuration)23- [Making Predictions](#making-predictions)24- [MLTensor (iOS 18+)](#mltensor-ios-18)25- [Working with MLMultiArray](#working-with-mlmultiarray)26- [Image Preprocessing](#image-preprocessing)27- [Multi-Model Pipelines](#multi-model-pipelines)28- [Vision Integration](#vision-integration)29- [Performance Profiling](#performance-profiling)30- [Model Deployment](#model-deployment)31- [Memory Management](#memory-management)32- [Common Mistakes](#common-mistakes)33- [Review Checklist](#review-checklist)34- [References](#references)35 36## Loading Models37 38### Auto-Generated Classes39 40When you drag a `.mlpackage` or `.mlmodelc` into Xcode, it generates a Swift41class with typed input/output. Use this whenever possible.42 43```swift44import CoreML45 46let config = MLModelConfiguration()47config.computeUnits = .all48 49let model = try MyImageClassifier(configuration: config)50```51 52### Manual Loading53 54Load from a URL when the model is downloaded at runtime or stored outside the55bundle.56 57```swift58let modelURL = Bundle.main.url(59    forResource: "MyModel", withExtension: "mlmodelc"60)!61let model = try MLModel(contentsOf: modelURL, configuration: config)62```63 64### Async Loading (iOS 16+)65 66Load models without blocking the main thread. Prefer this for large models.67 68```swift69let model = try await MLModel.load(70    contentsOf: modelURL,71    configuration: config72)73```74 75### Compile at Runtime76 77Compile a `.mlpackage` or `.mlmodel` to `.mlmodelc` on device. Useful for78models downloaded from a server.79 80```swift81let compiledURL = try await MLModel.compileModel(at: packageURL)82let model = try MLModel(contentsOf: compiledURL, configuration: config)83```84 85Cache the compiled URL -- recompiling on every launch wastes time. Copy86`compiledURL` to a persistent location (e.g., Application Support).87 88## Model Configuration89 90`MLModelConfiguration` controls compute units, GPU access, and model parameters.91 92### Compute Units Decision Table93 94| Value | Uses | When to Choose |95|---|---|---|96| `.all` | CPU + GPU + Neural Engine | Default. Let the system decide. |97| `.cpuOnly` | CPU | Background tasks, audio sessions, or when GPU is busy. |98| `.cpuAndGPU` | CPU + GPU | Need GPU but model has ops unsupported by ANE. |99| `.cpuAndNeuralEngine` | CPU + Neural Engine | Best energy efficiency for compatible models. |100 101```swift102let config = MLModelConfiguration()103config.computeUnits = .cpuAndNeuralEngine104 105// Allow low-priority background inference106config.computeUnits = .cpuOnly107```108 109### Configuration Properties110 111```swift112let config = MLModelConfiguration()113config.computeUnits = .all114config.allowLowPrecisionAccumulationOnGPU = true // faster, slight precision loss115```116 117## Making Predictions118 119### With Auto-Generated Classes120 121The generated class provides typed input/output structs.122 123```swift124let model = try MyImageClassifier(configuration: config)125let input = MyImageClassifierInput(image: pixelBuffer)126let output = try model.prediction(input: input)127print(output.classLabel)        // "golden_retriever"128print(output.classLabelProbs)   // ["golden_retriever": 0.95, ...]129```130 131### With MLDictionaryFeatureProvider132 133Use when inputs are dynamic or not known at compile time.134 135```swift136let inputFeatures = try MLDictionaryFeatureProvider(dictionary: [137    "image": MLFeatureValue(pixelBuffer: pixelBuffer),138    "confidence_threshold": MLFeatureValue(double: 0.5),139])140let output = try model.prediction(from: inputFeatures)141let label = output.featureValue(for: "classLabel")?.stringValue142```143 144### Async Prediction (iOS 17+)145 146```swift147let output = try await model.prediction(from: inputFeatures)148```149 150### Batch Prediction151 152Process multiple inputs in one call for better throughput.153 154```swift155let batchInputs = try MLArrayBatchProvider(array: inputs.map { input in156    try MLDictionaryFeatureProvider(dictionary: ["image": MLFeatureValue(pixelBuffer: input)])157})158let batchOutput = try model.predictions(from: batchInputs)159for i in 0..<batchOutput.count {160    let result = batchOutput.features(at: i)161    print(result.featureValue(for: "classLabel")?.stringValue ?? "unknown")162}163```164 165### Stateful Prediction (iOS 18+)166 167Use `MLState` for models that maintain state across predictions (sequence models,168LLMs, audio accumulators). Create state once and pass it to each prediction call.169 170```swift171let state = model.makeState()172 173// Each prediction carries forward the internal model state174for frame in audioFrames {175    let input = try MLDictionaryFeatureProvider(dictionary: [176        "audio_features": MLFeatureValue(multiArray: frame)177    ])178    let output = try await model.prediction(from: input, using: state)179    let classification = output.featureValue(for: "label")?.stringValue180}181```182 183State is not `Sendable` -- use it from a single actor or task. Call184`model.makeState()` to create independent state for concurrent streams.185 186## MLTensor (iOS 18+)187 188`MLTensor` is a Swift-native multidimensional array for pre/post-processing.189Operations run lazily -- call `.shapedArray(of:)` to materialize results.190 191```swift192import CoreML193 194// Creation195let tensor = MLTensor([1.0, 2.0, 3.0, 4.0])196let zeros = MLTensor(zeros: [3, 224, 224], scalarType: Float.self)197 198// Reshaping199let reshaped = tensor.reshaped(to: [2, 2])200 201// Math operations202let softmaxed = tensor.softmax()203let normalized = (tensor - tensor.mean()) / tensor.standardDeviation()204 205// Interop with MLMultiArray206let multiArray = try MLMultiArray([1.0, 2.0, 3.0, 4.0])207let fromMultiArray = MLTensor(multiArray)208let backToArray = tensor.shapedArray(of: Float.self)209```210 211## Working with MLMultiArray212 213`MLMultiArray` is the primary data exchange type for non-image model inputs and214outputs. Use it when the auto-generated class expects array-type features.215 216```swift217// Create a 3D array: [batch, sequence, features]218let array = try MLMultiArray(shape: [1, 128, 768], dataType: .float32)219 220// Write values221for i in 0..<128 {222    array[[0, i, 0] as [NSNumber]] = NSNumber(value: Float(i))223}224 225// Read values226let value = array[[0, 0, 0] as [NSNumber]].floatValue227 228// Create from data pointer for zero-copy interop229let data: [Float] = [1.0, 2.0, 3.0]230let fromData = try MLMultiArray(dataPointer: UnsafeMutableRawPointer(mutating: data),231                                 shape: [3],232                                 dataType: .float32,233                                 strides: [1])234```235 236See [references/coreml-swift-integration.md](references/coreml-swift-integration.md) for advanced MLMultiArray patterns237including NLP tokenization and audio feature extraction.238 239## Image Preprocessing240 241Image models expect `CVPixelBuffer` input. Use `CGImage` conversion for photos242from the camera or photo library. Vision's `VNCoreMLRequest` handles this243automatically; manual conversion is needed only for direct `MLModel` prediction.244 245```swift246import CoreVideo247 248func createPixelBuffer(from cgImage: CGImage, width: Int, height: Int) -> CVPixelBuffer? {249    var pixelBuffer: CVPixelBuffer?250    let attrs: [CFString: Any] = [251        kCVPixelBufferCGImageCompatibilityKey: true,252        kCVPixelBufferCGBitmapContextCompatibilityKey: true,253    ]254    CVPixelBufferCreate(kCFAllocatorDefault, width, height,255                        kCVPixelFormatType_32ARGB, attrs as CFDictionary, &pixelBuffer)256 257    guard let buffer = pixelBuffer else { return nil }258    CVPixelBufferLockBaseAddress(buffer, [])259    let context = CGContext(260        data: CVPixelBufferGetBaseAddress(buffer),261        width: width, height: height,262        bitsPerComponent: 8, bytesPerRow: CVPixelBufferGetBytesPerRow(buffer),263        space: CGColorSpaceCreateDeviceRGB(),264        bitmapInfo: CGImageAlphaInfo.noneSkipFirst.rawValue265    )266    context?.draw(cgImage, in: CGRect(x: 0, y: 0, width: width, height: height))267    CVPixelBufferUnlockBaseAddress(buffer, [])268    return buffer269}270```271 272For additional preprocessing patterns (normalization, center-cropping), see273[references/coreml-swift-integration.md](references/coreml-swift-integration.md).274 275## Multi-Model Pipelines276 277Chain models when preprocessing or postprocessing requires a separate model.278 279```swift280// Sequential inference: preprocessor -> main model -> postprocessor281let preprocessed = try preprocessor.prediction(from: rawInput)282let mainOutput = try mainModel.prediction(from: preprocessed)283let finalOutput = try postprocessor.prediction(from: mainOutput)284```285 286For Xcode-managed pipelines, use the pipeline model type in the `.mlpackage`.287Each sub-model runs on its optimal compute unit.288 289## Vision Integration290 291Use Vision to run Core ML image models with automatic image preprocessing292(resizing, normalization, color space, orientation).293 294### Modern: CoreMLRequest (iOS 18+)295 296```swift297import Vision298import CoreML299 300let model = try MLModel(contentsOf: modelURL, configuration: config)301let request = CoreMLRequest(model: .init(model))302let results = try await request.perform(on: cgImage)303 304if let classification = results.first as? ClassificationObservation {305    print("\(classification.identifier): \(classification.confidence)")306}307```308 309### Legacy: VNCoreMLRequest310 311```swift312let vnModel = try VNCoreMLModel(for: model)313let request = VNCoreMLRequest(model: vnModel) { request, error in314    guard let results = request.results as? [VNRecognizedObjectObservation] else { return }315    for observation in results {316        let label = observation.labels.first?.identifier ?? "unknown"317        let confidence = observation.labels.first?.confidence ?? 0318        let boundingBox = observation.boundingBox // normalized coordinates319        print("\(label): \(confidence) at \(boundingBox)")320    }321}322request.imageCropAndScaleOption = .scaleFill323 324let handler = VNImageRequestHandler(cvPixelBuffer: pixelBuffer)325try handler.perform([request])326```327 328> For complete Vision framework patterns (text recognition, barcode detection,329> document scanning), see the `vision-framework` skill.330 331## Performance Profiling332 333### MLComputePlan (iOS 17.4+)334 335Inspect which compute device each operation will use before running predictions.336 337```swift338let computePlan = try await MLComputePlan.load(339    contentsOf: modelURL, configuration: config340)341guard case let .program(program) = computePlan.modelStructure else { return }342guard let mainFunction = program.functions["main"] else { return }343 344for operation in mainFunction.block.operations {345    let deviceUsage = computePlan.deviceUsage(for: operation)346    let estimatedCost = computePlan.estimatedCost(of: operation)347    print("\(operation.operatorName): \(deviceUsage?.preferredComputeDevice ?? "unknown")")348}349```350 351### Instruments352 353Use the **Core ML** instrument template in Instruments to profile:354- Model load time355- Prediction latency (per-operation breakdown)356- Compute device dispatch (CPU/GPU/ANE per operation)357- Memory allocation358 359Run outside the debugger for accurate results (Xcode: Product > Profile).360 361## Model Deployment362 363### Bundle vs On-Demand Resources364 365| Strategy | Pros | Cons |366|---|---|---|367| Bundle in app | Instant availability, works offline | Increases app download size |368| On-demand resources | Smaller initial download | Requires download before first use |369| Background Assets (iOS 16+) | Downloads ahead of time | More complex setup |370| CloudKit / server | Maximum flexibility | Requires network, longer setup |371 372### Size Considerations373 374- App Store limit: 4 GB for app bundle375- Cellular download limit: 200 MB (can request exception)376- Use ODR tags for models > 50 MB377- Pre-compile to `.mlmodelc` to skip on-device compilation378 379```swift380// On-demand resource loading381let request = NSBundleResourceRequest(tags: ["ml-model-v2"])382try await request.beginAccessingResources()383let modelURL = Bundle.main.url(forResource: "LargeModel", withExtension: "mlmodelc")!384let model = try await MLModel.load(contentsOf: modelURL, configuration: config)385// Call request.endAccessingResources() when done386```387 388## Memory Management389 390- **Unload on background:** Release model references when the app enters background391  to free GPU/ANE memory. Reload on foreground return.392- **Use `.cpuOnly` for background tasks:** Background processing cannot use GPU or393  ANE; setting `.cpuOnly` avoids silent fallback and resource contention.394- **Share model instances:** Never create multiple `MLModel` instances from the same395  compiled model. Use an actor to provide shared access.396- **Monitor memory pressure:** Large models (>100 MB) can trigger memory warnings.397  Register for `UIApplication.didReceiveMemoryWarningNotification` and release398  cached models when under pressure.399 400See [references/coreml-swift-integration.md](references/coreml-swift-integration.md) for an actor-based model manager with401lifecycle-aware loading and cache eviction.402 403## Common Mistakes404 405**DON'T:** Load models on the main thread.406**DO:** Use `MLModel.load(contentsOf:configuration:)` async API or load on a background actor.407**Why:** Large models can take seconds to load, freezing the UI.408 409**DON'T:** Recompile `.mlpackage` to `.mlmodelc` on every app launch.410**DO:** Compile once with `MLModel.compileModel(at:)` and cache the compiled URL persistently.411**Why:** Compilation is expensive. Cache the `.mlmodelc` in Application Support.412 413**DON'T:** Hardcode `.cpuOnly` unless you have a specific reason.414**DO:** Use `.all` and let the system choose the optimal compute unit.415**Why:** `.all` enables Neural Engine and GPU, which are faster and more energy-efficient.416 417**DON'T:** Ignore `MLFeatureValue` type mismatches between input and model expectations.418**DO:** Match types exactly -- use `MLFeatureValue(pixelBuffer:)` for images, not raw data.419**Why:** Type mismatches cause cryptic runtime crashes or silent incorrect results.420 421**DON'T:** Create a new `MLModel` instance for every prediction.422**DO:** Load once and reuse. Use an actor to manage the model lifecycle.423**Why:** Model loading allocates significant memory and compute resources.424 425**DON'T:** Skip error handling for model loading and prediction.426**DO:** Catch errors and provide fallback behavior when the model fails.427**Why:** Models can fail to load on older devices or when resources are constrained.428 429**DON'T:** Assume all operations run on the Neural Engine.430**DO:** Use `MLComputePlan` (iOS 17.4+) to verify device dispatch per operation.431**Why:** Unsupported operations fall back to CPU, which may bottleneck the pipeline.432 433**DON'T:** Process images manually before passing to Vision + Core ML.434**DO:** Use `CoreMLRequest` (iOS 18+) or `VNCoreMLRequest` (legacy) to let Vision handle preprocessing.435**Why:** Vision handles orientation, scaling, and pixel format conversion correctly.436 437## Review Checklist438 439- [ ] Model loaded asynchronously (not blocking main thread)440- [ ] `MLModelConfiguration.computeUnits` set appropriately for use case441- [ ] Model instance reused across predictions (not recreated each time)442- [ ] Auto-generated class used when available (typed inputs/outputs)443- [ ] Error handling for model loading and prediction failures444- [ ] Compiled model cached persistently if compiled at runtime445- [ ] Image inputs use Vision pipeline (`CoreMLRequest` iOS 18+ or `VNCoreMLRequest`) for correct preprocessing446- [ ] `MLComputePlan` checked to verify compute device dispatch (iOS 17.4+)447- [ ] Batch predictions used when processing multiple inputs448- [ ] Model size appropriate for deployment strategy (bundle vs ODR)449- [ ] Memory tested on target devices (especially older devices with less RAM)450- [ ] Predictions run outside debugger for accurate performance measurement451 452## References453 454- Patterns and code: [references/coreml-swift-integration.md](references/coreml-swift-integration.md)455- Model conversion and optimization (Python-side): covered in the `apple-on-device-ai` skill456- Apple docs: [Core ML](https://sosumi.ai/documentation/coreml) |457  [MLModel](https://sosumi.ai/documentation/coreml/mlmodel) |458  [MLComputePlan](https://sosumi.ai/documentation/coreml/mlcomputeplan-1w21n)
Related skills
Alarmkit

Install Alarmkit skill for Claude Code from dpearson2699/swift-ios-skills.
App Clips

Install App Clips skill for Claude Code from dpearson2699/swift-ios-skills.
App Intents

Install App Intents skill for Claude Code from dpearson2699/swift-ios-skills.