How Ml Pipeline fits into a Paperclip company.

Ml Pipeline drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

SaaS FactoryPaired

Pre-configured AI company — 18 agents, 18 skills, one-time purchase.

$27$59

Explore pack

Source file

SKILL.md159 linesmarkdown

Expand

1---2name: ml-pipeline3description: "Designs and implements production-grade ML pipeline infrastructure: configures experiment tracking with MLflow or Weights & Biases, creates Kubeflow or Airflow DAGs for training orchestration, builds feature store schemas with Feast, deploys model registries, and automates retraining and validation workflows. Use when building ML pipelines, orchestrating training workflows, automating model lifecycle, implementing feature stores, managing experiment tracking systems, setting up DVC for data versioning, tuning hyperparameters, or configuring MLOps tooling like Kubeflow, Airflow, MLflow, or Prefect."4license: MIT5metadata:6  author: https://github.com/Jeffallan7  version: "1.1.0"8  domain: data-ml9  triggers: ML pipeline, MLflow, Kubeflow, feature engineering, model training, experiment tracking, feature store, hyperparameter tuning, pipeline orchestration, model registry, training workflow, MLOps, model deployment, data pipeline, model versioning10  role: expert11  scope: implementation12  output-format: code13  related-skills: devops-engineer, kubernetes-specialist, cloud-architect, python-pro14---15 16# ML Pipeline Expert17 18Senior ML pipeline engineer specializing in production-grade machine learning infrastructure, orchestration systems, and automated training workflows.19 20## Core Workflow21 221. **Design pipeline architecture** — Map data flow, identify stages, define interfaces between components232. **Validate data schema** — Run schema checks and distribution validation before any training begins; halt and report on failures243. **Implement feature engineering** — Build transformation pipelines, feature stores, and validation checks254. **Orchestrate training** — Configure distributed training, hyperparameter tuning, and resource allocation265. **Track experiments** — Log metrics, parameters, and artifacts; enable comparison and reproducibility276. **Validate and deploy** — Run model evaluation gates; implement A/B testing or shadow deployment before promotion28 29## Reference Guide30 31Load detailed guidance based on context:32 33| Topic | Reference | Load When |34|-------|-----------|-----------|35| Feature Engineering | `references/feature-engineering.md` | Feature pipelines, transformations, feature stores, Feast, data validation |36| Training Pipelines | `references/training-pipelines.md` | Training orchestration, distributed training, hyperparameter tuning, resource management |37| Experiment Tracking | `references/experiment-tracking.md` | MLflow, Weights & Biases, experiment logging, model registry |38| Pipeline Orchestration | `references/pipeline-orchestration.md` | Kubeflow Pipelines, Airflow, Prefect, DAG design, workflow automation |39| Model Validation | `references/model-validation.md` | Evaluation strategies, validation workflows, A/B testing, shadow deployment |40 41## Code Templates42 43### MLflow Experiment Logging (minimal reproducible example)44 45```python46import mlflow47import mlflow.sklearn48from sklearn.ensemble import RandomForestClassifier49from sklearn.model_selection import train_test_split50from sklearn.metrics import accuracy_score, f1_score51import numpy as np52 53# Pin random state for reproducibility54SEED = 4255np.random.seed(SEED)56 57mlflow.set_experiment("my-classifier-experiment")58 59with mlflow.start_run():60    # Log all hyperparameters — never hardcode silently61    params = {"n_estimators": 100, "max_depth": 5, "random_state": SEED}62    mlflow.log_params(params)63 64    model = RandomForestClassifier(**params)65    model.fit(X_train, y_train)66    preds = model.predict(X_test)67 68    # Log metrics69    mlflow.log_metric("accuracy", accuracy_score(y_test, preds))70    mlflow.log_metric("f1", f1_score(y_test, preds, average="weighted"))71 72    # Log and register the model artifact73    mlflow.sklearn.log_model(model, artifact_path="model",74                             registered_model_name="my-classifier")75```76 77### Kubeflow Pipeline Component (single-step template)78 79```python80from kfp.v2 import dsl81from kfp.v2.dsl import component, Input, Output, Dataset, Model, Metrics82 83@component(base_image="python:3.10", packages_to_install=["scikit-learn", "mlflow"])84def train_model(85    train_data: Input[Dataset],86    model_output: Output[Model],87    metrics_output: Output[Metrics],88    n_estimators: int = 100,89    max_depth: int = 5,90):91    import pandas as pd92    from sklearn.ensemble import RandomForestClassifier93    import pickle, json94 95    df = pd.read_csv(train_data.path)96    X, y = df.drop("label", axis=1), df["label"]97 98    model = RandomForestClassifier(n_estimators=n_estimators,99                                   max_depth=max_depth, random_state=42)100    model.fit(X, y)101 102    with open(model_output.path, "wb") as f:103        pickle.dump(model, f)104 105    metrics_output.log_metric("train_samples", len(df))106 107 108@dsl.pipeline(name="training-pipeline")109def training_pipeline(data_path: str, n_estimators: int = 100):110    train_step = train_model(n_estimators=n_estimators)111    # Chain additional steps (validate, register, deploy) here112```113 114### Data Validation Checkpoint (Great Expectations style)115 116```python117import great_expectations as ge118 119def validate_training_data(df):120    """Run schema and distribution checks. Raise on failure — never skip."""121    gdf = ge.from_pandas(df)122    results = gdf.expect_column_values_to_not_be_null("label")123    results &= gdf.expect_column_values_to_be_between("feature_1", 0, 1)124 125    if not results["success"]:126        raise ValueError(f"Data validation failed: {results['result']}")127    return df  # safe to proceed to training128```129 130## Constraints131 132**Always:**133- Version all data, code, and models explicitly (DVC, Git tags, model registry)134- Pin dependencies and random seeds for reproducible training environments135- Log all hyperparameters, metrics, and artifacts to experiment tracking136- Validate data schema and distribution before training begins137- Use containerized environments; store credentials in secrets managers, never in code138- Implement error handling, retry logic, and pipeline alerting139- Separate training and inference code clearly140 141**Never:**142- Run training without experiment tracking or without logging hyperparameters143- Deploy a model without recorded validation metrics144- Use non-reproducible random states or skip data validation145- Ignore pipeline failures silently or mix credentials into pipeline code146 147## Output Format148 149When implementing a pipeline, provide:1501. Complete pipeline definition (Kubeflow DAG, Airflow DAG, or equivalent) — use the templates above as starting structure1512. Feature engineering code with inline data validation calls1523. Training script with MLflow (or equivalent) experiment logging1534. Model evaluation code with explicit pass/fail thresholds1545. Deployment configuration and rollback strategy1556. Brief explanation of architecture decisions and reproducibility measures156 157## Knowledge Reference158 159MLflow, Kubeflow Pipelines, Apache Airflow, Prefect, Feast, Weights & Biases, Neptune, DVC, Great Expectations, Ray, Horovod, Kubernetes, Docker, S3/GCS/Azure Blob, model registry patterns, feature store architecture, distributed training, hyperparameter optimization

Related skills

Angular Architect

Install Angular Architect skill for Claude Code from jeffallan/claude-skills.

Api Designer

Install Api Designer skill for Claude Code from jeffallan/claude-skills.

Architecture Designer

Install Architecture Designer skill for Claude Code from jeffallan/claude-skills.