Claude Agent Skill · by Wshobson

Hybrid Search Implementation

Combines vector similarity search with keyword matching using reciprocal rank fusion and linear score combination. Includes PostgreSQL implementations with pgvector.

Install

```sh
npx skills add https://github.com/wshobson/agents --skill hybrid-search-implementation
```
Works with Paperclip

How Hybrid Search Implementation fits into a Paperclip company.

Hybrid Search Implementation drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

Source file
SKILL.md · 564 lines
---
name: hybrid-search-implementation
description: Combine vector and keyword search for improved retrieval. Use when implementing RAG systems, building search engines, or when neither approach alone provides sufficient recall.
---

# Hybrid Search Implementation

Patterns for combining vector similarity and keyword-based search.

## When to Use This Skill

- Building RAG systems with improved recall
- Combining semantic understanding with exact matching
- Handling queries with specific terms (names, codes)
- Improving search for domain-specific vocabulary
- When pure vector search misses keyword matches

## Core Concepts

### 1. Hybrid Search Architecture

```
Query → ┬─► Vector Search ──► Candidates ─┐
        │                                  │
        └─► Keyword Search ─► Candidates ─┴─► Fusion ─► Results
```

### 2. Fusion Methods

| Method            | Description              | Best For        |
| ----------------- | ------------------------ | --------------- |
| **RRF**           | Reciprocal Rank Fusion   | General purpose |
| **Linear**        | Weighted sum of scores   | Tunable balance |
| **Cross-encoder** | Rerank with neural model | Highest quality |
| **Cascade**       | Filter then rerank       | Efficiency      |

## Templates

### Template 1: Reciprocal Rank Fusion

```python
from typing import List, Tuple
from collections import defaultdict


def reciprocal_rank_fusion(
    result_lists: List[List[Tuple[str, float]]],
    k: int = 60,
    weights: List[float] = None
) -> List[Tuple[str, float]]:
    """
    Combine multiple ranked lists using RRF.

    Args:
        result_lists: List of (doc_id, score) tuples per search method
        k: RRF constant (higher = more weight to lower ranks)
        weights: Optional weights per result list

    Returns:
        Fused ranking as (doc_id, score) tuples
    """
    if weights is None:
        weights = [1.0] * len(result_lists)

    scores = defaultdict(float)

    for result_list, weight in zip(result_lists, weights):
        for rank, (doc_id, _) in enumerate(result_list):
            # RRF formula: 1 / (k + rank)
            scores[doc_id] += weight * (1.0 / (k + rank + 1))

    # Sort by fused score
    return sorted(scores.items(), key=lambda x: x[1], reverse=True)


def linear_combination(
    vector_results: List[Tuple[str, float]],
    keyword_results: List[Tuple[str, float]],
    alpha: float = 0.5
) -> List[Tuple[str, float]]:
    """
    Combine results with linear interpolation.

    Args:
        vector_results: (doc_id, similarity_score) from vector search
        keyword_results: (doc_id, bm25_score) from keyword search
        alpha: Weight for vector search (1 - alpha for keyword)
    """
    # Normalize scores to [0, 1]
    def normalize(results):
        if not results:
            return {}
        scores = [s for _, s in results]
        min_s, max_s = min(scores), max(scores)
        range_s = max_s - min_s if max_s != min_s else 1
        return {doc_id: (score - min_s) / range_s for doc_id, score in results}

    vector_scores = normalize(vector_results)
    keyword_scores = normalize(keyword_results)

    # Combine
    all_docs = set(vector_scores.keys()) | set(keyword_scores.keys())
    combined = {}

    for doc_id in all_docs:
        v_score = vector_scores.get(doc_id, 0)
        k_score = keyword_scores.get(doc_id, 0)
        combined[doc_id] = alpha * v_score + (1 - alpha) * k_score

    return sorted(combined.items(), key=lambda x: x[1], reverse=True)
```

### Template 2: PostgreSQL Hybrid Search

```python
import asyncpg
from typing import List, Dict, Optional


class PostgresHybridSearch:
    """Hybrid search with pgvector and full-text search."""

    def __init__(self, pool: asyncpg.Pool):
        self.pool = pool

    async def setup_schema(self):
        """Create tables and indexes."""
        async with self.pool.acquire() as conn:
            await conn.execute("""
                CREATE EXTENSION IF NOT EXISTS vector;

                CREATE TABLE IF NOT EXISTS documents (
                    id TEXT PRIMARY KEY,
                    content TEXT NOT NULL,
                    embedding vector(1536),
                    metadata JSONB DEFAULT '{}',
                    ts_content tsvector GENERATED ALWAYS AS (
                        to_tsvector('english', content)
                    ) STORED
                );

                -- Vector index (HNSW)
                CREATE INDEX IF NOT EXISTS documents_embedding_idx
                ON documents USING hnsw (embedding vector_cosine_ops);

                -- Full-text index (GIN)
                CREATE INDEX IF NOT EXISTS documents_fts_idx
                ON documents USING gin (ts_content);
            """)

    async def hybrid_search(
        self,
        query: str,
        query_embedding: List[float],
        limit: int = 10,
        vector_weight: float = 0.5,
        filter_metadata: Optional[Dict] = None
    ) -> List[Dict]:
        """
        Perform hybrid search combining vector and full-text.

        Uses RRF fusion for combining results.
        """
        async with self.pool.acquire() as conn:
            # Build filter clause. vector_weight is bound as $4 up front
            # so metadata filter parameters start at $5 and cannot
            # collide with the weight placeholder in the SQL below.
            where_clause = "1=1"
            params = [query_embedding, query, limit * 3, vector_weight]

            if filter_metadata:
                for key, value in filter_metadata.items():
                    params.append(value)
                    where_clause += f" AND metadata->>'{key}' = ${len(params)}"

            results = await conn.fetch(f"""
                WITH vector_search AS (
                    SELECT
                        id,
                        content,
                        metadata,
                        ROW_NUMBER() OVER (ORDER BY embedding <=> $1::vector) as vector_rank,
                        1 - (embedding <=> $1::vector) as vector_score
                    FROM documents
                    WHERE {where_clause}
                    ORDER BY embedding <=> $1::vector
                    LIMIT $3
                ),
                keyword_search AS (
                    SELECT
                        id,
                        content,
                        metadata,
                        ROW_NUMBER() OVER (ORDER BY ts_rank(ts_content, websearch_to_tsquery('english', $2)) DESC) as keyword_rank,
                        ts_rank(ts_content, websearch_to_tsquery('english', $2)) as keyword_score
                    FROM documents
                    WHERE ts_content @@ websearch_to_tsquery('english', $2)
                      AND {where_clause}
                    ORDER BY ts_rank(ts_content, websearch_to_tsquery('english', $2)) DESC
                    LIMIT $3
                )
                SELECT
                    COALESCE(v.id, k.id) as id,
                    COALESCE(v.content, k.content) as content,
                    COALESCE(v.metadata, k.metadata) as metadata,
                    v.vector_score,
                    k.keyword_score,
                    -- RRF fusion
                    COALESCE(1.0 / (60 + v.vector_rank), 0) * $4::float +
                    COALESCE(1.0 / (60 + k.keyword_rank), 0) * (1 - $4::float) as rrf_score
                FROM vector_search v
                FULL OUTER JOIN keyword_search k ON v.id = k.id
                ORDER BY rrf_score DESC
                LIMIT $3 / 3
            """, *params)

            return [dict(row) for row in results]

    async def search_with_rerank(
        self,
        query: str,
        query_embedding: List[float],
        limit: int = 10,
        rerank_candidates: int = 50
    ) -> List[Dict]:
        """Hybrid search with cross-encoder reranking."""
        from sentence_transformers import CrossEncoder

        # Get candidates
        candidates = await self.hybrid_search(
            query, query_embedding, limit=rerank_candidates
        )

        if not candidates:
            return []

        # Rerank with cross-encoder
        model = CrossEncoder('cross-encoder/ms-marco-MiniLM-L-6-v2')

        pairs = [(query, c["content"]) for c in candidates]
        scores = model.predict(pairs)

        for candidate, score in zip(candidates, scores):
            candidate["rerank_score"] = float(score)

        # Sort by rerank score and return top results
        reranked = sorted(candidates, key=lambda x: x["rerank_score"], reverse=True)
        return reranked[:limit]
```

### Template 3: Elasticsearch Hybrid Search

```python
from elasticsearch import Elasticsearch
from typing import List, Dict, Optional


class ElasticsearchHybridSearch:
    """Hybrid search with Elasticsearch and dense vectors."""

    def __init__(
        self,
        es_client: Elasticsearch,
        index_name: str = "documents"
    ):
        self.es = es_client
        self.index_name = index_name

    def create_index(self, vector_dims: int = 1536):
        """Create index with dense vector and text fields."""
        mapping = {
            "mappings": {
                "properties": {
                    "content": {
                        "type": "text",
                        "analyzer": "english"
                    },
                    "embedding": {
                        "type": "dense_vector",
                        "dims": vector_dims,
                        "index": True,
                        "similarity": "cosine"
                    },
                    "metadata": {
                        "type": "object",
                        "enabled": True
                    }
                }
            }
        }
        self.es.indices.create(index=self.index_name, body=mapping, ignore=400)

    def hybrid_search(
        self,
        query: str,
        query_embedding: List[float],
        limit: int = 10,
        boost_vector: float = 1.0,
        boost_text: float = 1.0,
        filter: Optional[Dict] = None
    ) -> List[Dict]:
        """
        Hybrid search using Elasticsearch's built-in capabilities.
        """
        # Build the hybrid query
        search_body = {
            "size": limit,
            "query": {
                "bool": {
                    "should": [
                        # Vector search (kNN)
                        {
                            "script_score": {
                                "query": {"match_all": {}},
                                "script": {
                                    "source": f"cosineSimilarity(params.query_vector, 'embedding') * {boost_vector} + 1.0",
                                    "params": {"query_vector": query_embedding}
                                }
                            }
                        },
                        # Text search (BM25)
                        {
                            "match": {
                                "content": {
                                    "query": query,
                                    "boost": boost_text
                                }
                            }
                        }
                    ],
                    "minimum_should_match": 1
                }
            }
        }

        # Add filter if provided
        if filter:
            search_body["query"]["bool"]["filter"] = filter

        response = self.es.search(index=self.index_name, body=search_body)

        return [
            {
                "id": hit["_id"],
                "content": hit["_source"]["content"],
                "metadata": hit["_source"].get("metadata", {}),
                "score": hit["_score"]
            }
            for hit in response["hits"]["hits"]
        ]

    def hybrid_search_rrf(
        self,
        query: str,
        query_embedding: List[float],
        limit: int = 10,
        window_size: int = 100
    ) -> List[Dict]:
        """
        Hybrid search using Elasticsearch 8.x RRF.
        """
        search_body = {
            "size": limit,
            "sub_searches": [
                {
                    "query": {
                        "match": {
                            "content": query
                        }
                    }
                },
                {
                    "query": {
                        "knn": {
                            "field": "embedding",
                            "query_vector": query_embedding,
                            "k": window_size,
                            "num_candidates": window_size * 2
                        }
                    }
                }
            ],
            "rank": {
                "rrf": {
                    "window_size": window_size,
                    "rank_constant": 60
                }
            }
        }

        response = self.es.search(index=self.index_name, body=search_body)

        return [
            {
                "id": hit["_id"],
                "content": hit["_source"]["content"],
                "score": hit["_score"]
            }
            for hit in response["hits"]["hits"]
        ]
```

### Template 4: Custom Hybrid RAG Pipeline

```python
import asyncio
from typing import List, Dict, Optional
from dataclasses import dataclass


@dataclass
class SearchResult:
    id: str
    content: str
    score: float
    source: str  # "vector", "keyword", "hybrid"
    metadata: Dict = None


class HybridRAGPipeline:
    """Complete hybrid search pipeline for RAG."""

    def __init__(
        self,
        vector_store,
        keyword_store,
        embedder,
        reranker=None,
        fusion_method: str = "rrf",
        vector_weight: float = 0.5
    ):
        self.vector_store = vector_store
        self.keyword_store = keyword_store
        self.embedder = embedder
        self.reranker = reranker
        self.fusion_method = fusion_method
        self.vector_weight = vector_weight

    async def search(
        self,
        query: str,
        top_k: int = 10,
        filter: Optional[Dict] = None,
        use_rerank: bool = True
    ) -> List[SearchResult]:
        """Execute hybrid search pipeline."""

        # Step 1: Get query embedding
        query_embedding = self.embedder.embed(query)

        # Step 2: Execute parallel searches
        vector_results, keyword_results = await asyncio.gather(
            self._vector_search(query_embedding, top_k * 3, filter),
            self._keyword_search(query, top_k * 3, filter)
        )

        # Step 3: Fuse results
        if self.fusion_method == "rrf":
            fused = self._rrf_fusion(vector_results, keyword_results)
        else:
            fused = self._linear_fusion(vector_results, keyword_results)

        # Step 4: Rerank if enabled
        if use_rerank and self.reranker:
            fused = await self._rerank(query, fused[:top_k * 2])

        return fused[:top_k]

    async def _vector_search(
        self,
        embedding: List[float],
        limit: int,
        filter: Dict
    ) -> List[SearchResult]:
        results = await self.vector_store.search(embedding, limit, filter)
        return [
            SearchResult(
                id=r["id"],
                content=r["content"],
                score=r["score"],
                source="vector",
                metadata=r.get("metadata")
            )
            for r in results
        ]

    async def _keyword_search(
        self,
        query: str,
        limit: int,
        filter: Dict
    ) -> List[SearchResult]:
        results = await self.keyword_store.search(query, limit, filter)
        return [
            SearchResult(
                id=r["id"],
                content=r["content"],
                score=r["score"],
                source="keyword",
                metadata=r.get("metadata")
            )
            for r in results
        ]

    def _rrf_fusion(
        self,
        vector_results: List[SearchResult],
        keyword_results: List[SearchResult]
    ) -> List[SearchResult]:
        """Fuse with RRF."""
        k = 60
        scores = {}
        content_map = {}

        for rank, result in enumerate(vector_results):
            scores[result.id] = scores.get(result.id, 0) + 1 / (k + rank + 1)
            content_map[result.id] = result

        for rank, result in enumerate(keyword_results):
            scores[result.id] = scores.get(result.id, 0) + 1 / (k + rank + 1)
            if result.id not in content_map:
                content_map[result.id] = result

        sorted_ids = sorted(scores.keys(), key=lambda x: scores[x], reverse=True)

        return [
            SearchResult(
                id=doc_id,
                content=content_map[doc_id].content,
                score=scores[doc_id],
                source="hybrid",
                metadata=content_map[doc_id].metadata
            )
            for doc_id in sorted_ids
        ]

    def _linear_fusion(
        self,
        vector_results: List[SearchResult],
        keyword_results: List[SearchResult]
    ) -> List[SearchResult]:
        """Fuse with a min-max-normalized weighted sum
        (mirrors linear_combination in Template 1)."""
        def normalize(results):
            if not results:
                return {}
            lo = min(r.score for r in results)
            hi = max(r.score for r in results)
            span = (hi - lo) or 1
            return {r.id: (r.score - lo) / span for r in results}

        v_scores = normalize(vector_results)
        k_scores = normalize(keyword_results)

        content_map = {r.id: r for r in keyword_results}
        content_map.update({r.id: r for r in vector_results})

        combined = {
            doc_id: self.vector_weight * v_scores.get(doc_id, 0)
            + (1 - self.vector_weight) * k_scores.get(doc_id, 0)
            for doc_id in content_map
        }

        return [
            SearchResult(
                id=doc_id,
                content=content_map[doc_id].content,
                score=score,
                source="hybrid",
                metadata=content_map[doc_id].metadata
            )
            for doc_id, score in sorted(combined.items(), key=lambda x: x[1], reverse=True)
        ]

    async def _rerank(
        self,
        query: str,
        results: List[SearchResult]
    ) -> List[SearchResult]:
        """Rerank with cross-encoder."""
        if not results:
            return results

        pairs = [(query, r.content) for r in results]
        scores = self.reranker.predict(pairs)

        for result, score in zip(results, scores):
            result.score = float(score)

        return sorted(results, key=lambda x: x.score, reverse=True)
```

## Best Practices

### Do's

- **Tune weights empirically** - Test on your data
- **Use RRF for simplicity** - Works well without tuning
- **Add reranking** - Significant quality improvement
- **Log both scores** - Helps with debugging
- **A/B test** - Measure real user impact

### Don'ts

- **Don't assume one size fits all** - Different queries need different weights
- **Don't skip keyword search** - Handles exact matches better
- **Don't over-fetch** - Balance recall vs latency
- **Don't ignore edge cases** - Empty results, single word queries
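The fusion-methods table lists a cascade strategy (score everything cheaply, then spend an expensive reranker only on the survivors) but no template covers it. Below is a minimal sketch; `cascade_search` and `token_overlap` are illustrative names not taken from this skill, and `token_overlap` is a toy stand-in for a real first-stage scorer such as BM25 or a vector index.

```python
from typing import Callable, Dict, List, Tuple


def cascade_search(
    query: str,
    documents: Dict[str, str],
    cheap_score: Callable[[str, str], float],
    expensive_score: Callable[[str, str], float],
    prefilter_k: int = 50,
    top_k: int = 10,
) -> List[Tuple[str, float]]:
    """Cascade fusion: cheap scorer over the whole corpus,
    expensive scorer over the best prefilter_k candidates only."""
    # Stage 1: cheap filter over every document
    coarse = sorted(
        ((doc_id, cheap_score(query, text)) for doc_id, text in documents.items()),
        key=lambda x: x[1],
        reverse=True,
    )[:prefilter_k]

    # Stage 2: expensive rerank over the surviving candidates
    reranked = [
        (doc_id, expensive_score(query, documents[doc_id]))
        for doc_id, _ in coarse
    ]
    return sorted(reranked, key=lambda x: x[1], reverse=True)[:top_k]


def token_overlap(query: str, text: str) -> float:
    """Toy first-stage scorer: fraction of query tokens present in the text."""
    q, t = set(query.lower().split()), set(text.lower().split())
    return len(q & t) / (len(q) or 1)
```

Because the expensive stage only ever sees `prefilter_k` documents, its cost is independent of corpus size, which is why the table marks cascade as the efficiency option; a cross-encoder like the one in Template 2 is a natural fit for the second stage.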