Querying

Use retriever() for dense, sparse, hybrid, custom, prompt, and tool-based retrieval.

retriever() is the read side of a retrieval corpus. It turns a text query into scored hits and can also inject those hits into prompts or expose search tools to a model.

import { retriever } from '@crux/core/retrieval'

const docs = retriever({
  id: 'docs',
  namespace: 'product-docs',
  data,
  vectors,
  dense,
  search: { mode: 'dense', limit: 6 },
})

const hits = await docs.retrieve('how do refunds work?')

Use the same retriever in prompt composition:

const promptDocs = retriever({
  id: 'docs',
  namespace: 'product-docs',
  data,
  vectors,
  dense,
  context: {
    query: ({ question }) => question,
    limit: 6,
  },
})

const answerDocs = prompt({
  id: 'answer-docs',
  use: [promptDocs],
  input: z.object({ question: z.string() }),
  system: 'Answer from the retrieved docs.',
  prompt: ({ input }) => input.question,
})

Choose A Search Mode

Dense is the right default for natural-language semantic similarity:

const docs = retriever({
  id: 'docs',
  namespace: 'product-docs',
  data,
  vectors,
  dense,
  search: { mode: 'dense', limit: 5 },
})

Sparse is useful for exact terms, codes, identifiers, product names, symbols, and short keyword-style queries:

const docs = retriever({
  id: 'docs',
  namespace: 'product-docs',
  data,
  vectors,
  sparse,
  search: { mode: 'sparse', limit: 5 },
})

await docs.retrieve('ERR_BILLING_CREDIT_409')

Hybrid is the recommended production default when your vector store supports it. Dense catches meaning; sparse catches exact terms:

const docs = retriever({
  id: 'docs',
  namespace: 'product-docs',
  data,
  vectors,
  dense,
  sparse,
  search: {
    mode: 'hybrid',
    fusion: 'dbsf',
    limit: 8,
  },
})

Mode requirements are explicit:

Mode	Requires	Vector capability
`dense`	`dense`	`vectors.search({ dense })`
`sparse`	`sparse`	`vectors.search({ sparse })`
`hybrid`	`dense` and `sparse`	`vectors.search({ dense, sparse, fusion? })`

Unsupported combinations throw. Crux does not silently downgrade sparse or hybrid retrieval to dense-only search.

Inject Context Or Tools

Use inject: 'context' when the model should always receive retrieved context before generation:

const docs = retriever({
  id: 'docs',
  namespace: 'product-docs',
  data,
  vectors,
  dense,
  inject: 'context',
  context: {
    query: ({ question }) => question,
    renderContext: (hits) =>
      hits
        .map((hit, index) => `[${index + 1}] ${hit.sourceId}/${hit.chunkId}\n${hit.content}`)
        .join('\n\n'),
  },
})

Use inject: 'tool' when the model should decide whether to search:

const searchableDocs = retriever({
  id: 'docs',
  namespace: 'product-docs',
  data,
  vectors,
  dense,
  inject: 'tool',
  tools: {
    prefix: true,
    include: ['search', 'getSource'],
  },
})

const assistant = prompt({
  id: 'assistant',
  use: [searchableDocs],
  system: 'Use docsSearch before answering detailed product questions.',
})

Use inject: 'both' for assistants that should start with likely evidence and still have search tools available.

Manual helpers remain available for advanced integrations that do not resolve generic use entries:

const docsContext = docs.asContext({ query: ({ question }) => question })
const docsTools = docs.asTools({ prefix: true })

Prefer use: [docs] in normal Crux prompt code.

Rerank Results

Use a reranker when raw retrieval gets good candidates but not the best order.

import { reranker } from '@crux/core/retrieval'

const keepRelevant = reranker({
  name: 'keep-relevant',
  async rerank({ query, hits }) {
    const ranked = await rerankWithModel(query, hits)
    return ranked.slice(0, 5)
  },
})

const docs = retriever({
  id: 'docs',
  namespace: 'product-docs',
  data,
  vectors,
  dense,
  sparse,
  search: { mode: 'hybrid', limit: 20 },
  rerank: keepRelevant,
})

Rerankers run before retrieve() returns, before prompt context is rendered, and before tool output is returned.

When you change search mode, limits, rerankers, or store adapters, use RAG evals to check whether recall, precision, citations, and answer quality actually improved.

Custom Retrieval

Use a custom retriever when your backend is not a Crux DataStore plus VectorStore pair.

const docs = retriever({
  id: 'internal-search',
  namespace: 'product-docs',
  async retrieve(query, options) {
    const results = await internalSearch.search({
      query,
      limit: options.limit ?? 5,
      filter: options.filter,
    })

    return results.map((result) => ({
      namespace: 'product-docs',
      sourceId: result.documentId,
      chunkId: result.sectionId,
      content: result.text,
      metadata: result.metadata,
      score: result.score,
      sourceUrl: result.url,
    }))
  },
})

This is the supported escape hatch. Users with a mature search backend should not have to force it into a vector-store shape.

Choose A Search Mode

Inject Context Or Tools

Rerank Results

Custom Retrieval

Embeddings

Pipelines

Grounding

RAG evals

Retrieval Reference

On this page

Querying

Choose A Search Mode

Inject Context Or Tools

Rerank Results

Custom Retrieval

Related

Embeddings

Pipelines

Grounding

RAG evals

Retrieval Reference

On this page