Skip to main content

COMPEL Glossary / embedding-model

Embedding model

A model that maps text, images, or multimodal content to dense vector representations used for retrieval, clustering, and similarity search.

What this means in practice

Distinct from the concept of an embedding (a single vector): the embedding model is the artefact with a version, training corpus, dimensionality, and licensing footprint that governance must track.

Synonyms

text embedding model , encoder model

See also

  • Vector store — A governed index of embeddings — numeric vector representations of text, image, or multimodal content — that supports similarity search used by retrieval-augmented generation.
  • Embedding governance — Readiness criteria specific to vector-store operations — choice of chunking strategy, embedding-model version pinning, index refresh and re-embedding cadence on model upgrades, per-namespace access control, and retention/deletion workflow consistent with data-subject rights.
  • Chunking — The process of dividing documents into units — typically fixed-token windows or paragraph-level segments — suitable for embedding and retrieval.
  • Hybrid retrieval — A retrieval pattern combining dense (vector-based, e.g., dense passage retrieval) and sparse (term-based, e.g., BM25) retrieval methods, whose candidate sets are fused — typically via reciprocal rank fusion — before reranking.