COMPEL Glossary / serving-pattern

Serving pattern

The architectural shape of the inference path — managed API, cloud-platform hosted, self-hosted online, self-hosted batch, or edge.

What this means in practice

Each pattern carries a characteristic cost profile, operational posture, data-residency footprint, and governance surface. Because those properties constrain everything downstream, the pattern should be selected before the build-vs-buy decision.

Synonyms

AI serving architecture, inference-path pattern

See also

  • Model selection framework — An eight-criterion decision framework — capability, cost, latency, data residency, customization, operational maturity, exit cost, and license — for choosing a foundation model for a given use case.
  • Model routing — A pattern that routes each request to the cheapest model capable of handling it, escalating to more powerful models only when necessary — typically via a small classifier, confidence-based escalation, or response evaluation.
  • Data residency (AI) — The requirement that training data, retrieval data, and inference itself occur within a specified jurisdiction.
  • TTFT (time-to-first-token) — The latency from request submission to the first streamed output token.
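The escalation logic described under Model routing above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the model names, relative costs, confidence scores, and the fixed threshold are all hypothetical stand-ins for real inference calls and evaluation logic.

```python
# Minimal model-routing sketch: try models cheapest-first and escalate
# to a more powerful model while the response confidence is too low.
# All names, costs, and confidence values below are illustrative only.
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Model:
    name: str
    cost_per_1k_tokens: float  # relative cost; cheaper models are tried first
    answer: Callable[[str], Tuple[str, float]]  # returns (response, confidence)


def route(prompt: str, ladder: List[Model], threshold: float = 0.8) -> Tuple[str, str]:
    """Return (response, model_name) from the cheapest sufficiently confident model."""
    response, served_by = "", ""
    for model in sorted(ladder, key=lambda m: m.cost_per_1k_tokens):
        response, confidence = model.answer(prompt)
        served_by = model.name
        if confidence >= threshold:
            break  # cheapest model that clears the confidence bar wins
    return response, served_by


# Stub answerers standing in for real inference endpoints.
small = Model("small", 0.1, lambda p: ("short answer", 0.55))
large = Model("large", 1.0, lambda p: ("detailed answer", 0.95))

text, model_used = route("Explain TTFT", [large, small])
```

In this sketch the small model answers first but falls below the threshold, so the request escalates and `model_used` ends up as `"large"`. Real routers typically replace the confidence stub with a small classifier or a response evaluator, as the glossary entry notes.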

Related articles in the Body of Knowledge