COMPEL™ Body of Knowledge v2.5

Online evaluation

Assessment of an AI system under live traffic using randomized or sequential experimental designs, such as an A/B test, a multi-armed bandit, a canary release, or interleaving.
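As an illustration of the simplest of these designs, the sketch below shows one common way an A/B test might be wired up: users are assigned to an arm by hashing their ID, and the two conversion rates are compared with a pooled two-proportion z-test. The function names, the `experiment` salt, and the choice of test statistic are assumptions made for this example, not part of the COMPEL definition.

```python
import hashlib
import math


def assign_arm(user_id: str, experiment: str, treatment_share: float = 0.5) -> str:
    """Deterministically map a user to 'control' or 'treatment'.

    Hashing (experiment, user_id) keeps a user in the same arm across
    requests without storing any assignment state.
    """
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).hexdigest()
    bucket = int(digest[:8], 16) / 0x100000000  # roughly uniform in [0, 1)
    return "treatment" if bucket < treatment_share else "control"


def two_proportion_z(success_a: int, n_a: int, success_b: int, n_b: int):
    """Return (z, two-sided p-value) for the difference in success rates B - A."""
    p_a = success_a / n_a
    p_b = success_b / n_b
    pooled = (success_a + success_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail
    return z, p_value


# Hypothetical counts: 100/1000 conversions in control, 150/1000 in treatment.
z, p = two_proportion_z(100, 1000, 150, 1000)
print(assign_arm("user-42", "ranker-v2"), round(z, 2), p < 0.05)
```

A fixed-horizon test like this assumes the sample size was chosen before the experiment started; checking the p-value repeatedly while traffic accumulates inflates the false-positive rate.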

What this means in practice

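Online evaluation is the only evaluation mode that measures true user-facing outcomes, because it observes real users interacting with the system. Governance constraints on a live test include blast radius (how many users are exposed), reversibility (how quickly the change can be rolled back), and regulatory exposure while the test is running.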
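One way blast radius and reversibility might be enforced in code is a guard that caps the share of traffic a canary may receive and rolls back when a guardrail metric is breached. The sketch below is illustrative only: `CanaryPolicy`, `next_action`, and the thresholds are hypothetical names and values, and regulatory exposure is not modeled because it usually requires review outside the code path.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CanaryPolicy:
    max_traffic_share: float  # blast-radius cap, e.g. 0.05 for 5% of traffic
    max_error_rate: float     # guardrail: roll back above this error rate
    min_requests: int         # do not judge the canary on fewer requests


def next_action(policy: CanaryPolicy, traffic_share: float,
                requests: int, errors: int) -> str:
    """Return 'rollback', 'hold', or 'promote' for the current canary state."""
    if traffic_share > policy.max_traffic_share:
        return "rollback"  # exposure exceeds the approved blast radius
    if requests < max(policy.min_requests, 1):
        return "hold"      # too little data to decide either way
    if errors / requests > policy.max_error_rate:
        return "rollback"  # guardrail breached, revert to the baseline
    return "promote"


policy = CanaryPolicy(max_traffic_share=0.05, max_error_rate=0.02, min_requests=500)
print(next_action(policy, traffic_share=0.01, requests=1000, errors=50))
```

Keeping the decision in a pure function makes the rollback rule easy to review and test independently of the deployment tooling that acts on it.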

Synonyms

online test, live-traffic evaluation, production evaluation

See also

Offline evaluation — Assessment of an AI system against static datasets — training hold-out, validation set, benchmark corpus — without exposure to live user traffic.
Multi-armed bandit — An online experimentation strategy that shifts traffic toward better-performing variants during the experiment — trading statistical power for exploitation of early wins.
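The traffic-shifting behavior described in the multi-armed bandit entry can be sketched with Thompson sampling, which is one of several bandit strategies and is chosen here only for brevity. The simulation assumes binary rewards with fixed, hypothetical success rates per variant; `thompson_sampling` and its parameters are names invented for this example.

```python
import random


def thompson_sampling(true_rates, rounds, seed=0):
    """Simulate Beta-Bernoulli Thompson sampling; return pulls per arm.

    true_rates holds the (normally unknown) success probability of each
    variant and is used only to simulate user responses.
    """
    rng = random.Random(seed)
    n_arms = len(true_rates)
    successes = [0] * n_arms
    failures = [0] * n_arms
    pulls = [0] * n_arms
    for _ in range(rounds):
        # Draw a plausible success rate for each arm from its Beta posterior
        # (uniform Beta(1, 1) prior), then serve the arm with the highest draw.
        samples = [rng.betavariate(successes[i] + 1, failures[i] + 1)
                   for i in range(n_arms)]
        arm = max(range(n_arms), key=samples.__getitem__)
        pulls[arm] += 1
        if rng.random() < true_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return pulls


pulls = thompson_sampling([0.05, 0.20], rounds=2000)
print(pulls)  # the second, better-performing arm should receive most traffic
```

Because the weaker arm receives fewer observations as the experiment proceeds, its estimated rate stays noisy, which is the loss of statistical power the entry refers to.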

Related articles in the Body of Knowledge

Online Evaluation

Cite this article

Author: FlowRidge Team
Publisher: FlowRidge
First Published: 2026
Work: COMPEL AI Transformation Body of Knowledge

Academic (APA)

FlowRidge Team. (2026). Online evaluation — COMPEL Glossary. COMPEL AI Transformation Body of Knowledge. FlowRidge. Retrieved from https://www.compelframework.org/glossary/online-evaluation

BibTeX

@misc{compel-online-evaluation-2026,
  author = {{FlowRidge Team}},
  title = {Online evaluation — COMPEL Glossary},
  howpublished = {COMPEL AI Transformation Body of Knowledge},
  publisher = {FlowRidge},
  year = {2026},
  url = {https://www.compelframework.org/glossary/online-evaluation},
  note = {Governed by the COMPEL Framework License Agreement}
}

Plain text

FlowRidge Team. Online evaluation — COMPEL Glossary. COMPEL AI Transformation Body of Knowledge. FlowRidge, 2026. https://www.compelframework.org/glossary/online-evaluation

This content is part of the COMPEL AI Transformation Body of Knowledge, governed by the COMPEL Framework License Agreement. See /license for terms.