COMPEL™ Body of Knowledge v2.5

COMPEL Glossary / data-leakage

Data leakage

Information from the test or validation set inadvertently entering training — through preprocessing, feature engineering, target encoding, or time-ordered splits — inflating offline metrics and producing over-optimistic ship decisions.

What this means in practice

A leading cause of offline-to-online performance gaps; defense requires disciplined split protocols and temporal holdouts.

Synonyms

target leakage , feature leakage , evaluation leakage

See also

Offline evaluation — Assessment of an AI system against static datasets — training hold-out, validation set, benchmark corpus — without exposure to live user traffic.
Benchmark contamination — The presence of benchmark test data in foundation-model training corpora — whether through web crawling or deliberate inclusion — inflating reported benchmark scores and breaking the comparability of benchmark results across models.
Reproducibility — The property that re-running an experiment with the same code, data, and configuration produces the same results within declared tolerance.

Cite this article

Author:: FlowRidge Team
Publisher:: FlowRidge
First Published:: 2026
Work:: COMPEL AI Transformation Body of Knowledge

Academic (APA)

FlowRidge Team. (2026). Data leakage — COMPEL Glossary. COMPEL AI Transformation Body of Knowledge. FlowRidge. Retrieved from https://www.compelframework.org/glossary/data-leakage

BibTeX

@misc{compel-data-leakage-2026,
  author = {{FlowRidge Team}},
  title = {Data leakage — COMPEL Glossary},
  howpublished = {COMPEL AI Transformation Body of Knowledge},
  publisher = {FlowRidge},
  year = {2026},
  url = {https://www.compelframework.org/glossary/data-leakage},
  note = {Governed by the COMPEL Framework License Agreement}
}

Plain text

FlowRidge Team. Data leakage — COMPEL Glossary. COMPEL AI Transformation Body of Knowledge. FlowRidge, 2026. https://www.compelframework.org/glossary/data-leakage

Need Chicago, IEEE, or MLA formats? See the full COMPEL Citation Guide for every supported format with copy-ready snippets.

This content is part of the COMPEL AI Transformation Body of Knowledge, governed by the COMPEL Framework License Agreement. See /license for terms.