Provenance

The record of origin and custody for a data asset — who collected it, from whom, under what legal basis, and through which hands it passed — required for auditability of high-risk AI under EU AI Act Article 10.

What this means in practice

Provenance is distinct from lineage: provenance records origin and custody (who held the data and on what basis), whereas lineage records the history of transformations applied to it.
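The distinction can be made concrete with a small data-structure sketch. The Python below is illustrative only: the class and field names (`ProvenanceRecord`, `CustodyTransfer`, `LineageStep`, `legal_basis`, and so on) are hypothetical and are not prescribed by the EU AI Act or by this glossary, and the parties and dates are made up. It assumes Python 3.9 or later.

```python
from dataclasses import dataclass, field
from datetime import date


@dataclass(frozen=True)
class CustodyTransfer:
    """One hand-off of the asset between parties (hypothetical fields)."""
    from_party: str
    to_party: str
    transferred_on: date


@dataclass
class ProvenanceRecord:
    """Origin and custody of a data asset; field names are illustrative."""
    asset_id: str
    collected_by: str      # who collected it
    collected_from: str    # from whom
    legal_basis: str       # under what legal basis
    custody_chain: list[CustodyTransfer] = field(default_factory=list)

    def current_custodian(self) -> str:
        # With no transfers recorded, the collector still holds the asset.
        if not self.custody_chain:
            return self.collected_by
        return self.custody_chain[-1].to_party

    def custody_is_unbroken(self) -> bool:
        # Each transfer must start from whoever held the asset before it.
        holder = self.collected_by
        for transfer in self.custody_chain:
            if transfer.from_party != holder:
                return False
            holder = transfer.to_party
        return True


@dataclass(frozen=True)
class LineageStep:
    """A transformation applied to the data; tracked apart from provenance."""
    operation: str
    input_asset_id: str
    output_asset_id: str


# Made-up example: one asset, collected once, handed over twice.
record = ProvenanceRecord(
    asset_id="survey-2024",
    collected_by="Clinic A",
    collected_from="patients",
    legal_basis="consent",
    custody_chain=[
        CustodyTransfer("Clinic A", "Vendor B", date(2024, 3, 1)),
        CustodyTransfer("Vendor B", "Model Team", date(2024, 4, 15)),
    ],
)

# Lineage describes what was done to the data, not who held it.
lineage = [LineageStep("deduplicate", "survey-2024", "survey-2024-dedup")]
```

In this sketch, `record.current_custodian()` answers "who holds the asset now", and `record.custody_is_unbroken()` checks that no hand-off is missing from the chain. Neither question can be answered from the `lineage` list, which is why the two records are kept separate.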

Synonyms

data provenance, dataset origin record

See also

  • Datasheet for datasets — A structured dataset documentation artifact covering motivation, composition, collection process, preprocessing, uses, distribution, and maintenance — modeled after electronic-component datasheets.
  • Third-party data readiness — The extension of data-readiness assessment to data supplied by vendors, partners, open-source corpora, or scraped sources — covering provenance, legal basis, contractual terms, known bias profile, and re-use constraints.
  • Data contract — A versioned, testable specification of a data product's schema, semantics, quality expectations, SLA, and change-management policy — published by the producer, consumable by downstream AI workloads.