What provenance contains
Each result includes aprovenance object:
| Field | Meaning |
|---|---|
tenant_id | The tenant that owns the record. |
dataset_id | The dataset it belongs to. |
source / source_key | The source and the specific object the record was ingested from. |
source_etag | The version of that source object, so you can tell which revision produced the record. |
line | The line or offset within the source object. |
transform_version | The parser/transform revision that produced the record. |
Why it matters
- Traceability. From any result you can find the exact source object and line, which makes debugging and auditing concrete rather than approximate.
- Reproducibility.
source_etagandtransform_versionpin a result to a specific revision of the data and the code that parsed it. - Isolation.
tenant_idon every record means tenant isolation holds at the data layer, not only at the query layer.
Provenance is attached at ingestion to every record, so it is present on results across all search modes.