Definition
Exact calculation formula
Unit of analysis
What you count: each question separately, or grouped per user?
Population
Which traces are included
Segments
Which breakdowns matter
Sampling
Full population or sample (and why you might not evaluate every single trace)
Aggregation
How you summarize: average, tail behavior (P95), or success-on-first-try?
Ownership
Who owns the pipeline, the definition, and investigations
Versioning
Refresh cadence, change tracking
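
The unit-of-analysis and aggregation choices above change the reported number, which a small sketch can make concrete. Everything here is an assumption for illustration: the trace records, the field layout (user id, a latency-like value where higher is worse, a first-try flag), and the function names are hypothetical, and the P95 uses the nearest-rank method, which is only one of several percentile conventions.

```python
import math
from collections import defaultdict
from statistics import mean

# Hypothetical trace records: (user_id, value, solved_on_first_try).
# "value" stands in for any per-trace measurement where higher is worse.
TRACES = [
    ("u1", 1.0, True),
    ("u1", 3.0, False),
    ("u2", 2.0, True),
    ("u3", 10.0, False),
]


def p95(values):
    """Nearest-rank 95th percentile of a non-empty list."""
    ordered = sorted(values)
    rank = math.ceil(0.95 * len(ordered))  # 1-based rank
    return ordered[rank - 1]


def summarize(traces):
    """Summarize the same traces under different units and aggregations."""
    values = [value for _, value, _ in traces]

    # Group by user so each user counts once, however many traces they have.
    per_user = defaultdict(list)
    for user, value, _ in traces:
        per_user[user].append(value)

    return {
        "mean_per_trace": mean(values),
        "mean_per_user": mean(mean(v) for v in per_user.values()),
        "p95": p95(values),
        "first_try_rate": sum(ok for _, _, ok in traces) / len(traces),
    }


if __name__ == "__main__":
    for name, number in summarize(TRACES).items():
        print(f"{name}: {number:.2f}")
```

On this toy data the per-trace mean and the per-user mean differ because one user contributes two traces, and the P95 is dominated by the single slow trace. That is why the definition should state the unit of analysis and the aggregation explicitly.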