Browse lessons

Week 1: Foundations and Economics

Week 2: Instrumentation and Reliability Engineering

Week 3: Rigorous Measurement of Output Success and Failure

Week 4: Metric Design and Business Outcome Linkage

Week 5: Pipelines, Experiments, and Continuous Validation

Week 6: Decision-Making and Organization

Week 2: Instrumentation and Reliability Engineering · Lesson 2.6

Designing instrumentation that scales with the product

How do we keep observability useful at scale without drowning in data?

Retired course. Due to the fast pace of AI, this course was retired before full release. Exercises, datasets, and videos referenced in this lesson are not available. The slide content and frameworks remain free to study.

Slide 1 of 19

Reader Notes

Lesson 2.3 built full instrumentation: spans, tool calls, intermediate outputs, confidence signals. Everything needed for debugging and evaluation. Full instrumentation. Zero compromises. And it works great in development. In staging. In a demo environment with 50 test queries. Now it's going to production scale. 100,000 queries per day. At that volume, logging everything on a managed observability platform becomes shockingly expensive. For teams using managed observability platforms, trace storage and evaluation can become a significant infrastructure cost, sometimes rivaling model inference itself. A common scenario: a team ships their beautiful instrumentation spec from development into production, and two months later the CFO is asking why the observability line item is bigger than cloud compute. That's not a conversation anyone wants to have. This lesson teaches how to design a sampling strategy (a plan for which traces to keep and which to skip) that preserves the ability to detect failures while keeping costs under control. The goal is to solve for three competing constraints simultaneously: evidence sufficiency, cost limits, and operational reliability. By the end, a production-ready sampling spec that can be handed to an infrastructure team will be complete. A document that justifies itself to the CFO. Not a guess. A calculation.

Go deeper with AI Analytics for Builders

5-week course: metrics, root cause analysis, experimentation, and storytelling. Think like a Product Data Scientist.

See full curriculum

Book 1-on-1 with Shane

30-minute AI evals Q&A. Talk through your specific evaluation challenges and get hands-on guidance.

Book 1-on-1 session

★

Finished all 36 lessons? Take the exam and get your free AI Evals certification.

→