Code of the Day
Advanced · Observability & Operations

Lab: diagnose in production

Choose the right signal and the right response when production misbehaves.

Lab · optional · Fundamentals · Advanced · 9 min
By the end of this lesson you will be able to:
  • Pick logs, metrics, or traces for a question
  • Alert on symptoms users feel
  • Prioritise correctly during an incident

Optional scenario lab. Operations is about seeing clearly and responding calmly. Practice both.

Scenarios: observability and incidents

  1. You need to follow one slow request as it flows across several services. Which signal?
  2. Which is the best thing to page a human about at 2am?
  3. Production is down and you've confirmed it's a recent deploy. First priority?
  4. A good postmortem focuses on systemic gaps (missing alert, fragile deploy), not on who typed the command.
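
To make the first scenario concrete, here is a minimal sketch of how one identifier can tie together the work a single request causes in several services. It uses only the Python standard library rather than a real tracing SDK. The service names, the `x-trace-id` header, and the simulated durations are all assumptions made up for illustration.

```python
import uuid
from dataclasses import dataclass


@dataclass
class Span:
    trace_id: str       # shared by every span belonging to one request
    service: str        # which service did the work
    duration_ms: float  # time spent in this service only (simulated)


SPANS: list[Span] = []  # stand-in for a tracing backend


def handle(service, headers, simulated_ms, downstream=()):
    # Reuse the caller's trace ID, or start a new trace at the edge.
    trace_id = headers.setdefault("x-trace-id", uuid.uuid4().hex)
    for call in downstream:
        call(headers)  # the same headers travel to the next hop
    SPANS.append(Span(trace_id, service, simulated_ms))
    return trace_id


# Hypothetical services with made-up timings.
def db(headers):
    return handle("db", headers, 480.0)


def auth(headers):
    return handle("auth", headers, 12.0)


def gateway(headers):
    return handle("gateway", headers, 5.0, downstream=(auth, db))


def slowest_span(trace_id):
    # Filter to one request, then find where the time went.
    spans = [s for s in SPANS if s.trace_id == trace_id]
    return max(spans, key=lambda s: s.duration_ms)


trace_id = gateway({})
print(slowest_span(trace_id).service)
```

Because every hop records the same ID, a question about one request becomes a filter on that ID. Aggregated metrics cannot answer it, and logs can only do so if each service happens to log the same ID.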

See clearly, alert on what matters, mitigate before you diagnose, and learn without blame.
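
As a rough sketch of "alert on what matters", the check below pages on a symptom users feel (the share of failed requests) instead of an internal cause such as CPU load. The function name, the 5% threshold, and the minimum-traffic guard are assumptions chosen for illustration, not recommended values.

```python
def should_page(total_requests, failed_requests,
                max_error_ratio=0.05, min_requests=100):
    """Return True when the user-facing error ratio warrants waking someone."""
    # With very little traffic a handful of failures is mostly noise,
    # so this sketch declines to page below a minimum request count.
    if total_requests < min_requests:
        return False
    return failed_requests / total_requests > max_error_ratio


print(should_page(1000, 80))  # 8% of requests failing
print(should_page(1000, 10))  # 1% of requests failing
print(should_page(20, 10))    # too little traffic to judge
```

In practice the threshold would normally come from an agreed service-level objective, and the minimum-traffic guard is a trade-off: it cuts noisy pages but can hide a real outage on a quiet service.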
