Level 8 / Project 15 - Level 8 Mini Capstone¶

Learn Your Way¶

Read	Build	Watch	Test	Review	Visualize	Try
—	This project	—	—	Flashcards	—	—

Focus¶

Integration of multiple observability subsystems into one platform
Service registration with health tracking and metric collection
Threshold-based alerting with severity levels
Facade pattern for unified access to metrics, alerts, and health
Comprehensive reporting with drill-down per service

Why this project exists¶

This capstone integrates concepts from the entire level: KPI dashboards, response profiling, SLA monitoring, fault injection, and graceful degradation. Real observability platforms (Datadog, Grafana, New Relic) unify metrics, health checks, and alerting into a single pane of glass. This project builds a mini observability platform that monitors simulated services, detects degradation, generates alerts, and produces a unified health report — proving you can design systems that compose multiple subsystems into a coherent whole.

Run (copy/paste)¶

cd <repo-root>/projects/level-8/15-level8-mini-capstone
python project.py --demo
pytest -q

Expected terminal output¶

{
  "platform_health": "warning",
  "services": [...],
  "alerts": [...],
  "metrics_summary": {...}
}
7 passed

Expected artifacts¶

Console JSON output with full observability platform report
Passing tests
Updated notes.md

Alter it (required)¶

Add a dashboard endpoint that returns all services' health, alerts, and metrics in one JSON response.
Add alert severity levels (warning, critical) with different threshold multipliers.
Add a --simulate flag that generates random metrics and demonstrates real-time alerting.

Break it (required)¶

Record metrics for a service that was never registered — does record_metric handle it?
Set alert_threshold to 0 — does every metric trigger an alert?
Call generate_report() with no services registered — does it produce a valid report?

Fix it (required)¶

Add validation that services must be registered before recording metrics.
Guard against alert_threshold <= 0 with a minimum value.
Add a test for the empty-platform edge case.

Explain it (teach-back)¶

How does this capstone integrate metrics, alerting, and health into a unified platform?
What is the facade pattern and how does ObservabilityPlatform use it?
Why are mean and p95 both tracked — when does each matter?
How would you extend this to support distributed tracing across services?

Mastery check¶

You can move on when you can: - explain how the capstone combines patterns from projects 01-14, - add a new subsystem (e.g. log aggregation) to the platform without modifying existing code, - describe the three pillars of observability: metrics, logs, and traces, - design an alerting strategy that avoids alert fatigue while catching real incidents.

← Prev	Home	Next →