🧠 SRE Lab – Observability & Monitoring Stack

This project sets up a hands-on Site Reliability Engineering (SRE) lab that demonstrates how to monitor applications and infrastructure using a modern open-source observability stack.

It includes preconfigured services for metrics, logs, and dashboards, designed to mirror production-grade reliability workflows.

🧩 Features

Prometheus – scrapes and stores time-series metrics from monitored services
Grafana – visualizes metrics in dashboards and supports alerting
Loki & Promtail – centralized log aggregation and querying (Promtail ships logs, Loki stores them and serves queries)
Docker Compose – runs the multi-container stack locally
Environment Variables (.env) – configurable ports, data paths, and credentials
Modular design – can be extended with Kubernetes, Alertmanager, or Slack alerting

⚙️ Architecture Overview

                ┌────────────┐
                │  Promtail  │───► Logs ───► Loki
                └────────────┘
                       │
                       ▼
┌────────────┐    ┌────────────┐    ┌────────────┐
│ App (Flask)│───►│ Prometheus │───►│  Grafana   │
└────────────┘    └────────────┘    └────────────┘

🚀 Getting Started

1️⃣ Clone the Repository

git clone https://github.com/pmoise1981/sre-lab.git
cd sre-lab

2️⃣ Create Environment File

Copy the example environment file:

cp .env.example .env
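
After copying the file, it can be useful to confirm that .env still defines every key listed in .env.example, especially once the example file gains new settings. The following is a minimal sketch that assumes both files use simple KEY=VALUE lines; it does not reproduce Docker Compose's full .env syntax (for instance, variable interpolation), and the function names are illustrative rather than part of this repository.

```python
from pathlib import Path


def load_env(path=".env"):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def missing_keys(example_path=".env.example", env_path=".env"):
    """Return keys that appear in the example file but not in .env."""
    return sorted(set(load_env(example_path)) - set(load_env(env_path)))
```

Calling missing_keys() from the repository root returns an empty list when the two files define the same keys.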

3️⃣ Start the Stack

docker compose up -d

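Once the containers are running, the services are available at:

Grafana: http://localhost:3000
Prometheus: http://localhost:9090

If you changed the ports in .env, adjust the URLs accordingly.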
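
Containers can take a few seconds to become ready after docker compose up -d returns. The sketch below polls the health endpoints until both services answer. It assumes the default ports shown above and uses the standard health paths that Grafana (/api/health) and Prometheus (/-/healthy) expose; the helper names are illustrative and not part of this repository.

```python
import http.client
import time
import urllib.error
import urllib.request

# Assumed defaults: ports from this README, standard health paths.
HEALTH_URLS = {
    "grafana": "http://localhost:3000/api/health",
    "prometheus": "http://localhost:9090/-/healthy",
}


def is_healthy(url, timeout=2.0):
    """Return True if the URL answers with HTTP 200 within the timeout."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        return False


def wait_for_stack(urls=HEALTH_URLS, attempts=10, delay=3.0):
    """Poll all URLs until every one is healthy or attempts run out.

    Returns a dict mapping each service name to its last observed status.
    """
    status = {}
    for attempt in range(attempts):
        status = {name: is_healthy(url) for name, url in urls.items()}
        if all(status.values()):
            break
        if attempt < attempts - 1:
            time.sleep(delay)
    return status
```

Running print(wait_for_stack()) after starting the stack shows which services responded; any False entry is a hint to check docker compose ps and the container logs.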

📊 Prebuilt Dashboards

System Metrics Dashboard: CPU, memory, disk usage
Container Health Dashboard: Uptime, restart count, latency
Application Metrics (optional): Integrates with Flask or FastAPI exporters
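
The data behind these dashboards can also be inspected directly through the Prometheus HTTP API, which helps when a panel shows no data. The following sketch builds an instant-query URL and parses the JSON response. It assumes Prometheus is reachable at the default address from this README; the helper names are illustrative.

```python
import json
import urllib.parse
import urllib.request

PROMETHEUS_URL = "http://localhost:9090"  # assumed default from this README


def build_query_url(expr, base_url=PROMETHEUS_URL):
    """Build an instant-query URL for the Prometheus HTTP API."""
    return f"{base_url}/api/v1/query?" + urllib.parse.urlencode({"query": expr})


def parse_instant_vector(payload):
    """Return (labels, value) pairs from an instant-query response."""
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    data = payload["data"]
    if data["resultType"] != "vector":
        raise ValueError(f"expected a vector, got {data['resultType']}")
    # Each sample is [unix_timestamp, "value as string"].
    return [(s["metric"], float(s["value"][1])) for s in data["result"]]


def query(expr, base_url=PROMETHEUS_URL, timeout=5.0):
    """Run an instant query against a live Prometheus server."""
    url = build_query_url(expr, base_url)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_instant_vector(json.load(resp))
```

For instance, query("up") returns one entry per scrape target, with a value of 1.0 for targets that Prometheus scraped successfully and 0.0 for those it could not reach.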

🧱 Tech Stack

Prometheus · Grafana · Loki · Promtail · Docker Compose · Linux · .env

🧩 Future Enhancements

Add Alertmanager + Slack/Email alerts
Add Service-Level Indicators (SLIs) and Service-Level Objectives (SLOs)
Integrate OpenTelemetry exporters for tracing
Add Kubernetes manifests for production-grade orchestration
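
As a rough illustration of what the SLI/SLO item could involve, the sketch below computes an availability SLI and the share of error budget remaining for a given SLO target. The function names and the request counts in the usage note are made up for illustration; nothing here is implemented in the lab yet.

```python
def availability_sli(good, total):
    """Fraction of requests that succeeded (1.0 when there was no traffic)."""
    return 1.0 if total == 0 else good / total


def error_budget_remaining(good, total, slo_target):
    """Share of the error budget still unspent.

    1.0 means no budget used; 0.0 or below means the budget is exhausted.
    """
    allowed_failures = (1.0 - slo_target) * total
    actual_failures = total - good
    if allowed_failures == 0:
        # No traffic or a 100% target: any failure exhausts the budget.
        return 1.0 if actual_failures == 0 else 0.0
    return 1.0 - actual_failures / allowed_failures
```

With 100,000 requests, 50 failures, and a 99.9% target, the budget allows about 100 failures, so roughly half of it remains.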

👨🏾‍💻 Author

Pierre Moise
Site Reliability & DevOps Engineer | Observability, CI/CD, Cloud Automation
📎 GitHub
