@kalnem commented Oct 10, 2025

Pull Request

Required Fields

🔎 What kind of change is it?

  • feat

🎯 What has been changed and why do we need it?

Added Kubernetes pod monitoring panels to the Grafana dashboard.

Changes:

  • Added "Unscheduled Pods" panel - tracks pods that cannot be assigned to nodes due to resource constraints or scheduling issues
  • Added "Container Issues" panel - monitors containers in CrashLoopBackOff, ErrImagePull, or ImagePullBackOff states
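
For reviewers who want to sanity-check the data behind the two panels, here is a minimal sketch of PromQL expressions that could back them. The description above does not show the actual panel queries, so everything here is an assumption: it presumes kube-state-metrics is scraped and uses its `kube_pod_status_unschedulable` and `kube_pod_container_status_waiting_reason` metrics. The function names and the label grouping are hypothetical.

```python
# Hypothetical sketch: the real panel queries in this PR are not shown in the
# description. Assumes kube-state-metrics is the metric source.

# Waiting reasons the "Container Issues" panel is described as monitoring.
WAITING_REASONS = ("CrashLoopBackOff", "ErrImagePull", "ImagePullBackOff")


def unscheduled_pods_query() -> str:
    """Build a PromQL expression listing pods currently marked unschedulable."""
    return "sum by (namespace, pod) (kube_pod_status_unschedulable) > 0"


def container_issues_query(reasons=WAITING_REASONS) -> str:
    """Build a PromQL expression listing containers waiting for one of `reasons`."""
    reason_regex = "|".join(reasons)
    return (
        "sum by (namespace, pod, container, reason) ("
        f'kube_pod_container_status_waiting_reason{{reason=~"{reason_regex}"}}'
        ") > 0"
    )


if __name__ == "__main__":
    print(unscheduled_pods_query())
    print(container_issues_query())
```

Running the printed expressions in Grafana's Explore view against the same Prometheus data source is one way to compare them with what the panels display.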

Why we need it:
These panels provide visibility into pod and container health issues that were previously not monitored:

  • Unscheduled pods indicate infrastructure problems (insufficient CPU/memory, node selector mismatches, resource quotas)
  • Container issues surface application-level problems (crashes, image pull failures, misconfigurations)

Early detection of these issues enables faster incident response and reduces downtime.

After:
[Screenshot: dashboard with the new panels, 10 Oct 2025 at 08:58]

  • Note: The table fields have been removed from this image.

@kalnem requested a review from dotdc as a code owner October 10, 2025 12:04