Weekly release: Add Ops Overview dashboard & Alertmanager with Basic Auth
This release includes:
Grafana Updates / Fixes:
- Add dashboard -- Ops: Platform Overview
- Increase the nginx ingress's proxy-connect-timeout to exceed the Prometheus query timeout. Should fix "Gateway Timeout" errors.
Alerts Changes
- Alertmanager links sent to slack will have basic auth credentials embedded. So, clicking on those links should "just work" without prompting for a username / password.
- Adds new alert case for NagiosExporterUnavailable
- Updates ParserDailyVolumeTooLow to only count rows that use status="ok".