Skip to content

Releases: m-lab/prometheus-support

New GCP object names + reboot-api updates

15 Jul 19:38
18db57f
Compare
Choose a tag to compare

This release features two items:

  • Prometheus will now monitor the newly renamed GCP objects in the platform cluster.
  • The reboot API is now at v0.1.2, and exposed Prometheus metrics on port 9990.

Updates legacy alerts to not fire on migrated machines

18 Jun 20:24
e60a123
Compare
Choose a tag to compare

We have some alerts configured for legacy metrics that will begin to fire as we migrate machines to the platform cluster. This PR includes updates to some of those legacy alerts which should prevent them from firing if the machine is already part of the platform cluster.

weekly release

18 Jun 15:13
c28e8c9
Compare
Choose a tag to compare

Add blackbox-exporter-ipv6 vm to Ops_OamOverview dashboard
#488

Adds scraping of the Linode VM for IPv6 monitoring.
#487

Break out 4xx and 5xx errors into different lines
#485

Docker image version updates + assorted fixes

10 Jun 18:06
0ceb8f4
Compare
Choose a tag to compare
  • snmp_exporter Docker image is now v0.15.0
  • gcp-service-discovery Docker image is now v1.5.0
  • Adds new alerts for the "host" experiment.
  • Removes alerts for nodeinfo now that nodeinfo is part of the "host" experiment.
  • Adds absent() alert for etcd metrics.

GMX and alertmanager-receiver updates + SRE rotation dashboard

03 Jun 17:28
3c3f929
Compare
Choose a tag to compare
v2.14.0

Upgrades GMX's image version from v0.1.2 to v0.2.0. (#471)

robot image and grafana upgrade

29 May 17:38
7121276
Compare
Choose a tag to compare

New alerts, dashboard fixes, updated reboot-api and snmp-exporter deployment

20 May 19:26
b731352
Compare
Choose a tag to compare
v2.12.0

Update reboot-api deployment to v0.1.1. (#465)

Rebot deployment

13 May 15:37
8094be8
Compare
Choose a tag to compare

Adds rebot deployment and changes to alerts.

Dashboard improvements and Reboot API deployment

06 May 15:59
c08de6a
Compare
Choose a tag to compare

Main changes:

  • Platform cluster etcd alerts + new WorkloadOverview dashboard
  • Improvements to the Pipeline Batch Throughput dashboard
  • Reboot API deployment

lower annotation rate bar for NDT tests & Update version of gcp-service-discovery

15 Apr 18:46
81f3939
Compare
Choose a tag to compare

#438
Update version of gcp-service-discovery

#432
low the triggering bar for annotation rate too low to 98% from 99%