Releases: m-lab/prometheus-support
v2.61.5
What's Changed
- Adds new "fqdn" label to script-exporter metrics by @nkinkade in #954
- Specify the prometheus scrape interval by @stephen-soltesz in #957
- Smoother predictions for disk utilization by @stephen-soltesz in #956
Full Changelog: v2.61.4...v2.61.5
v2.61.4
What's Changed
- Update Boot_MachineFailedToBoot to ops-tracker repo by @cristinaleonr in #953
Full Changelog: v2.61.3...v2.61.4
v2.61.3
v2.61.2
v2.61.1
What's Changed
- Adds alert for when mlab-ns resp code metric is missing by @nkinkade in #946
- Removes resource definitions for prometheus by @nkinkade in #948
- Adds tolerations to run on prometheus pool nodes by @nkinkade in #949
- Add HBS monitoring panels by @cristinaleonr in #950
Full Changelog: v2.61.0...v2.61.1
v2.61.0
What's Changed
- Use GitHub provider with oauth2-proxy for prometheus & grafana by @stephen-soltesz in #940
- Deploy main branch by @stephen-soltesz in #941
- Adds a Cloud Build configuration file by @nkinkade in #938
- Update links referencing main branch by @stephen-soltesz in #942
- Remove "in progress" label by @stephen-soltesz in #943
- Use a custom image location for stackdriver_exporter by @nkinkade in #944
Full Changelog: v2.60.0...v2.61.0
v2.60.0
What's Changed
- Adds a new GKE Datasource variable to SiteOverview by @nkinkade in #932
- Add Gardener errors panel by @stephen-soltesz in #933
- Add gardener_jobs_total error rate to error panel by @stephen-soltesz in #934
- Add mixed data source for global test rates with locate queries by @stephen-soltesz in #935
- Add k8s v1.22 support for nginx IngressClass by @stephen-soltesz in #936
- Update GCS transfer alert to ignore archive-mlab-oti by @stephen-soltesz in #939
Full Changelog: v2.59.0...v2.60.0
v2.59.0
- Updates several dashboards to allow switching on site type #925
- Adds a new 'k8s: CoreDNS Overview' dashboard #926
- Lets a node be down for 1d before firing an alert #927
- Increase alert wait time to 3h for bqx frequency #928
- Updates EtcdOverview dashboard to use the new name for a metric #929
- Add initial dashboard for Locate service #930
- Add a new service discovery container using -project=LOCATE_PROJECT #931
v2.58.0
What's Changed
- Delete data-processing-cluster datasource from Grafana by @stephen-soltesz in #911
- Use promtool from docker image by @stephen-soltesz in #915
- Add other datatypes and set longer annotation threshold by @stephen-soltesz in #916
- Fixes the "Cluster Versions" panel in the SRE Overview dashboard by @nkinkade in #917
- Update Pipeline: Overview dashboard by @stephen-soltesz in #918
- Remove legacy datasource variable by @stephen-soltesz in #919
- Update v1beta1 k8s resources to v1 by @stephen-soltesz in #920
- Split GardenerFailureRateTooHighOrMissing into two alerts by @stephen-soltesz in #921
- Use log scale for uptime by @stephen-soltesz in #922
- Add average alerts last 30days by @stephen-soltesz in #923
- Bumps GMX image version to v1.4.0 by @nkinkade in #924
Full Changelog: v2.57.0...v2.58.0
v2.57.0
What's Changed
- Remove v1 gardener alert by @stephen-soltesz in #904
- Remove alert for annotation service by @stephen-soltesz in #905
- Updates cert-manager to most recent version (v1.8.0) by @nkinkade in #906
- Add downloader metrics to data-processing cluster by @stephen-soltesz in #907
- Update go version in .travis.yml to 1.18 by @robertodauria in #908
- Increase Prometheus disk size on sandbox by @robertodauria in #909
- Target the original 'locate' service in script-exporter by @cristinaleonr in #913
- Replaces datasource UID with $datasource by @nkinkade in #914
Full Changelog: v2.56.0...v2.57.0