Skip to content

Releases: m-lab/prometheus-support

fix the hard coded data source problem for rate limit dashboard

11 Apr 15:43
13d9d13
Compare
Choose a tag to compare

#431

remove hard coded project name from dashboard json file

Create new dashboard which calculate the counts & % that rate limits are triggered

11 Apr 15:00
ca32818
Compare
Choose a tag to compare

#429

Add sql to calculate the counts that rate limits are triggered

Alerts for platform-cluster & monitoring for six-hour mlab-ns clients

01 Apr 21:02
2db2fe0
Compare
Choose a tag to compare

Add bq exporter query to monitor mlab-ns 6-hour clients (#422 )
Expands k8s-prometheus scrape job and adds three new k8s-platform alerts (#420 & #421)

Removed prometheus 1.8 config, new alerts and dashboards

04 Mar 16:43
3443afd
Compare
Choose a tag to compare

Changes include:

  • Removed Prometheus 1.8 configuration
  • Updated Annotation & Gardener dashboards
  • Updated gcp-service-discovery version to v1.3.1
  • New alerts:
    • Epoxy server is not online
    • Nodes that fail to boot successfully
    • Too many AppEngine versions
    • Too many inactive AppEngine instances
  • Updates to k8s dashboards

Updated Grafana to v5.4.3

11 Feb 16:47
65330cd
Compare
Choose a tag to compare

Update GMX to 0.1.2

04 Feb 15:49
d3d8691
Compare
Choose a tag to compare

Updated GMX's version number to deploy new version including @nkinkade 's bugfixes.

Rerelease snmp scraping from internal GKE cluster

28 Jan 22:44
92ea533
Compare
Choose a tag to compare
Merge pull request #394 from m-lab/sandbox-roberto

Change high disk usage threshold for NPAD to 9GB

Revert "Use snmp service running on gke cluster"

08 Jan 16:20
533dc86
Compare
Choose a tag to compare
Merge pull request #385 from m-lab/sandbox-roberto

Revert "Use snmp service running on gke cluster"

snmp_exporter + kubeIP as k8s deployment

08 Jan 12:17
2fe6994
Compare
Choose a tag to compare

Changes include:

  • Added snmp_exporter and kubeIP as Kubernetes deployments
  • Added rebot on EB as scraping target
  • Several improvements to monitoring and alerts
  • Disable collection of gardener traceroute metrics
  • Improvements to dashboards

Dashboard improvements, jsonlint and monitoring of SSH on port 22

12 Dec 16:19
48257da
Compare
Choose a tag to compare

Changes include:

  • Added GMX and lame-duck status for each node in the Ops: Pod overview dashboard
  • Added monitoring of SSH running on port 22
  • Fixed CPU panel & several improvements to the Pipeline Annotation Service dashboard
  • Added jsonlint to .travis.ci