Overflows should emit logs and metrics #75

carsonip · 2023-08-08T16:13:21Z

Related to elastic/apm-server#11362 and elastic/apm-server#11117

Overflows should emit logs and metrics such that overflows are observable.
Create a dashboard to observe overflows.

carsonip · 2023-08-14T17:32:35Z

In apm-server for txmetrics we had:

	totalOverflow := metrics.servicesOverflow + metrics.perSvcTxnGroupsOverflow + metrics.txnGroupsOverflow
	monitoring.ReportInt(V, "active_groups", metrics.activeGroups)
	monitoring.ReportNamespace(V, "overflowed", func() {
		monitoring.ReportInt(V, "services", metrics.servicesOverflow)
		monitoring.ReportInt(V, "per_service_txn_groups", metrics.perSvcTxnGroupsOverflow)
		monitoring.ReportInt(V, "txn_groups", metrics.txnGroupsOverflow)
		monitoring.ReportInt(V, "total", totalOverflow)
	})

@axw do you think #86 provides sufficient insight into overflows? It does seem that it lacks the granularity that we had in the past, but I'm not sure if that's useful. Given that the next step will be to build dashboards on top of the new metrics in #86 , just wanted to confirm we are happy with what we have now after #86.

axw · 2023-08-15T00:57:47Z

I think we should go with what we have, and add to it as needed. It may not be complete, but I think what's there is correct.

Regarding what we used to have:

I think it should be possible to measure active_groups from the indexed metric documents
overflowed.total doesn't seem useful if we know the number of metric-type overflows
overflowed.per_service_txn_groups doesn't seem useful either, since it isn't per-service (and I think we probably shouldn't make it per-service, as it could lead to way too many metrics)

carsonip · 2023-08-15T12:49:19Z

Closing as all work on the topic has been completed.

carsonip mentioned this issue Aug 8, 2023

LSM-based aggregation elastic/apm-server#11117

Merged

5 tasks

carsonip self-assigned this Aug 11, 2023

carsonip mentioned this issue Aug 13, 2023

Emit logs and metrics about overflow #84

Closed

axw mentioned this issue Aug 14, 2023

Report aggregation overflows as OTel metrics #86

Merged

carsonip mentioned this issue Aug 14, 2023

Add logging for overflow #87

Merged

carsonip closed this as completed Aug 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overflows should emit logs and metrics #75

Overflows should emit logs and metrics #75

carsonip commented Aug 8, 2023

carsonip commented Aug 14, 2023

axw commented Aug 15, 2023

carsonip commented Aug 15, 2023

Overflows should emit logs and metrics #75

Overflows should emit logs and metrics #75

Comments

carsonip commented Aug 8, 2023

carsonip commented Aug 14, 2023

axw commented Aug 15, 2023

carsonip commented Aug 15, 2023