receiver/prometheusreceiver: add option to fallback to collector starttime #36365

ridwanmsharif · 2024-11-14T02:37:51Z

Description

This change adds an option to the metric adjuster to use an approximation of the collector starttime as a fallback for the start time of scraped cumulative metrics. This is useful when no start time is found and when the collector starts up alongside its targets (like in serverless environments or sidecar approaches).

Link to tracking issue

Fixes #36364

Testing

Added unit test for this config option

Documentation

Config option added to the README.

…ttime This change adds an option to the metric adjuster to use an approximation of the collector starttime as a fallback for the start time of scraped cumulative metrics. This is useful when no start time is found and when the collector starts up alongside its targets (like in serverless environments or sidecar approaches). Signed-off-by: Ridwan Sharif <ridwanmsharif@google.com>

ArthurSens · 2024-11-19T19:22:32Z

The code itself looks correct!

To be completely honest, I'm pretty new to this component and haven't used it myself. I noticed we have waaaay too many fallbacks for Created Timestamps; that looks weird 🤔.

Do I understand correctly that the flow is like this:

If metric StartTimeUnixNano is set, use that to populate the Created Timestamp from the prometheus/client_golang SDK.
If StartTimeUnixNano is not set, get it from another metric called process_start_time_seconds (where does that come from?)
Finally, if we still don't have a timestamp, we use the collector start time.

Did I get the flow correctly?

It makes me wonder, when would an OpenTelemetry metric not have StartTimeUnixNano not set? I understand this is not a required field in the spec, but maybe we could work on making it required instead?

receiver/prometheusreceiver/README.md

dehaansa · 2024-11-24T05:35:16Z

receiver/prometheusreceiver/internal/starttimemetricadjuster.go

+func NewStartTimeMetricAdjuster(logger *zap.Logger, startTimeMetricRegex *regexp.Regexp, useCollectorStartTimeFallback bool) MetricsAdjuster {
+	var fallbackStartTime *time.Time
+	if useCollectorStartTimeFallback {
+		now := time.Now()


Rather than assume that this is always called at/near start time of collector, should we use the github.com/shirou/gopsutil/v4/host package to request uptime like hostmetricsreceiver does for boottime?

Is that correct in a containerized environment, or would it give the start time of the host?

I don't know the answer to that question, would need to be tested (I don't currently have capacity to test myself).

Aneurysm9 · 2024-11-25T16:52:27Z

receiver/prometheusreceiver/internal/starttimemetricadjuster.go

+func NewStartTimeMetricAdjuster(logger *zap.Logger, startTimeMetricRegex *regexp.Regexp, useCollectorStartTimeFallback bool) MetricsAdjuster {
+	var fallbackStartTime *time.Time
+	if useCollectorStartTimeFallback {
+		now := time.Now()


This won't be the collector start time, but the start time of this instance of the component. If the pipeline is stopped and restarted then this will result in a different timestamp even though the process has not restarted. This is a subtlety, but could be very confusing for someone using OpAMP to manage collector instances. I'm not sure whether there's a way to get a reliable process start time from the collector host. The system uptime is almost certainly not the correct value to use here. Perhaps the best option is just to clarify in the description of the new configuration field that the approximated start time will be relative to the component start time and not necessarily the collector start time.

can we populate it using an init function, or a variable outside of metric adjuster?

Populating a variable in an init function could work. It would be executed once near the start of the process.

Aneurysm9 · 2024-11-25T16:54:02Z

receiver/prometheusreceiver/internal/starttimemetricadjuster.go

+		if stma.fallbackStartTime == nil {
+			return err
+		}
+		stma.logger.Warn("Couldn't get start time for metrics. Using fallback start time.", zap.Error(err))


Does this need to be at the Warn level? Wouldn't this be a fairly high-rate log entry if none of the processed metrics have a start time?

Aneurysm9 · 2024-11-25T16:57:20Z

It makes me wonder, when would an OpenTelemetry metric not have StartTimeUnixNano not set? I understand this is not a required field in the spec, but maybe we could work on making it required instead?

This is the prometheus receiver, so the concern here is prometheus metrics not having a start time and we want to ensure that the pdata metrics produced by the receiver do have a start time. The flow you described appears correct, though it is in the context of populating StartTimeUnixNano instead of looking to it for a value.

github-actions · 2024-12-10T05:21:06Z

This PR was marked stale due to lack of activity. It will be closed in 14 days.

github-actions · 2024-12-25T05:21:27Z

This PR was marked stale due to lack of activity. It will be closed in 14 days.

ridwanmsharif requested review from dashpole and a team as code owners November 14, 2024 02:37

github-actions bot assigned bogdandrutu Nov 14, 2024

github-actions bot added the receiver/prometheus Prometheus receiver label Nov 14, 2024

github-actions bot requested a review from Aneurysm9 November 14, 2024 02:38

ridwanmsharif force-pushed the ridwanmsharif/starttime-fallback branch 2 times, most recently from 4ac0c41 to 05b85e8 Compare November 14, 2024 02:50

ridwanmsharif force-pushed the ridwanmsharif/starttime-fallback branch from 05b85e8 to d67a526 Compare November 14, 2024 15:15

ArthurSens reviewed Nov 19, 2024

View reviewed changes

receiver/prometheusreceiver/README.md Show resolved Hide resolved

dashpole approved these changes Nov 20, 2024

View reviewed changes

Merge branch 'main' into ridwanmsharif/starttime-fallback

59ec7db

dashpole added the enhancement New feature or request label Nov 21, 2024

dehaansa reviewed Nov 24, 2024

View reviewed changes

Aneurysm9 reviewed Nov 25, 2024

View reviewed changes

github-actions bot added the Stale label Dec 10, 2024

dashpole removed the Stale label Dec 10, 2024

github-actions bot added the Stale label Dec 25, 2024

dashpole removed the Stale label Jan 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

receiver/prometheusreceiver: add option to fallback to collector starttime #36365

receiver/prometheusreceiver: add option to fallback to collector starttime #36365

ridwanmsharif commented Nov 14, 2024

ArthurSens commented Nov 19, 2024

dehaansa Nov 24, 2024

dashpole Nov 25, 2024

dehaansa Nov 25, 2024

Aneurysm9 Nov 25, 2024

dashpole Nov 25, 2024

Aneurysm9 Nov 25, 2024

Aneurysm9 Nov 25, 2024

Aneurysm9 commented Nov 25, 2024

github-actions bot commented Dec 10, 2024

github-actions bot commented Dec 25, 2024

receiver/prometheusreceiver: add option to fallback to collector starttime #36365

Are you sure you want to change the base?

receiver/prometheusreceiver: add option to fallback to collector starttime #36365

Conversation

ridwanmsharif commented Nov 14, 2024

Description

Link to tracking issue

Testing

Documentation

ArthurSens commented Nov 19, 2024

dehaansa Nov 24, 2024

Choose a reason for hiding this comment

dashpole Nov 25, 2024

Choose a reason for hiding this comment

dehaansa Nov 25, 2024

Choose a reason for hiding this comment

Aneurysm9 Nov 25, 2024

Choose a reason for hiding this comment

dashpole Nov 25, 2024

Choose a reason for hiding this comment

Aneurysm9 Nov 25, 2024

Choose a reason for hiding this comment

Aneurysm9 Nov 25, 2024

Choose a reason for hiding this comment

Aneurysm9 commented Nov 25, 2024

github-actions bot commented Dec 10, 2024

github-actions bot commented Dec 25, 2024