fix: select best replica #496

Open
ashotland wants to merge 1 commit into main from ari/selectBestReplica
Conversation

@ashotland
Contributor

Fix master election to prefer replicas with data over empty restarted masters

Context

When a master pod crashes and restarts empty (no snapshot or corrupted data), the current selectMasterCandidate picks the lowest-ordinal ready pod — which could be the empty restarted master. All replicas then SLAVEOF the empty pod and lose their data.

Key constraint: older Dragonfly versions reset slave_repl_offset to 0 after a replica disconnects, so we cannot rely on offset > 0 alone to determine if a pod has data. Instead, we must check the pod's role first: pods with role=replica label were actively replicating and have data.
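That role check can be read straight off the pod's labels; a minimal runnable sketch, using a local stand-in for corev1.Pod (the exact label key "role" is an assumption for illustration, not necessarily the operator's real key):

```go
package main

import "fmt"

// pod is a minimal stand-in for corev1.Pod, carrying only what the role
// check needs; the operator uses the real Kubernetes type.
type pod struct {
	Name   string
	Labels map[string]string
}

// isReplica reports whether the pod was actively replicating before the
// failover. Pods labeled role=replica are known to hold data, so this check
// runs before any offset comparison (older Dragonfly versions reset
// slave_repl_offset to 0 after a replica disconnects).
// NOTE: the "role" label key is an assumption for illustration.
func isReplica(p *pod) bool {
	return p.Labels["role"] == "replica"
}

func main() {
	p := pod{Name: "dragonfly-1", Labels: map[string]string{"role": "replica"}}
	fmt.Println(isReplica(&p)) // true
}
```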

Selection priority

  1. Prefer pods with role=replica label — they were actively replicating and have data
  2. Among replicas, prefer highest slave_repl_offset (most up-to-date)
  3. Tie-break by lowest pod ordinal
  4. Fall back to non-replica pods (no role or role=master) only if no replicas are ready
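Taken together, the four rules reduce to a single comparison function. A minimal runnable sketch: the names mirror the PR (MasterCandidate, isBetterCandidate, getOrdinal) but the field set and bodies are assumptions, and the offset comparison is applied only among replicas, as the priority list specifies:

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// MasterCandidate captures the signals used for master election.
// (Field set is an assumption mirroring the PR's description.)
type MasterCandidate struct {
	Name      string // pod name, e.g. "dragonfly-2"
	IsReplica bool   // true if the pod carried the role=replica label
	Offset    int64  // slave_repl_offset from INFO replication
}

// getOrdinal extracts the StatefulSet ordinal from a pod name
// ("dragonfly-2" -> 2). Unparseable names sort last.
func getOrdinal(name string) int {
	idx := strings.LastIndex(name, "-")
	if idx < 0 {
		return int(^uint(0) >> 1) // max int
	}
	n, err := strconv.Atoi(name[idx+1:])
	if err != nil {
		return int(^uint(0) >> 1)
	}
	return n
}

// isBetterCandidate reports whether a should be preferred over b:
//  1. replicas beat non-replicas (they are known to have data),
//  2. among replicas, higher slave_repl_offset wins,
//  3. ties break on the lowest pod ordinal.
// Non-replicas are only reachable as a fallback when no replica is ready.
func isBetterCandidate(a, b *MasterCandidate) bool {
	if a.IsReplica != b.IsReplica {
		return a.IsReplica
	}
	if a.IsReplica && a.Offset != b.Offset {
		return a.Offset > b.Offset
	}
	return getOrdinal(a.Name) < getOrdinal(b.Name)
}

func main() {
	a := &MasterCandidate{Name: "dragonfly-2", IsReplica: true, Offset: 500}
	b := &MasterCandidate{Name: "dragonfly-0", IsReplica: false}
	fmt.Println(isBetterCandidate(a, b)) // true: replica beats empty restarted master
}
```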

Files to modify

  1. internal/controller/util.go — add MasterCandidate struct, isBetterCandidate, update selectMasterCandidate
  2. internal/controller/util_test.go — update tests for new selection logic
  3. internal/controller/dragonfly_pod_lifecycle_controller.go — update call site to pass candidate builder
  4. internal/controller/dragonfly_instance.go — add getMasterCandidate helper


Copilot AI left a comment


Pull request overview

Updates master election in the Dragonfly operator to avoid promoting an empty restarted master by preferring ready replicas (known to have data) and using replication offset as a secondary signal.

Changes:

  • Introduce MasterCandidate metadata and candidate comparison logic for deterministic master selection.
  • Update master-election call site to supply a candidate builder that queries INFO replication.
  • Expand unit tests to cover replica-preference and offset-based tie-breaking.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

  • internal/controller/util.go: adds MasterCandidate, isBetterCandidate, and updates selectMasterCandidate to prefer replicas and higher offsets.
  • internal/controller/util_test.go: updates/extends tests to validate the new candidate-selection priority rules.
  • internal/controller/dragonfly_pod_lifecycle_controller.go: passes a candidate-builder callback into selectMasterCandidate during failover selection.
  • internal/controller/dragonfly_instance.go: adds getMasterCandidate helper to derive IsReplica and slave_repl_offset via INFO replication.


Comment on lines +227 to 239
func selectMasterCandidate(pods []corev1.Pod, isReady func(*corev1.Pod) bool, getCandidate func(*corev1.Pod) MasterCandidate) *corev1.Pod {
	var best *MasterCandidate

	for i := range pods {
		p := &pods[i]
		if !isReady(p) {
			continue
		}

-		// Prefer Pod-0, then Pod-1, etc.
-		if bestCandidate == nil || getOrdinal(p.Name) < getOrdinal(bestCandidate.Name) {
-			bestCandidate = p
+		c := getCandidate(p)
+		if best == nil || isBetterCandidate(&c, best) {
+			best = &c
+		}

Copilot AI Mar 30, 2026


selectMasterCandidate stores best as a pointer to the loop-local variable c (best = &c). While safe due to escape analysis, it causes a heap allocation per update and is harder to reason about. Consider tracking the best candidate by value (plus a boolean) to avoid allocations and improve readability.
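The suggested value-plus-flag tracking might look like the following; a runnable sketch with trimmed stand-in types (the real loop uses corev1 types and the full comparison, including the ordinal tie-break, which is omitted here):

```go
package main

import "fmt"

// Minimal stand-ins so the selection loop is runnable; in the operator
// these come from corev1 and the PR's helpers.
type Pod struct {
	Name  string
	Ready bool
}

type MasterCandidate struct {
	Pod       *Pod
	IsReplica bool
	Offset    int64
}

// Trimmed comparison: replicas beat non-replicas, then higher offset wins.
// (The PR also tie-breaks on pod ordinal; omitted for brevity.)
func isBetterCandidate(a, b *MasterCandidate) bool {
	if a.IsReplica != b.IsReplica {
		return a.IsReplica
	}
	return a.Offset > b.Offset
}

// selectMasterCandidate tracks the best candidate by value plus a found
// flag, as the review suggests: no pointer to the loop-local c, so no
// per-iteration heap allocation and no aliasing to reason about.
func selectMasterCandidate(pods []Pod, getCandidate func(*Pod) MasterCandidate) *Pod {
	var (
		best  MasterCandidate
		found bool
	)
	for i := range pods {
		p := &pods[i]
		if !p.Ready {
			continue
		}
		c := getCandidate(p)
		if !found || isBetterCandidate(&c, &best) {
			best = c // copy by value; c can be safely reused next iteration
			found = true
		}
	}
	if !found {
		return nil
	}
	return best.Pod
}

func main() {
	pods := []Pod{{Name: "dragonfly-0", Ready: true}, {Name: "dragonfly-1", Ready: true}}
	get := func(p *Pod) MasterCandidate {
		return MasterCandidate{Pod: p, IsReplica: p.Name == "dragonfly-1", Offset: 100}
	}
	fmt.Println(selectMasterCandidate(pods, get).Name) // dragonfly-1
}
```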

Comment on lines +1117 to +1123
func (dfi *DragonflyInstance) getMasterCandidate(ctx context.Context, pod *corev1.Pod) MasterCandidate {
	c := MasterCandidate{Pod: pod, IsReplica: isReplica(pod)}
	if pod.Status.PodIP == "" {
		return c
	}
	info, err := dfi.getRedisClient(pod.Status.PodIP).Info(ctx, "replication").Result()
	if err != nil {

Copilot AI Mar 30, 2026


getMasterCandidate always performs an INFO replication call even for non-replica pods, but non-replicas are only used as a fallback when no ready replicas exist. Skipping the INFO call when IsReplica is false would reduce network calls and speed up elections in large clusters.
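The proposed short-circuit is a one-line guard before the network call; a runnable sketch where fetchOffset stands in for the INFO replication round trip (all names here are illustrative, not the PR's exact code):

```go
package main

import "fmt"

type MasterCandidate struct {
	Name      string
	IsReplica bool
	Offset    int64
}

// infoCalls counts round trips so the saving is observable in this sketch;
// in the operator each call is a real network request to the pod.
var infoCalls int

func fetchOffset(name string) int64 {
	infoCalls++
	return 42 // stand-in for parsing slave_repl_offset out of INFO replication
}

// getMasterCandidate skips the INFO call for non-replicas: their offset is
// never compared (they are only a fallback when no replica is ready), so
// the round trip would be wasted work during an election.
func getMasterCandidate(name string, isReplica bool) MasterCandidate {
	c := MasterCandidate{Name: name, IsReplica: isReplica}
	if !c.IsReplica {
		return c // short-circuit: no network call for fallback-only pods
	}
	c.Offset = fetchOffset(name)
	return c
}

func main() {
	getMasterCandidate("dragonfly-0", false)
	getMasterCandidate("dragonfly-1", true)
	fmt.Println(infoCalls) // 1: only the replica triggered an INFO call
}
```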

Comment on lines +1120 to +1131
		return c
	}
	info, err := dfi.getRedisClient(pod.Status.PodIP).Info(ctx, "replication").Result()
	if err != nil {
		return c
	}
	data := parseInfoToMap(info)
	if offsetStr, ok := data["slave_repl_offset"]; ok && offsetStr != "" {
		if offset, err := strconv.ParseInt(offsetStr, 10, 64); err == nil {
			c.Offset = offset
		}
	}

Copilot AI Mar 30, 2026


getMasterCandidate silently ignores INFO/parse failures and returns an offset of 0, which makes it hard to diagnose why elections are behaving unexpectedly (and can bias tie-breaks). Consider adding debug-level logging for INFO/parse errors and/or using a short per-call context timeout so master election can't block reconciliation for a long time when pods are slow/unreachable.

Suggested change
-		return c
-	}
-	info, err := dfi.getRedisClient(pod.Status.PodIP).Info(ctx, "replication").Result()
-	if err != nil {
-		return c
-	}
-	data := parseInfoToMap(info)
-	if offsetStr, ok := data["slave_repl_offset"]; ok && offsetStr != "" {
-		if offset, err := strconv.ParseInt(offsetStr, 10, 64); err == nil {
-			c.Offset = offset
-		}
-	}
+		// No IP means we cannot query Redis; log at debug level for diagnosability.
+		dfi.log.V(1).Info("skipping master candidate: pod has no IP", "pod", pod.Name)
+		return c
+	}
+	// Use a short per-call timeout so a slow/unreachable pod does not block reconciliation for long.
+	callCtx, cancel := context.WithTimeout(ctx, 2*time.Second)
+	defer cancel()
+	info, err := dfi.getRedisClient(pod.Status.PodIP).Info(callCtx, "replication").Result()
+	if err != nil {
+		// Log at debug level so INFO/parse errors are visible during elections.
+		dfi.log.V(1).Info("failed to get replication INFO for master candidate",
+			"pod", pod.Name, "ip", pod.Status.PodIP, "error", err)
+		return c
+	}
+	data := parseInfoToMap(info)
+	offsetStr, ok := data["slave_repl_offset"]
+	if !ok || offsetStr == "" {
+		dfi.log.V(1).Info("slave_repl_offset not found in replication INFO for master candidate",
+			"pod", pod.Name, "ip", pod.Status.PodIP)
+		return c
+	}
+	offset, err := strconv.ParseInt(offsetStr, 10, 64)
+	if err != nil {
+		dfi.log.V(1).Info("failed to parse slave_repl_offset for master candidate",
+			"pod", pod.Name, "ip", pod.Status.PodIP, "value", offsetStr, "error", err)
+		return c
+	}
+	c.Offset = offset

Comment on lines +253 to +254
	// Among same role, prefer higher replication offset.
	if a.Offset != b.Offset {

Copilot AI Mar 30, 2026


isBetterCandidate compares Offset even when both candidates are non-replicas. This conflicts with the documented selection priority (offset should only be considered among replicas) and can lead to unexpected behavior if a non-replica ever reports a non-zero slave_repl_offset. Consider only applying the offset comparison when IsReplica is true (or update the comment/priority to match the implemented behavior).

Suggested change
-	// Among same role, prefer higher replication offset.
-	if a.Offset != b.Offset {
+	// Among replicas, prefer higher replication offset.
+	if a.IsReplica && b.IsReplica && a.Offset != b.Offset {

@ashotland ashotland marked this pull request as ready for review March 31, 2026 09:00
