[action] [PR:4197] [countersyncd]: Modify the exit behavior of the main function #4225

Open
mssonicbld wants to merge 1 commit into sonic-net:202511 from mssonicbld:cherry/202511/4197

@mssonicbld
Collaborator

What I did
The main function exits as soon as any actor terminates.

Why I did it
The Otel actor may terminate because it failed to connect to the OTel collector. Previously, the main function would not exit in that case because it waited for all actors to terminate.
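
For illustration only, here is a minimal sketch of the exit-on-first-termination pattern, using just the Rust standard library. The actor names, the `spawn_actor` and `wait_for_first_exit` helpers, the timings, and the exit codes are all hypothetical; none of this is taken from the countersyncd source, which may use a different runtime and mechanism.

```rust
use std::process;
use std::sync::mpsc::{self, Receiver, Sender};
use std::thread;
use std::time::Duration;

/// Outcome a (hypothetical) actor reports when it stops.
type ActorResult = Result<(), String>;
/// Actor name paired with its outcome.
type ExitReport = (&'static str, ActorResult);

/// Runs a stand-in actor on its own thread. When the actor finishes,
/// it reports its name and result on the shared `done` channel.
fn spawn_actor(
    name: &'static str,
    run_for: Duration,
    result: ActorResult,
    done: Sender<ExitReport>,
) {
    thread::spawn(move || {
        thread::sleep(run_for); // placeholder for the actor's real work
        // The receiver may already be gone if main has moved on; ignore that.
        let _ = done.send((name, result));
    });
}

/// Blocks until the FIRST actor reports termination, instead of
/// joining every actor in turn.
fn wait_for_first_exit(done: &Receiver<ExitReport>) -> ExitReport {
    done.recv()
        .expect("every actor dropped its sender without reporting")
}

/// Starts two stand-in actors and returns an exit code as soon as one stops.
fn run() -> i32 {
    let (tx, rx) = mpsc::channel();
    // Assumed scenario: the exporter gives up quickly, the other actor keeps running.
    spawn_actor(
        "otel",
        Duration::from_millis(50),
        Err("Max export retries exceeded".to_string()),
        tx.clone(),
    );
    spawn_actor("other", Duration::from_secs(3600), Ok(()), tx);

    let (name, result) = wait_for_first_exit(&rx);
    match result {
        Ok(()) => {
            println!("{} actor terminated", name);
            0
        }
        Err(err) => {
            eprintln!("{} actor failed: {}", name, err);
            1 // assumed non-zero exit code; the PR does not state one
        }
    }
}

fn main() {
    // Exiting the process also ends the still-running actor thread.
    process::exit(run());
}
```

In this sketch `run()` returns after about 50 ms even though the second actor would otherwise run for an hour. Joining both threads in sequence instead would block on the long-running one, which is the kind of hang the description above refers to.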

How I verified it
Checked locally:

[2026-02-04 11:16:01.248] [crates/countersyncd/src/actor/otel.rs:340] [WARN] Export attempt 29 failed: status: Unavailable, message: "tcp connect error", details: [], metadata: MetadataMap { headers: {} }
[2026-02-04 11:16:11.253] [crates/countersyncd/src/actor/otel.rs:340] [WARN] Export attempt 30 failed: status: Unavailable, message: "tcp connect error", details: [], metadata: MetadataMap { headers: {} }
[2026-02-04 11:16:21.255] [crates/countersyncd/src/actor/otel.rs:405] [ERROR] Failed to export buffered metrics (consecutive failures 30): OtelActorExportError("Max export retries exceeded")
[2026-02-04 11:16:21.256] [crates/countersyncd/src/actor/otel.rs:431] [INFO] Shutting down OtelActor...
[2026-02-04 11:16:22.257] [crates/countersyncd/src/actor/otel.rs:439] [INFO] OtelActor shutdown complete. 2339 messages, 0 exports, 1 failures
[2026-02-04 11:16:22.257] [crates/countersyncd/src/main.rs:421] [INFO] OpenTelemetry actor terminated
[2026-02-04 11:16:22.257] [crates/countersyncd/src/main.rs:464] [ERROR] OpenTelemetry actor failed: OtelActorExportError("Max export retries exceeded")

Details if related

@mssonicbld
Collaborator Author

Original PR: #4197

@mssonicbld
Collaborator Author

/azp run

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@Pterosaur
Contributor

/azp run

@azure-pipelines

Commenter does not have sufficient privileges for PR 4225 in repo sonic-net/sonic-swss

@Pterosaur
Contributor

/azpw run

@mssonicbld
Collaborator Author

/AzurePipelines run

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).
