-
Notifications
You must be signed in to change notification settings - Fork 154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Agent service doesn't get started and host not available under Agents tab on reboot when installed with --delay-enroll
.
#3961
Comments
Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane) |
@manishgupta-qasource Please review. |
Secondary review for this ticket is Done |
Does this happen with an English VM? If not I assume the root cause will turn out to be similar to #3960. |
Can you collect agent diagnostics? If you can't, can you attach a zip archive of the agent installation directory? There should be logs explaining why the delayed enrollment failed. |
Hi @cmacknz Thank you for looking into this. We have revalidated this issue on latest 8.12.0 BC4 and found it still reproducible. We are not able to fetch the logs for this agent: Hence, we have shared the agent directory zip over slack.
The issue is not reproducible on English VM. Build details: Please let us know if anything else is required from our end. Thanks!! |
Thanks, the agent directory zip unfortunately didn't contain any logs. My assumption is that this is the same problem as #3960. Let's retest once that issue is resolved to confirm. |
@amolnater-qasource could you please test this again now that we have BC5 with @leehinman 's fix? |
Hi Team, We have revalidated this issue on latest 8.12.0 BC5 kibana cloud environment and found it still reproducible. Observations:
Build details: Few logs are generated under agent installation directory, we have shared the same under #elastic-agent-team channel over slack. UPDATE:
Please let us know if we are missing anything here. Thanks! |
https://github.com/elastic/elastic-agent/commits/db9b8921b37139cbb1e11d23f6381f655edeb72b isn't a valid elastic-agent commit. Downloading that version and checking the version manually I get:
That is new enough to have the fix c02f65a |
Errors from the first set of logs: {"log.level":"info","@timestamp":"2024-01-08T08:43:29.186Z","log.origin":{"file.name":"cmd/run.go","file.line":157},"message":"Elastic Agent started","log":{"source":"elastic-agent"},"process.pid":3696,"agent.version":"8.12.0","ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:29.801Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":496},"message":"Starting enrollment to URL: https://bf5d27ec208440a4b53d82c8e9d9b005.fleet.europe-west1.gcp.cloud.es.io:443/","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:31.332Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 0","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:31.610Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 0 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Das System kann die angegebene Datei nicht finden.\"'. Waiting for 2s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:33.617Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 1","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:33.628Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 1 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Das System kann die angegebene Datei nicht finden.\"'. Waiting for 4s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:37.635Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 2","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:37.636Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 2 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Das System kann die angegebene Datei nicht finden.\"'. Waiting for 8s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:45.637Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 3","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:45.638Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 3 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Das System kann die angegebene Datei nicht finden.\"'. Waiting for 16s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:44:01.652Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 4","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:44:01.653Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 4 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Das System kann die angegebene Datei nicht finden.\"'. Waiting for 32s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:44:33.666Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":281},"message":"Elastic Agent might not be running; unable to trigger restart: could not reload agent's daemon, all retries failed. Last error: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Das System kann die angegebene Datei nicht finden.\"","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:44:33.692Z","log.origin":{"file.name":"cmd/run.go","file.line":162},"message":"failed to perform delayed enrollment: could not reload agent daemon, unable to trigger restart: could not reload agent's daemon, all retries failed. Last error: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Das System kann die angegebene Datei nicht finden.\"","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"} Logs from the second agent are the same: {"log.level":"info","@timestamp":"2024-01-08T08:43:13.801Z","log.origin":{"file.name":"cmd/run.go","file.line":157},"message":"Elastic Agent started","log":{"source":"elastic-agent"},"process.pid":3412,"agent.version":"8.12.0","ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:14.275Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":496},"message":"Starting enrollment to URL: https://bf5d27ec208440a4b53d82c8e9d9b005.fleet.europe-west1.gcp.cloud.es.io:443/","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:15.814Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 0","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:15.875Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 0 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Le fichier spécifié est introuvable.\"'. Waiting for 2s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:17.878Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 1","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:17.878Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 1 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Le fichier spécifié est introuvable.\"'. Waiting for 4s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:21.885Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 2","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:21.885Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 2 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Le fichier spécifié est introuvable.\"'. Waiting for 8s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:29.897Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 3","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:29.898Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 3 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Le fichier spécifié est introuvable.\"'. Waiting for 16s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T08:43:45.907Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 4","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:43:45.908Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 4 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Le fichier spécifié est introuvable.\"'. Waiting for 32s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:44:17.913Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":281},"message":"Elastic Agent might not be running; unable to trigger restart: could not reload agent's daemon, all retries failed. Last error: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Le fichier spécifié est introuvable.\"","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T08:44:17.931Z","log.origin":{"file.name":"cmd/run.go","file.line":162},"message":"failed to perform delayed enrollment: could not reload agent daemon, unable to trigger restart: could not reload agent's daemon, all retries failed. Last error: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: open \\\\\\\\.\\\\pipe\\\\elastic-agent-system: Le fichier spécifié est introuvable.\"","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"} |
I can reproduce this on an english Linux Debian 11 VM, I don't think this is related to the language: craig_mackenzie@cmackenzie-debian11-test:~$ sudo elastic-agent version
Binary: 8.12.0 (build: 8744ca087b3908495b0204e6c80dd9b4de92ccf5 at 2024-01-05 15:38:21 +0000 UTC)
Daemon: <failed to communicate>
could not get version. failed to communicate with running daemon: rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing: dial unix /run/elastic-agent.sock: connect: no such file or directory"
Use --binary-only flag to skip trying to retrieve version from running daemon
craig_mackenzie@cmackenzie-debian11-test:~$ sudo cat /opt/Elastic/Agent/data/elastic-agent-8744ca/logs/elastic-agent-20240108.ndjson
{"log.level":"info","@timestamp":"2024-01-08T15:23:40.919Z","log.origin":{"file.name":"cmd/run.go","file.line":157},"message":"Elastic Agent started","log":{"source":"elastic-agent"},"process.pid":355,"agent.version":"8.12.0","ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T15:23:41.578Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":496},"message":"Starting enrollment to URL: https://2e6c294352624beda2f6830fada574fb.fleet.eastus2.staging.azure.foundit.no:443/","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T15:23:43.122Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 0","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T15:23:43.131Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 0 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial unix /run/elastic-agent.sock: connect: no such file or directory\"'. Waiting for 2s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T15:23:45.132Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 1","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T15:23:45.133Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 1 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial unix /run/elastic-agent.sock: connect: no such file or directory\"'. Waiting for 4s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T15:23:49.135Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 2","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T15:23:49.136Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 2 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial unix /run/elastic-agent.sock: connect: no such file or directory\"'. Waiting for 8s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T15:23:57.138Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 3","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T15:23:57.139Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 3 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial unix /run/elastic-agent.sock: connect: no such file or directory\"'. Waiting for 16s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-01-08T15:24:13.139Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":461},"message":"Restarting agent daemon, attempt 4","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T15:24:13.140Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":475},"message":"Restart attempt 4 failed: 'rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial unix /run/elastic-agent.sock: connect: no such file or directory\"'. Waiting for 32s","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T15:24:45.140Z","log.origin":{"file.name":"cmd/enroll_cmd.go","file.line":281},"message":"Elastic Agent might not be running; unable to trigger restart: could not reload agent's daemon, all retries failed. Last error: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial unix /run/elastic-agent.sock: connect: no such file or directory\"","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-01-08T15:24:45.140Z","log.origin":{"file.name":"cmd/run.go","file.line":162},"message":"failed to perform delayed enrollment: could not reload agent daemon, unable to trigger restart: could not reload agent's daemon, all retries failed. Last error: rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing: dial unix /run/elastic-agent.sock: connect: no such file or directory\"","log":{"source":"elastic-agent"},"ecs.version":"1.6.0"} |
--delay-enroll
.--delay-enroll
.
Hi @cmacknz The commits for the BC5 agent version are post merge, for the agent which we have tested and the issue was reproducible: Below commits are of the kibana BC5 build:
Could you please confirm if both kibana and agent versions should be post the merge or if we are missing anything here? Thanks |
Only the agent needs to be post merge |
Hi Team, We have revalidated this issue on latest 8.12.0 BC6 kibana cloud environment and found it fixed now. Observations:
Host: Build details: Hence we are marking this issue as QA:Validated. Thanks! |
Kibana Build details:
Host OS: Windows French, Inconsistently on Korean
Preconditions:
Steps to reproduce:
--delay-enroll
on Windows 10 French.NOTE:
Screenshot:
Expected Result:
Agent service should get started and host should be available under Agents tab on reboot when installed with
--delay-enroll
.Previously observed:
#2719
Logs:
No agent logs are available.
The text was updated successfully, but these errors were encountered: