Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix/k8s-executor-logsdevelopment: Add Retry Logic to Executor Job and Logs #1141

Merged
merged 5 commits into from
Feb 5, 2025

Conversation

lbeckman314
Copy link
Member

@lbeckman314 lbeckman314 commented Jan 30, 2025

Overview ⚙️

When using the Kubernetes compute backend, the Funnel Worker fails when trying to read the logs of the Executor before the Executor Pod has fully started. This then results in a Error loop due to not having a retry mechanism in place.

Changes 🌀

This PR resolves this issue by adding retry mechanisms in two places:

  1. Creating the Executor Job

  2. Reading the Executor Logs

Credits 🤝

This PR is made possible by testing and development recommendations by the CTDS Team of Pauline Ribeyre, Jawad Qureshi, Sai Shanmukha Narumanchi, Aidan Hilt, and Ajo Augustine as part of the Gen3 Data Platform 🌱

@lbeckman314 lbeckman314 added this to the 0.11.1 milestone Jan 30, 2025
@lbeckman314 lbeckman314 self-assigned this Jan 30, 2025
Copy link

netlify bot commented Jan 30, 2025

Deploy Preview for ohsu-comp-bio-funnel canceled.

Name Link
🔨 Latest commit cc25140
🔍 Latest deploy log https://app.netlify.com/sites/ohsu-comp-bio-funnel/deploys/67a3d6d60cf34b0007c701fe

@lbeckman314 lbeckman314 changed the title WIP: Fix/k8s executor logs fix/k8sdevelopment: Add Retry Logic to Executor Job and Logs Jan 30, 2025
- The only way an error will arise is if we try to check the Executor pod's logs before or while it's being created. This commit resolves by simply waiting for the pod to be created.
@lbeckman314 lbeckman314 merged commit 48ce4e4 into develop Feb 5, 2025
7 of 8 checks passed
@lbeckman314 lbeckman314 deleted the fix/k8s-executor-logs branch February 5, 2025 21:24
@lbeckman314 lbeckman314 changed the title fix/k8sdevelopment: Add Retry Logic to Executor Job and Logs fix/k8s-executor-logsdevelopment: Add Retry Logic to Executor Job and Logs Feb 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant