Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Explore using the resilient creation/deletion features for VMSS to improve reliability of Helix VMs #4829

Open
3 tasks
riarenas opened this issue Jan 23, 2025 · 1 comment
Assignees

Comments

@riarenas
Copy link
Member

The VMSS team reached out to us to showcase the resilient VM creation/deletion feature. This feature ensures that all VM instance creation is retried up to five times. This should be helpful when dealing with OS provisioning timeouts and other infrastructure problems on the azure side.

Release Note Category

  • Feature changes/additions
  • Bug fixes
  • Internal Infrastructure Improvements

Release Note Description

@riarenas riarenas self-assigned this Jan 23, 2025
@riarenas
Copy link
Member Author

This feature has been enabled in the windows.11.arm64.open queue which our telemetry shows sees a lot of machines that never get to heartbeat and are afterwards cleaned up by the dead vm cleaner.

We will monitor the data to see if this creates any improvement over our current VM deletion process.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant