Failure domain re-assignment on Cloudstack machine deploy failures #352
Labels
kind/feature
Categorizes issue or PR as related to a new feature.
lifecycle/rotten
Denotes an issue or PR that has aged beyond stale and will be auto-closed.
release:must-have
Milestone
/kind feature
Describe the solution you'd like
[A clear and concise description of what you want to happen.]
CAPC chooses a random failure domain to deploy worker machines. When this VM deploy fails, irrespective of the type of error, CAPC will keep re-attempting to deploy a VM until CAPI replaces the owner machine.
The proposed feature is to check for the failure errors when a VM deploy occurs, and if identified to be a terminal error(errors that are not transient or that wont go away with a re-try), choose another failure domain and deploy. Choosing the failure domain could be random as it is now, or can be based on resources available such as free IPs available.
This is only for worker machines, as control-plane failure domain assignment is handled by KCP.
The text was updated successfully, but these errors were encountered: