-
Notifications
You must be signed in to change notification settings - Fork 913
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Descheduling support for FlinkDeployments #5987
Comments
Hi @deefreak thanks for your feedback!
What are your main changes? Can you explain them briefly? |
Hi @XiShanYongYe-Chang two changes are required:
|
Thanks @deefreak for your quick response. Would you like to share it at a community meeting? |
@XiShanYongYe-Chang sure. |
@XiShanYongYe-Chang We will have this meeting on 6th January 2025? Should I add it in the document of the community meeting? |
Yes, we will
Yes, you can do it. |
@XiShanYongYe-Chang I don't have edit access to the document, I have requested for it. |
Hi @deefreak, you can do it by:
|
Hi @deefreak, have you ever looked at the Application State Preservation feature which was designed exactly based on FlinkDeployments failover scenario. By the way, why is FlinkDeployment unschedulable? As far as I know, the Karmada scheduler will select a cluster with enough available resources, which would help reduce the chance that the workload can not started after deploying to cluster. |
@RainbowMango I wasn't aware of this. Just went through it. So basically we define some failover conditions in the propagation policy itself and accordingly, it takes rescheduling decisions. (Mostly if it is "Unhealthy"). I wasn't using this feature, instead when I checked descheduler code which takes decision for triggering the rescheduling, it checks if there are any pod in the cluster which is unschedulable. It only has support for Deployments. I tried adding support for flinkdeployment there which means descheduler will trigger a rescheduling if none of the pods belonging to flinkdeployments(job manager + task manager pods) is schedulable. |
Yes, you can find an example from #5788 (comment). |
Hi @deefreak I see you added an agenda to this week's community meeting. I'd love to meet you there, just want to know which time zone you are located in. I'm thinking of moving the meeting 1 hour earlier because it is kind of too late(midnight 00:00) for me. |
My timezone is IST. 1 hour earlier is fine for me. |
OK. I will send an email to the Karmada mail group and then update the calendar. |
What would you like to be added:
Currently, the descheduler only supports Deployment objects.
For our use case, we wanted to support FlinkDeployments and for that I have done changes and it has been working fine in our environment.
Why is this needed:
This is needed so that descheduler can support FlinkDeployments and reschedules it to other clusters if flinkdeployment pods are unschedulable.
I would like to contribute for this feature.
The text was updated successfully, but these errors were encountered: