Reduce cost for pangeo-hubs, three n2-highmem-8 core nodes running #3379

consideRatio · 2023-11-05T12:09:51Z

In terms of compute capacity, the running pods could fit on a single node. I think they are spread across several nodes because a few pods have anti-affinity policies, for example. Whatever the cause, there is a lot of room for improvement here.

I think this must be related to having many konnectivity-agent pods that don't want to be scheduled onto nodes with more than "1 skew" between them, as discussed in #2490. I reduced the autoscaling of these pods so that there are no more than 2 of them until the cluster has at least 10 nodes running, which should help avoid the need for more than 1 core node in most situations.
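To illustrate the autoscaling rule described above, here is a minimal Python sketch of a ladder-style lookup from cluster size to konnectivity-agent replicas. It is not the actual autoscaler configuration: the `NODES_TO_REPLICAS` table and the `agent_replicas` function are hypothetical names, and only the "no more than 2 replicas below 10 nodes" rule comes from this issue. The replica count at 10 or more nodes is an assumption made for illustration.

```python
from bisect import bisect_right

# Hypothetical ladder: (minimum node count, konnectivity-agent replicas).
# Only the "no more than 2 replicas below 10 nodes" rule comes from this
# issue; the replica count at 10+ nodes is an illustrative assumption.
NODES_TO_REPLICAS = [(1, 2), (10, 3)]


def agent_replicas(node_count, ladder=NODES_TO_REPLICAS):
    """Return the replica count of the highest ladder step reached."""
    thresholds = [nodes for nodes, _ in ladder]
    step = bisect_right(thresholds, node_count) - 1
    # Clamp to the first step for clusters smaller than the first threshold.
    return ladder[max(step, 0)][1]


for nodes in (1, 3, 9, 10):
    print(nodes, agent_replicas(nodes))
```

With a table like this, the replica count stays at 2 for any cluster smaller than 10 nodes, so the spread constraint has fewer pods to distribute across core nodes.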

consideRatio · 2023-11-05T18:30:29Z

I've managed to get this down from 3 nodes to 1, and it can be reduced further by using a single n2-highmem-4 node instead of an n2-highmem-8 node. A comment noting that this is more suitable is added in #3375.
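As a rough illustration of the size of this reduction, the sketch below compares provisioned core node capacity before and after. The vCPU and memory figures are the standard Google Cloud N2 high-memory specs (8 GB per vCPU). The `MACHINE_TYPES` table and `capacity` function are hypothetical helpers, and no pricing is assumed.

```python
# vCPU count and memory (GB) per machine type, from the standard
# Google Cloud N2 high-memory specs. No pricing data is assumed here.
MACHINE_TYPES = {
    "n2-highmem-8": (8, 64),
    "n2-highmem-4": (4, 32),
}


def capacity(machine_type, node_count):
    """Return total (vCPU, memory in GB) for a group of identical nodes."""
    vcpu, memory_gb = MACHINE_TYPES[machine_type]
    return vcpu * node_count, memory_gb * node_count


before = capacity("n2-highmem-8", 3)  # the three core nodes in the issue title
after = capacity("n2-highmem-4", 1)   # the proposed single smaller core node
print(before, after)
```

This goes from 24 vCPU and 192 GB of provisioned core node capacity down to 4 vCPU and 32 GB, which is one sixth of the original.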

consideRatio · 2023-11-05T18:33:42Z

Remaining action point

Change the core nodes for pangeo-hubs from n2-highmem-8 nodes to n2-highmem-4 nodes

consideRatio · 2024-06-26T09:01:02Z

Fixed by pangeo-hubs: upgrade k8s cluster, use smaller core nodes #3409

github-project-automation bot added this to DEPRECATED Engineering and Product Backlog Nov 5, 2023

github-project-automation bot moved this to Needs Shaping / Refinement in DEPRECATED Engineering and Product Backlog Nov 5, 2023

consideRatio added the tech:cloud-infra Optimization of cloud infra to reduce costs etc. label Nov 5, 2023

consideRatio self-assigned this Nov 5, 2023

damianavila added this to Sprint Board Nov 13, 2023

damianavila moved this to In Progress ⚡ in Sprint Board Nov 13, 2023

damianavila moved this from Needs Shaping / Refinement to In progress in DEPRECATED Engineering and Product Backlog Nov 13, 2023

yuvipanda added the allocation:internal-eng label Mar 20, 2024

consideRatio closed this as completed Jun 26, 2024

github-project-automation bot moved this from In Progress ⚡ to Done 🎉 in Sprint Board Jun 26, 2024

github-project-automation bot moved this from In progress to Complete in DEPRECATED Engineering and Product Backlog Jun 26, 2024
