Skip to content

Conversation

@MattDodsonEnglish
Copy link
Collaborator

No description provided.

| `baas-alpha` | 3 | 8 | 16 (at least) | Yes | 750 | High throughput and IOPS |
| `baas-zero` | 3 | 2 | 2 | Yes | 300 | High throughput and IOPS |
| `isa95` | 2 | 2 | 1 | NO | N/A | |
| `libre-core` | 3 | 1 | 2 | No | N/A | HA requires 2 pods, but 3 is to avoid hotkey issues and balance load |
Copy link
Contributor

@tomhollingworth tomhollingworth Nov 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

libre-core was renamed in v4 to isa95, its line item can be removed
bpmn-engine was renamed in v4 to workflow, its line item can be removed
libre-audit -> was folded into isa95 / workflow, its line item can be removed
router is no longer required, its line item can be removed

| `quest-db` | 1 | 4 | 8 | Yes | 250GB | High Throughput and IPOS |
| `restate` | 3 | | | Yes | 50 | High Throughput and IPOS |
| `appsmith` | 3 | 4 | | Yes | 50 | High Throughput and IPOS |
| `grafana`* | 3 | 0.5 | 2 | No | 20-50 | Storage can be in host or in object bucket. |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If grafana is defined below in the monitoring stack it can remove it from here


For high availability, Rhize recommends a **minimum of three nodes** with the following specifications.


Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove 2nd new line

| `router` | 2 | 1 | 2 | Yes | <1 | Requires volume to compose supergraph |
| `quest-db` | 1 | 4 | 8 | Yes | 250GB | High Throughput and IPOS |
| `restate` | 3 | | | Yes | 50 | High Throughput and IPOS |
| `appsmith` | 3 | 4 | | Yes | 50 | High Throughput and IPOS |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm unsure what HA appsmith involves. Need to follow up with those that know.

| `keycloak` | 2 | 1 | 2 | No | N/A | |
| `keycloak-postgres` | 2 | 1 | 2 | No | 200 | Runs in pod with `keycloak` |
| `router` | 2 | 1 | 2 | Yes | <1 | Requires volume to compose supergraph |
| `quest-db` | 1 | 4 | 8 | Yes | 250GB | High Throughput and IPOS |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm unsure what HA QuestDB involves. Need to follow up with those that know.

| CPU Speed (GHz) | 3.3 |
| vCPU per Node | 16 |
| Memory per node (GiB) | 32 (64 is better) |
| Persisted volumes | 12 |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Persisted volumes is actually higher ~16. I need to confirm what QuestDB/AppSmith HA invovles

However, some deployments prefer to separate monitoring to its own cluster.

| Service | Pods for HA (replica count) | vCPU cores per pod | Memory per pod | DiskSize (GiB) |
|-------------------------|-----------------------------|--------------------|----------------|----------------|
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Might need to checking with the devops guys if this is still applicable in the latest tempo/loki/prom distributed helm-charts.

@MattDodsonEnglish MattDodsonEnglish merged commit dfb51f0 into main Dec 4, 2025
3 of 5 checks passed
@MattDodsonEnglish MattDodsonEnglish deleted the dodson/v4-cluster-sizing branch December 4, 2025 16:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants