Was this page helpful?
Caution
You're viewing documentation for an unstable version of ScyllaDB Operator. Switch to the latest stable version.
Scale, add, remove racks¶
Change the number of ScyllaDB nodes in a rack or add entirely new racks to adjust capacity and throughput.
How scaling works¶
Each rack in a ScyllaDB cluster maps to a single Kubernetes StatefulSet. Scaling changes the replica count of that StatefulSet:
Scale up — new pods are appended at the end of the ordinal sequence (highest index). After the new node joins the token ring, the Operator automatically triggers a data cleanup on affected nodes.
Scale down — the Operator decommissions the highest-ordinal pod first, streams its data to the remaining nodes, reduces the replica count, and then deletes the PVC and Service. Only one node is decommissioned at a time.
Because StatefulSets use OrderedReady pod management, you cannot remove an arbitrary node from the middle of a rack.
If a specific node is unhealthy, use node replacement instead.
For background on the StatefulSet-per-rack architecture, see StatefulSets and racks.
Scale a ScyllaCluster¶
Change spec.datacenter.racks[].members to the desired node count and apply:
apiVersion: scylla.scylladb.com/v1
kind: ScyllaCluster
metadata:
name: scylla
namespace: scylla
spec:
datacenter:
name: us-east-1
racks:
- name: us-east-1a
members: 3 # was 1, now 3
storage:
capacity: 500Gi
apiVersion: scylla.scylladb.com/v1
kind: ScyllaCluster
metadata:
name: scylla
namespace: scylla
spec:
datacenter:
name: us-east-1
racks:
- name: us-east-1a
members: 1 # was 3, now 1
storage:
capacity: 500Gi
Wait for the operation to complete:
kubectl -n scylla wait --timeout=10m --for='condition=Available' scyllaclusters.scylla.scylladb.com/scylla
Verify with nodetool status:
kubectl -n scylla exec -it scylla-us-east-1a-0 -c scylla -- nodetool status
Add a rack to a ScyllaCluster¶
Append a new entry to the spec.datacenter.racks array.
The Operator creates racks in the order they appear and waits for each rack to be fully ready before creating the next.
apiVersion: scylla.scylladb.com/v1
kind: ScyllaCluster
metadata:
name: scylla
namespace: scylla
spec:
datacenter:
name: us-east-1
racks:
- name: us-east-1a
members: 3
storage:
capacity: 500Gi
- name: us-east-1b # new rack
members: 3
storage:
capacity: 500Gi
Note
Rack names serve as identity — they determine the StatefulSet and Service names. Choose rack names carefully, as renaming a rack requires removing it and creating a new one.
Remove a rack¶
Removing a rack is a two-step process. You must scale the rack to zero members first, wait for decommissioning to finish, and only then remove the rack definition from the spec.
Step 1: Scale the rack down to 0 members¶
Update the ScyllaCluster spec to set members: 0 for the rack being removed:
kubectl -n scylla patch scyllacluster scylla --type=json \
-p='[{"op":"replace","path":"/spec/datacenter/racks/<index>/members","value":0}]'
Replace <index> with the zero-based index of the rack in the racks array.
Wait for the Operator to decommission all nodes in the rack:
kubectl -n scylla wait --timeout=30m \
--for='condition=Available=True' scyllacluster/scylla
Verify all pods in the rack are gone:
kubectl -n scylla get pods -l scylla/rack=<rack-name>
Expected output: no pods listed.
Step 2: Remove the rack definition from the spec¶
Remove the rack entry from spec.datacenter.racks:
kubectl -n scylla edit scyllacluster scylla
Delete the entire rack entry. Save and apply.
Warning
Removing a rack is irreversible — any data that was stored on the rack’s nodes is streamed away during decommission.
After both steps, verify the cluster is healthy:
kubectl -n scylla wait --timeout=5m \
--for='condition=Available=True' scyllacluster/scylla
Note
In multi-DC clusters using multiple ScyllaCluster resources, each datacenter is scaled independently by editing its own ScyllaCluster resource.
Key considerations¶
Consideration |
Detail |
|---|---|
One at a time |
The Operator scales down one node at a time per rack, ensuring data is streamed away before the next decommission begins. |
Automatic cleanup |
After scaling completes, the Operator triggers data cleanup Jobs on affected nodes to remove data that no longer belongs to them. |
PVC deletion |
PVCs are deleted after scale-down. The Operator removes the PVC and Service of each decommissioned node after the replica count is reduced. |
Replication factor |
Ensure you do not scale below the replication factor of your keyspaces. ScyllaDB will refuse queries if replicas become unavailable. |
PodDisruptionBudget |
Each datacenter has a PDB with |
Run repair after scaling |
After significant scaling operations, run a repair to ensure data consistency across the new token ranges. |