ScyllaDB Docs ScyllaDB Operator Operate Scale, add, remove racks

For AI agents: a documentation index is available at https://operator.docs.scylladb.com/master/llms.txt. A Markdown version of this page is at https://operator.docs.scylladb.com/master/operate/scale-add-remove-racks.md.

Caution

You're viewing documentation for an unstable version of ScyllaDB Operator. Switch to the latest stable version.

Scale, add, remove racks¶

Change the number of ScyllaDB nodes in a rack or add entirely new racks to adjust capacity and throughput.

How scaling works¶

Each rack in a ScyllaDB cluster maps to a single Kubernetes StatefulSet. Scaling changes the replica count of that StatefulSet:

Scale up — new pods are appended at the end of the ordinal sequence (highest index). After the new node joins the token ring, the Operator automatically triggers a data cleanup on affected nodes.
Scale down — the Operator decommissions the highest-ordinal pod first, streams its data to the remaining nodes, reduces the replica count, and then deletes the PVC and Service. Only one node is decommissioned at a time.

Because StatefulSets use OrderedReady pod management, you cannot remove an arbitrary node from the middle of a rack. If a specific node is unhealthy, use node replacement instead.

For background on the StatefulSet-per-rack architecture, see StatefulSets and racks.

Scale a ScyllaCluster¶

Change spec.datacenter.racks[].members to the desired node count and apply:

apiVersion: scylla.scylladb.com/v1
kind: ScyllaCluster
metadata:
  name: scylla
  namespace: scylla
spec:
  datacenter:
    name: us-east-1
    racks:
      - name: us-east-1a
        members: 3          # was 1, now 3
        storage:
          capacity: 500Gi

apiVersion: scylla.scylladb.com/v1
kind: ScyllaCluster
metadata:
  name: scylla
  namespace: scylla
spec:
  datacenter:
    name: us-east-1
    racks:
      - name: us-east-1a
        members: 1          # was 3, now 1
        storage:
          capacity: 500Gi

Wait for the operation to complete:

kubectl -n scylla wait --timeout=10m --for='condition=Available' scyllaclusters.scylla.scylladb.com/scylla

Verify with nodetool status:

kubectl -n scylla exec -it scylla-us-east-1a-0 -c scylla -- nodetool status

Add a rack to a ScyllaCluster¶

Append a new entry to the spec.datacenter.racks array. The Operator creates racks in the order they appear and waits for each rack to be fully ready before creating the next.

apiVersion: scylla.scylladb.com/v1
kind: ScyllaCluster
metadata:
  name: scylla
  namespace: scylla
spec:
  datacenter:
    name: us-east-1
    racks:
      - name: us-east-1a
        members: 3
        storage:
          capacity: 500Gi
      - name: us-east-1b         # new rack
        members: 3
        storage:
          capacity: 500Gi

Note

Rack names serve as identity — they determine the StatefulSet and Service names. Choose rack names carefully, as renaming a rack requires removing it and creating a new one.

Remove a rack¶

Removing a rack is a two-step process. You must scale the rack to zero members first, wait for decommissioning to finish, and only then remove the rack definition from the spec.

Step 1: Scale the rack down to 0 members¶

Update the ScyllaCluster spec to set members: 0 for the rack being removed:

kubectl -n scylla patch scyllacluster scylla --type=json \
  -p='[{"op":"replace","path":"/spec/datacenter/racks/<index>/members","value":0}]'

Replace <index> with the zero-based index of the rack in the racks array.

Wait for the Operator to decommission all nodes in the rack:

kubectl -n scylla wait --timeout=30m \
  --for='condition=Available=True' scyllacluster/scylla

Verify all pods in the rack are gone:

kubectl -n scylla get pods -l scylla/rack=<rack-name>

Expected output: no pods listed.

Step 2: Remove the rack definition from the spec¶

Remove the rack entry from spec.datacenter.racks:

kubectl -n scylla edit scyllacluster scylla

Delete the entire rack entry. Save and apply.

Warning

Removing a rack is irreversible — any data that was stored on the rack’s nodes is streamed away during decommission.

After both steps, verify the cluster is healthy:

kubectl -n scylla wait --timeout=5m \
  --for='condition=Available=True' scyllacluster/scylla

Note

In multi-DC clusters using multiple ScyllaCluster resources, each datacenter is scaled independently by editing its own ScyllaCluster resource.

Key considerations¶

Consideration	Detail
One at a time	The Operator scales down one node at a time per rack, ensuring data is streamed away before the next decommission begins.
Automatic cleanup	After scaling completes, the Operator triggers data cleanup Jobs on affected nodes to remove data that no longer belongs to them.
PVC deletion	PVCs are deleted after scale-down. The Operator removes the PVC and Service of each decommissioned node after the replica count is reduced.
Replication factor	Ensure you do not scale below the replication factor of your keyspaces. ScyllaDB will refuse queries if replicas become unavailable.
PodDisruptionBudget	Each datacenter has a PDB with `maxUnavailable: 1`. This does not block Operator-driven scaling but prevents concurrent pod evictions during node drains.
Run repair after scaling	After significant scaling operations, run a repair to ensure data consistency across the new token ranges.

Was this page helpful?

Scale, add, remove racks¶

How scaling works¶

Scale a ScyllaCluster¶

Add a rack to a ScyllaCluster¶

Remove a rack¶

Step 1: Scale the rack down to 0 members¶

Step 2: Remove the rack definition from the spec¶

Key considerations¶

Related pages¶