Scaling Deployments

In this lesson, we'll learn to scale Deployments using a YAML file and briefly discuss automated scaling.

Scaling using YAML files#

There are quite a few different ways we can scale Deployments. Nothing we do in this section is unique to Deployments; it can be applied to any Controller, like ReplicaSets, including those we have not yet explored.

If we decide that the number of replicas changes with relatively low frequency or that Deployments are performed manually, the best way to scale is to write a new YAML file or, even better, modify the existing one. Assuming that we store YAML files in a code repository, by updating existing files we have a documented and reproducible definition of the objects running inside a cluster.

We already performed scaling when we applied the definition from the go-demo-2-scaled.yml. We’ll do something similar, but with Deployments.

Looking into the file#

Let’s take a look at go-demo-2-scaled.yml for Deployments. We won’t display the contents of the whole file since it is almost identical to go-demo-2.yml. The only difference is the number of replicas of the go-demo-2-api Deployment.

Definition of 'go-demo-2-scaled'
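Since the full file is not reproduced in this lesson, here is a sketch of the relevant fragment. Only the replicas value differs from go-demo-2.yml; the selector labels shown are illustrative assumptions, not taken from the actual file.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: go-demo-2-api
spec:
  replicas: 5          # the only change from go-demo-2.yml (previously 3)
  selector:
    matchLabels:       # illustrative labels; the real file may differ
      type: api
      service: go-demo-2
```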

Applying the definition#

At the moment, we’re running three replicas. Once we apply the new definition, it should increase to five.

Create 'go-demo-2-scaled'
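Assuming the file is in the current directory (the actual path may differ in the course repository), the command would look like this:

```shell
# Apply the updated definition; existing objects are updated in place
kubectl apply -f go-demo-2-scaled.yml
```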

Please note that, even though the file is different, the names of the resources are the same, so kubectl apply did not create new objects. Instead, it updated those that changed. In particular, it changed the number of replicas of the go-demo-2-api Deployment.

Verification#

Let’s confirm that there are indeed five replicas of the Pods controlled through the Deployment.

Get Details of 'go-demo-2-scaled'
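Since the definition file references all the objects we created, we can ask kubectl for everything defined in it. A sketch, assuming the file sits in the current directory:

```shell
# List all the objects defined in the file, including the Deployments
kubectl get -f go-demo-2-scaled.yml
```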

The output, limited to the deploy/go-demo-2-api, is as follows.

Output of kubectl get

The result should come as no surprise. After all, we executed the same process before, when we explored ReplicaSets.

Automated scaling#

While scaling Deployments (or other Controllers) using YAML files is an excellent way to keep documentation accurate, it rarely fits the dynamic nature of clusters. We should aim for a system that will scale (and de-scale) services automatically.

When scaling is frequent and, hopefully, automated, we cannot expect to update YAML definitions and push them to Git. That would be too inefficient and would probably cause quite a few unwanted executions of delivery pipelines if they are triggered through repository WebHooks. After all, do we really want to push updated YAML files multiple times a day?

The number of replicas should not be part of the design. Instead, it is a fluctuating number that changes continuously (or at least often), depending on the traffic, memory and CPU utilization, and so on.

Depending on release frequency, the same can be said for the image. If we are practicing continuous delivery or deployment, we might be releasing once a week, once a day, or even more often. In such cases, new images would be deployed often, and there is no strong argument for changing YAML files every time we make a new release. That is especially true if we are deploying through an automated process (as we should).

We’ll explore automation later on. For now, we’ll limit ourselves to a command similar to kubectl set image, which we used to change the image used by Pods with each release.

Scaling the deployment#

Similarly, we’ll use kubectl scale to change the number of replicas. Consider this an introduction to automation that is coming later on.

Scaling deployments through kubectl
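A minimal sketch of the command follows. The target replica count (eight here) is an illustrative assumption, since the lesson does not state the exact number:

```shell
# Scale the go-demo-2-api Deployment by name, not by file
kubectl scale deployment go-demo-2-api --replicas 8
```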

We scaled the number of replicas associated with the Deployment go-demo-2-api. Please note that, this time, we did not use -f to reference a file. Because the same YAML file specifies two Deployments, referencing it would scale both. Since we wanted to limit the change to a particular Deployment, we used its name instead.

Verification#

Let’s confirm that scaling indeed worked as expected.

Verify replicas
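One way to confirm is to list the Deployments and check the replica counts:

```shell
# Show all Deployments with their desired and ready replica counts
kubectl get deployments
```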

The output, limited to Deployments, is as follows.

Output of above command

As we mentioned earlier, we’ll dedicate quite a lot of time to automation, and you won’t have to scale your applications manually. However, it is useful to know that the kubectl scale command exists. For now, you know how to scale Deployments (and other Controllers).

Destroying Everything#

Before we enter the next stage of our knowledge-seeking mission, we’ll destroy the cluster we’re running and give our machines a break.

Delete Cluster
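The exact command depends on how the cluster was created. Assuming a local Minikube cluster, for example:

```shell
# Destroy the local cluster (assumes Minikube; adjust for your setup)
minikube delete
```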

Try it yourself#

A list of all the commands used in the lesson is given below.

Commands used in this lesson
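A recap of the commands follows; the replica count and the cluster-deletion command are illustrative assumptions, as noted earlier in the lesson.

```shell
kubectl apply -f go-demo-2-scaled.yml                  # scale by applying the updated definition
kubectl get -f go-demo-2-scaled.yml                    # confirm the new number of replicas
kubectl scale deployment go-demo-2-api --replicas 8    # replica count is illustrative
kubectl get deployments                                # verify the scaling worked
minikube delete                                        # assumes a Minikube cluster
```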
