Measuring the Actual Memory and CPU Consumption
Learn how to measure the actual memory and CPU consumption.
Exploring the options#
How did we come up with the current memory and CPU values? Why did we set the memory of the MongoDB to 100Mi? Why not 50Mi or 1Gi? It is embarrassing to admit that the values we have right now are random. We guessed that the containers based on the vfarcic/go-demo-2 image require fewer resources than the MongoDB database, so their values are comparatively smaller. That was the only criterion we used to define the resources.
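For reference, the relevant part of the db container definition in go-demo-2-random.yml might look roughly like the excerpt below. Treat it as a sketch: only the 100Mi memory request and the 0.3/0.5 CPU request and limit come from this lesson; the container name, image, and the rest of the structure are assumptions.

```yaml
# Hypothetical excerpt, for illustration only; the real values live in go-demo-2-random.yml.
containers:
- name: db          # assumed name for the MongoDB container
  image: mongo      # assumed image
  resources:
    requests:
      memory: 100Mi # the memory value discussed in this lesson
      cpu: 0.3      # the CPU request discussed in this lesson
    limits:
      cpu: 0.5      # the CPU limit discussed in this lesson
```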
Before you frown upon the decision to put random values for resources, you should know that we do not have any metrics to back us up. Anybody’s guess is as good as ours.
The only way to truly know how much memory and CPU an application uses is by retrieving metrics. We’ll use Metrics Server for that purpose.
Metrics Server collects and interprets various signals like compute resource usage, lifecycle events, etc. In our case, we’re interested only in CPU and memory consumption of the containers we’re running in our cluster.
k3d clusters come with metrics-server already deployed as a system application.
The idea of using Metrics Server as a tool for all your monitoring needs has mostly been abandoned. Its primary purpose is to serve as an internal component required by some Kubernetes features (Horizontal Pod Autoscaling, for example).
Instead, I’d suggest Prometheus, combined with the Kubernetes API as the source of metrics, and Alertmanager for your alerting needs. However, those tools are not in the scope of this chapter, so you might need to educate yourself from their documentation, or wait until the sequel to this book is published (the tentative name is Advanced Kubernetes).
ℹ️ Use Metrics Server only as a quick-and-dirty way to retrieve metrics. Explore the combination of Prometheus and Alertmanager for your monitoring and alerting needs.
Now that we’ve clarified what Metrics Server is good for, as well as what it isn’t, we can proceed and confirm that it is indeed running inside our cluster.
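The command for checking that is not reproduced in this excerpt. A minimal way to do it, assuming a default k3d setup, is to list the Pods in the kube-system Namespace, where system applications run.

```bash
kubectl --namespace kube-system get pods
```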
The output should list the system Pods and, among them, you should see that metrics-server is running.
Let’s try a very simple query of Metrics Server.
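The exact query is not shown in this excerpt. A typical way to ask Metrics Server for Pod-level usage is kubectl top, sketched below.

```bash
kubectl top pods
```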
The output lists all the Pods in the default Namespace. As you can see, the available metrics are the current memory and CPU usage of each Pod.
We can see that the memory usage of the DB Pod is somewhere around 35 mebibytes. That’s quite a big difference from the 100Mi we set. Sure, this service is not under real production load but, since we’re simulating a “real” cluster, we’ll pretend that 35Mi is indeed the memory usage under “real” conditions. That means we overestimated the memory request by assigning a value almost three times larger than the actual usage.
How about the CPU? Did we make such a colossal mistake with it as well? As a reminder, we set the CPU request to 0.3 and the limit to 0.5. However, based on the previous output, the CPU usage is around 5m, or 0.005 CPU. Once again, we made a huge mistake in our resource specification: the request we set is around sixty times higher than the actual usage.
Such deviations between our expectations (resource requests and limits) and the actual usage can lead to very unbalanced scheduling with undesirable effects. We’ll correct the resources soon. For now, we’ll explore what happens if the amount of resources is below the actual usage.
Try it yourself#
A list of all the commands used in the lesson is given below.
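The original list is not reproduced in this excerpt; based on the lesson, it presumably boils down to the following commands.

```bash
# List the system Pods and confirm that metrics-server is running
kubectl --namespace kube-system get pods

# Retrieve the current CPU and memory usage of the Pods in the default Namespace
kubectl top pods
```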
You can practice the commands in the following code playground by pressing the Run button and waiting for the cluster to set up.
The playground contains the go-demo-2-random.yml definition used in this lesson.