Getting Started with State Persistence
In this lesson, we will briefly discuss state persistence for applications and create a cluster.
How to Persist State?#
Having fault-tolerance and high-availability is of no use if we lose application state during rescheduling.
Having state is unavoidable, and we need to preserve it no matter what happens to our applications, servers, or even a whole datacenter.
The way to preserve the state of our applications depends on their architecture. Some store data in-memory and rely on periodic backups. Others are capable of synchronizing data between multiple replicas, so that the loss of one instance does not result in loss of data. Most, however, rely on disk to store their state. We’ll focus on that group of stateful applications.
If we are to build fault-tolerant systems, we need to make sure that failure of any part of the system is recoverable. Since speed is of the essence, we cannot rely on manual operations to recuperate from failures. Even if we could, no one wants to be the person sitting in front of a screen, waiting for something to fail, only to bring it back to its previous state.
Kubernetes Failure Handling#
We already saw that Kubernetes would, in most cases, recuperate from a failure of an application, of a server, or even of a whole datacenter. It’ll reschedule Pods to healthy nodes. We also experienced how AWS and kops accomplish more or less the same effect on the infrastructure level. Auto-scaling groups will recreate failed nodes and, since they are provisioned with kops startup processes, new instances will have everything they need, and they will join the cluster.
The only thing that prevents us from saying that our system is (mostly) highly available and fault tolerant is the fact that we did not solve the problem of persisting state across failures. That’s the subject we’ll explore next.
We’ll try to preserve our data no matter what happens to our stateful applications or the servers where they run.
Creating A Kubernetes Cluster#
We’ll start by recreating a similar cluster as the one we used in the previous chapter.
We entered the local copy of the `k8s-specs` repository, pulled the latest code, and went into the `cluster` directory.
Setting Up Environment Variables#
In the previous chapter, we stored the environment variables we used in the `kops` file. Let’s take a quick look at them.
The output, without the keys, is as follows.
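A representative example with placeholder values; your region, zones, and cluster name may differ from this assumption.

```
export AWS_ACCESS_KEY_ID=[...]
export AWS_SECRET_ACCESS_KEY=[...]
export AWS_DEFAULT_REGION=us-east-2
export ZONES=us-east-2a,us-east-2b,us-east-2c
export NAME=devops23.k8s.local
```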
By storing the environment variables in a file, we can fast-track the process by loading them using the `source` command.
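Assuming the file is still named `kops`:

```
source kops
```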
Please ensure that your copy of the file has all the lines starting with `export`. If that’s not the case, please update it accordingly.
Creating an S3 Bucket#
Now that the environment variables are set, we can proceed to create an S3 bucket.
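A sketch of the bucket creation, following the same pattern as the previous chapter; the bucket name is only an example, since S3 bucket names must be globally unique.

```
# Generate a (hopefully) unique bucket name
export BUCKET_NAME=devops23-$(date +%Y%m%d%H%M%S)

# Create the bucket in the region defined through AWS_DEFAULT_REGION
aws s3api create-bucket \
    --bucket $BUCKET_NAME \
    --create-bucket-configuration \
    LocationConstraint=$AWS_DEFAULT_REGION

# Tell kops where to store the cluster state
export KOPS_STATE_STORE=s3://$BUCKET_NAME
```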
Only for Windows Users#
The command that creates the `kops` alias is as follows. Execute it only if you are a Windows user.
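A sketch of the alias, assuming the same `vfarcic/kops` container image and `devops23.pub` SSH key used in the previous chapter; adjust the image and the mounted paths to match your setup.

```
# Directory that will hold the exported kubectl config
mkdir -p config

# Run kops inside a container instead of installing it locally
alias kops="docker run -it --rm \
    -v $PWD/devops23.pub:/devops23.pub \
    -v $PWD/config:/config \
    -e KUBECONFIG=/config/kubecfg.yaml \
    -e NAME=$NAME -e ZONES=$ZONES \
    -e AWS_ACCESS_KEY_ID=$AWS_ACCESS_KEY_ID \
    -e AWS_SECRET_ACCESS_KEY=$AWS_SECRET_ACCESS_KEY \
    -e AWS_DEFAULT_REGION=$AWS_DEFAULT_REGION \
    -e KOPS_STATE_STORE=$KOPS_STATE_STORE \
    vfarcic/kops"
```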
Cluster Creation#
Now we can, finally, create a new Kubernetes cluster in AWS.
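A sketch of the command, assuming the same flags as in the previous chapter apart from the node count and size:

```
kops create cluster \
    --name $NAME \
    --master-count 3 \
    --node-count 2 \
    --node-size t2.medium \
    --master-size t2.small \
    --zones $ZONES \
    --master-zones $ZONES \
    --ssh-public-key devops23.pub \
    --networking kubenet \
    --yes
```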
If we compare that command with the one we executed in the previous chapter, we’ll notice only a few minor changes. We increased `node-count` to `2` and `node-size` to `t2.medium`. That will give us more than enough capacity for the exercises we’ll run in this chapter.
Validation#
Let’s validate the cluster.
Assuming that enough time has passed since we executed `kops create cluster`, the output should indicate that the cluster `devops23.k8s.local` is ready.
Please wait until the validation checks pass.
📝 A note to Windows users#
Kops was executed inside a container, and it changed the context inside that container, which is now gone. As a result, your local `kubectl` context was left intact. We’ll fix that by executing `kops export kubecfg --name ${NAME}` and `export KUBECONFIG=$PWD/config/kubecfg.yaml`. The first command exports the config to `/config/kubecfg.yaml` inside the container. That path is specified through the environment variable `KUBECONFIG` and is mounted as `config/kubecfg.yaml` on the local hard disk. The second command exports `KUBECONFIG` locally. Through that variable, `kubectl` is instructed to use the configuration in `config/kubecfg.yaml` instead of the default one. Before you run those commands, please give AWS a few minutes to create all the EC2 instances and for them to join the cluster. After waiting and executing those commands, you’ll be all set.
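Put together, the two commands are:

```
kops export kubecfg --name ${NAME}

export KUBECONFIG=$PWD/config/kubecfg.yaml
```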
Creating Ingress and ELB DNS#
We’ll need Ingress if we’d like to access the applications we’ll deploy.
Ingress will not help us much without the ELB DNS, so we’ll get that as well.
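A sketch of how to retrieve it, assuming `jq` is installed and that the cluster name starts with `devops23` (so we can exclude the load balancer in front of the Kubernetes API):

```
aws elb describe-load-balancers \
    | jq -r ".LoadBalancerDescriptions[] \
    | select(.DNSName | contains(\"api-devops23\") | not) \
    | .DNSName"
```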
The output of the latter command should end with `us-east-2.elb.amazonaws.com`.
Finally, now that we are finished with the cluster setup, we can go back to the repository root directory.
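Assuming we are still inside the `cluster` directory:

```
cd ..
```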
In the next lesson, we will deploy stateful applications without persisting their state.