Kubernetes Scheduling: Node Selection, Affinity & AntiAffinity

The developers at Mystique Unicorn are interested in building their application using an event-driven architectural pattern to process streaming data. For those who are unfamiliar, an event-driven architecture uses events to trigger and communicate between decoupled services and is common in modern applications built with microservices. An event is a change in state, or an update, like an item being placed in a shopping cart on an e-commerce website.

In this application, Kubernetes has been chosen as the platform to host the producers and consumers. The producers will be running all the time, so they would like to use on-demand instances and probably reserve them in the future. They are interested in running their consumers on AWS Spot instances to allow for scaling based on volumes and keep costs under control. They also want to ensure that the producers and consumers are not placed on the same nodes.

Can you help them?

Miztiik Automation: Kubernetes Scheduling: Node Selection, Affinity & AntiAffinity

🎯 Solutions

You can constrain a Pod so that it can only run on a particular set of nodes. There are several ways to do this, and the recommended approaches all use label selectors to facilitate the selection. Generally such constraints are unnecessary, as the scheduler will automatically do a reasonable placement.

  • nodeSelector: A field of the PodSpec that specifies a map of key-value pairs; the pod is only eligible to run on nodes that carry each of those labels (a quick way to inspect and apply node labels is shown just after this list).
  • podAffinity & podAntiAffinity: Allow you to constrain which nodes your pod is eligible to be scheduled on based on labels of pods already running on the node, rather than labels on the nodes themselves. They require a substantial amount of processing, which can slow down scheduling in large clusters significantly. The Kubernetes community does not recommend using them in clusters larger than several hundred nodes.
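
For nodeSelector to take effect, the target nodes must already carry the matching labels; in this demo the managed node groups apply them (for example app:miztiik_on_demand_ng on the on-demand nodes). A minimal sketch of how you could inspect node labels and add one by hand, with a placeholder node name:

  # List worker nodes along with all of their labels
  kubectl get nodes --show-labels

  # Attach a label to a specific node (replace with a node name from your cluster)
  kubectl label nodes ip-10-0-1-23.ec2.internal app=miztiik_on_demand_ng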

In this demo, I will show you how to use these options to constrain your pod deployments.

  1. 🧰 Prerequisites

    This demo, its instructions, scripts and CloudFormation templates are designed to be run in us-east-1. With a few modifications you can try it out in other regions as well (not covered here).

    • 🛠 AWS CLI Installed & Configured - Get help here
    • 🛠 AWS CDK Installed & Configured - Get help here
    • 🛠 Python Packages, change the below commands to suit your OS; the following are written for Amazon Linux 2
      • Python3 - yum install -y python3
      • Python Pip - yum install -y python-pip
      • Virtualenv - pip3 install virtualenv
  2. ⚙️ Setting up the environment

    • Get the application code

      git clone https://github.com/miztiik/eks-scheduling-with-affinity
      cd eks-scheduling-with-affinity
  3. 🚀 Prepare the dev environment to run AWS CDK

    We will use cdk to make our deployments easier. Let's go ahead and install the necessary components.

    # You should have npm pre-installed
    # If you DON'T have cdk installed
    npm install -g aws-cdk
    
    # Make sure you are in the root directory
    python3 -m venv .venv
    source .venv/bin/activate
    pip3 install -r requirements.txt

    The very first time you deploy an AWS CDK app into an environment (account/region), you'll need to install a bootstrap stack. Otherwise, just go ahead and deploy using cdk deploy.

    cdk bootstrap
    cdk ls
    # Follow on screen prompts

    You should see an output of the available stacks,

    eks-cluster-vpc-stack
    eks-cluster-stack
    ssm-agent-installer-daemonset-stack
  4. 🚀 Deploying the application

    Let us walk through each of the stacks,

    • Stack: eks-cluster-vpc-stack To host our EKS cluster we need a custom VPC. This stack will build a multi-AZ VPC with the following attributes,

      • VPC:
        • Subnets across 2 AZs, with public, private and isolated subnets in each.
        • 1 NAT Gateway for internet access from the private subnets

      Initiate the deployment with the following command,

      cdk deploy eks-cluster-vpc-stack

      After successfully deploying the stack, Check the Outputs section of the stack.
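
      If you prefer the CLI over the console, you can read the same outputs with the AWS CLI; a small sketch, assuming the CloudFormation stack name matches the CDK stack id used above:

      aws cloudformation describe-stacks \
        --stack-name "eks-cluster-vpc-stack" \
        --query "Stacks[0].Outputs"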

    • Stack: eks-cluster-stack As we are starting out with a new cluster, we will use mostly defaults. No logging is configured, nor any add-ons. The cluster will have the following attributes,

      • The control plane is launched with public access, i.e. the cluster can be accessed without a bastion host
      • c_admin IAM role added to aws-auth configMap to administer the cluster from CLI.
      • One OnDemand managed EC2 node group created from a launch template
        • It creates two t3.medium instances running Amazon Linux 2
        • Auto-scaling group with 2 desired instances.
        • The nodes will have a node role attached to them with AmazonSSMManagedInstanceCore permissions
        • Kubernetes label app:miztiik_on_demand_ng
      • One Spot managed EC2 node group created from a launch template
        • It creates t3.large instances running Amazon Linux 2
        • Auto-scaling group with 1 desired instance.
        • The nodes will have a node role attached to them with AmazonSSMManagedInstanceCore permissions
        • Kubernetes label app:miztiik_spot_ng

      In this demo, let us launch the EKS cluster in a custom VPC using AWS CDK. Initiate the deployment with the following command,

      cdk deploy eks-cluster-stack

      After successfully deploying the stack, check the Outputs section of the stack. You will find the *ConfigCommand* that allows you to interact with your cluster using kubectl.
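
      Once kubectl is pointed at the cluster, it is worth confirming that both node groups registered with the labels this demo relies on; a quick check (the eks.amazonaws.com/capacityType column is an assumption about how EKS labels managed node groups):

      # Show worker nodes with the labels used by this demo
      kubectl get nodes -L app -L eks.amazonaws.com/capacityType

      # Only the on-demand node group should match this selector
      kubectl get nodes -l app=miztiik_on_demand_ng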

    • Stack: ssm-agent-installer-daemonset-stack The EKS AMI used in this stack does not include the AWS SSM Agent out of the box. If we ever want to patch or run something remotely on our EKS nodes, this agent is really helpful to automate those tasks. We will deploy a daemonset that runs exactly once on each node, using a cron entry injection that deletes itself after successful execution. If you are interested, take a look at the daemonset manifest here stacks/back_end/eks_cluster_stacks/eks_ssm_daemonset_stack/eks_ssm_daemonset_stack.py. This is inspired by this AWS guidance.

      Initiate the deployment with the following command,

      cdk deploy ssm-agent-installer-daemonset-stack

      After successfully deploying the stack, you can connect to the worker node instances using SSM Session Manager.
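
      To sanity-check that the agent registered, you could list the SSM-managed instances and open a session from the CLI; a sketch (the instance id is a placeholder, and start-session assumes the Session Manager plugin is installed locally):

      # List instances that have registered with SSM
      aws ssm describe-instance-information \
        --query "InstanceInformationList[].{Id:InstanceId,Ping:PingStatus}"

      # Open an interactive shell on one of the worker nodes
      aws ssm start-session --target "i-0123456789abcdef0"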

  5. 🔬 Testing the solution

    We are all set with our cluster to deploy our pods.

    1. Create Producer Pods

      Since this demo is about the nodeSelector and affinity features of Kubernetes, we will run a busybox image and label it miztiik-producer. We will later use this label to constrain where our consumers can run.

      I have included a sample manifest here stacks/k8s_utils/sample_manifests/producer_anti_affinity.yml. The interesting thing to note here is the node selector label miztiik_on_demand_ng. This will ensure the deployment runs only on on-demand instances. We are also using podAntiAffinity to ensure that none of the producers are placed alongside consumer pods.

      apiVersion: apps/v1
      kind: Deployment
      metadata:
        name: on-demand-producers
      spec:
        selector:
          matchLabels:
            app: miztiik-producer
        replicas: 3
        template:
          metadata:
            labels:
              app: miztiik-producer
          spec:
            affinity:
              podAntiAffinity:
                requiredDuringSchedulingIgnoredDuringExecution:
                - labelSelector:
                    matchExpressions:
                    - key: app
                      operator: In
                      values:
                      - miztiik-consumer
                  topologyKey: "kubernetes.io/hostname"
            nodeSelector:
              app: miztiik_on_demand_ng
            containers:
            - name: busybox
              image: busybox
              command: [ "sh", "-c", "sleep 10h" ]
      

      Deploy the manifest,

      kubectl apply -f stacks/k8s_utils/sample_manifests/producer_anti_affinity.yml
      kubectl get po --selector app=miztiik-producer
      

      Expected output,

      NAME                                   READY   STATUS    RESTARTS   AGE
      on-demand-producers-5c7d4dfcc9-dmv4c   1/1     Running   2          26h
      on-demand-producers-5c7d4dfcc9-msssw   1/1     Running   2          26h
      on-demand-producers-5c7d4dfcc9-vcxfz   1/1     Running   2          26h
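
      To confirm the nodeSelector pinned these pods to the on-demand node group, compare the nodes the pods landed on against the labelled nodes; a quick check:

      # Show which node each producer pod was scheduled on
      kubectl get po -l app=miztiik-producer -o wide

      # These node names should all belong to the on-demand node group
      kubectl get nodes -l app=miztiik_on_demand_ng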
    2. Create Consumers:

      Since this demo is about the nodeSelector and affinity features of Kubernetes, we will run a busybox image and label it miztiik-consumer. Together with podAntiAffinity, this label keeps consumers and producers apart.

      I have included a sample manifest here stacks/k8s_utils/sample_manifests/consumer_anti_affinity.yml. The interesting thing to note here is the node selector label miztiik_spot_ng. This will ensure the deployment runs only on Spot instances. We are also using podAntiAffinity to ensure that none of the consumers are placed alongside producer pods.

      apiVersion: apps/v1
      kind: Deployment
      metadata:
        name: spot-consumers
      spec:
        selector:
          matchLabels:
            app: miztiik-consumer
        replicas: 3
        template:
          metadata:
            labels:
              app: miztiik-consumer
          spec:
            affinity:
              podAntiAffinity:
                requiredDuringSchedulingIgnoredDuringExecution:
                - labelSelector:
                    matchExpressions:
                    - key: app
                      operator: In
                      values:
                      - miztiik-producer
                  topologyKey: "kubernetes.io/hostname"
            nodeSelector:
              app: miztiik_spot_ng
            containers:
            - name: busybox
              image: busybox
              command: [ "sh", "-c", "sleep 10h" ]
      

      Deploy the manifest,

      kubectl apply -f stacks/k8s_utils/sample_manifests/consumer_anti_affinity.yml
      kubectl get po --selector app=miztiik-consumer
      

      Expected output,

      NAME                              READY   STATUS    RESTARTS   AGE
      spot-consumers-6cb6bd49dd-8kxwt   1/1     Running   2          26h
      spot-consumers-6cb6bd49dd-lc4rd   1/1     Running   2          26h
      spot-consumers-6cb6bd49dd-lpshs   1/1     Running   2          26h
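
      As a final check that the anti-affinity rules held, list both sets of pods with their node assignments; no producer and consumer should share a node. A sketch using a set-based selector:

      # Producers and consumers side by side, with the NODE column visible
      kubectl get po -o wide -l 'app in (miztiik-producer, miztiik-consumer)'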
  6. 📒 Conclusion

    Here we have demonstrated how to use Kubernetes node selectors and affinity rules to schedule pods. Given the complexities involved, these features need to be used carefully.

  7. 🧹 CleanUp

    If you want to destroy all the resources created by the stack, execute the below command to delete the stack, or you can delete the stack from the console as well:

    • Resources created during Deploying The Application
    • Delete CloudWatch Lambda LogGroups
    • Any other custom resources you have created for this demo
    # Delete from cdk
    cdk destroy
    
    # Follow any on-screen prompts
    
    # Delete the CF Stack, If you used cloudformation to deploy the stack.
    aws cloudformation delete-stack \
      --stack-name "MiztiikAutomationStack" \
      --region "${AWS_REGION}"

    This is not an exhaustive list, please carry out other necessary steps as maybe applicable to your needs.
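
    Before tearing down the cluster stacks, you may also want to remove the demo deployments created during testing; a minimal sketch:

    # Remove the producer and consumer deployments
    kubectl delete deployment on-demand-producers spot-consumers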

📌 Who is using this

This repository aims to show new developers, Solution Architects & Ops Engineers in AWS how to schedule pods using Kubernetes scheduling features. Based on that knowledge, these Udemy courses, course #1 and course #2, help you build complete architectures in AWS.

💡 Help/Suggestions or 🐛 Bugs

Thank you for your interest in contributing to our project. Whether it is a bug report, new feature, correction, or additional documentation or solutions, we greatly value feedback and contributions from our community. Start here

👋 Buy me a coffee

ko-fi Buy me a coffee ☕.

📚 References

  1. Kubernetes Docs: Assigning Pods to Nodes

🏷️ Metadata


Level: 200