Realizing_Microservices_HPC

Realizing Microservices and High Performance Computing

Team Mentor
Dan Lambright (dlambrig@gmail.com)

Team Members
Athanasios Filippidis (aflpd@bu.edu)
Nadim El Helou (nadimh@bu.edu)
Anqi Guo (anqiguo@bu.edu)
Danny Trinh (djtrinh@bu.edu)
Jialun Wang (wjl1996@bu.edu)

Project Description

If you are looking for our setup and installation guide with all the steps to reproduce the project, please refer to our wiki.
If you are looking for our final presentation, please refer to this link.

1. Vision and Goals Of The Project:

This project has many different and equally intriguing aspects, and it can be thought of as a continuation of last year's student group's work. The first aspect is the automation of running Lustre in Kubernetes. Lustre is an open-source, distributed parallel file system designed for scalability, high performance, and high availability. To achieve this we will create Golang reconciler operators that monitor the cluster, automatically scale Lustre based on events such as the addition or removal of available instances, and handle node and process crashes.
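To make the reconciler idea concrete, the sketch below polls the cluster and reports when the number of running Lustre pods drifts below a desired count. It is a minimal sketch only: the lustre namespace, the app=lustre-ost label, and the fixed target of three instances are assumptions for illustration, not the project's actual configuration.

```go
// Minimal reconciler-style loop (sketch). Assumes Lustre pods are labeled
// "app=lustre-ost" in a "lustre" namespace; both names are hypothetical.
package main

import (
	"context"
	"fmt"
	"time"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

const desiredInstances = 3 // hypothetical target; would come from configuration

func main() {
	cfg, err := rest.InClusterConfig() // the operator runs inside the cluster
	if err != nil {
		panic(err)
	}
	clientset, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		panic(err)
	}

	for {
		pods, err := clientset.CoreV1().Pods("lustre").List(
			context.TODO(), metav1.ListOptions{LabelSelector: "app=lustre-ost"})
		if err != nil {
			fmt.Println("listing Lustre pods failed:", err)
		} else {
			running := 0
			for _, p := range pods.Items {
				if p.Status.Phase == corev1.PodRunning {
					running++
				}
			}
			if running < desiredInstances {
				// A real operator would create a new Lustre instance here;
				// this sketch only reports the drift it detected.
				fmt.Printf("%d/%d Lustre pods running: scale up needed\n",
					running, desiredInstances)
			}
		}
		time.Sleep(30 * time.Second)
	}
}
```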

While we gain the advantages of a Kubernetes-managed cluster application, there will be some performance deterioration due to the overlay network (more about this in Microsoft's Freeflow). We plan to tackle this either by utilizing remote direct memory access (RDMA) or by using the same IPC namespace between different containers hosted on the same machine so that they have shared memory access to each other. This should reduce communication latency and hopefully overcome the aforementioned overhead.
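As a rough illustration of the shared-memory idea, the sketch below lets two processes (which would live in different containers) exchange a message through a file under /dev/shm. It assumes the containers see the same /dev/shm mount, for example through a memory-backed shared volume or a shared IPC/host namespace; the path /dev/shm/lustre-ipc-demo is purely illustrative.

```go
// Shared-memory hand-off sketch. Run with no arguments to write the message,
// or with "read" to consume it from the other container. Assumes both
// containers share the same /dev/shm (e.g. a memory-backed shared volume).
package main

import (
	"fmt"
	"io/ioutil"
	"os"
)

const shmPath = "/dev/shm/lustre-ipc-demo" // hypothetical shared location

func main() {
	if len(os.Args) > 1 && os.Args[1] == "read" {
		// Consumer side: read the bytes left in shared memory by the producer.
		data, err := ioutil.ReadFile(shmPath)
		if err != nil {
			panic(err)
		}
		fmt.Printf("read %d bytes: %s\n", len(data), data)
		return
	}
	// Producer side: the write never touches the overlay network, which is
	// exactly the overhead this approach tries to avoid.
	if err := ioutil.WriteFile(shmPath, []byte("hello from container A"), 0644); err != nil {
		panic(err)
	}
	fmt.Println("wrote message to", shmPath)
}
```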

2. Users/Personas Of The Project:

Cloud-native HPC with Lustre most benefits users whose workloads require immense storage (petabytes worth of data). I/O performance has a widespread impact on these types of applications because of Lustre's scalable parallel read/write capabilities and the extremely fast sharing of information between containerized workloads through RDMA. In addition, Cloud-native HPC has features such as cluster monitoring, autoscale-up/autoscale-down, and autorecovery operators for Lustre, which enable IT/devops engineers to support researchers in their HPC tasks. Since Cloud-native HPC incorporates Kubernetes, users are not tied to a single cloud computing vendor such as AWS. At the end of the day, we believe the main personas for this project will be two kinds of people: those who interact directly with the infrastructure and will fully utilize our work, and those who benefit from this utilization. The following are examples of our main target users:

Target Users

  • MIT's Lincoln Laboratory utilizes Lustre to feed files to a supercomputer so the supercomputer can work on the files in parallel. MIT researchers rely on the expertise of IT/devops engineers to help them load their algorithmic data into the filesystem for HPC tasks. Therefore, the IT/devops engineers will be deploying Cloud-native HPC with Lustre filesystems into Kubernetes for their researchers to use.
  • Data engineers and analysts analyze massive volumes of financial data to detect fraudulent activity and perform financial analysis. Lustre benefits these users because financial data is ever-increasing and can become costly for financial institutions if the filesystem has performance issues.

Levels of Interaction

While researchers and data analysts utilize Lustre, they do not directly interact with the backend of the filesystem. They will never see pods being scaled up and down; Lustre pods will crash in the background and autorecover without any user interaction. For target users it should essentially be a reliable service: all they see is a high-performance storage service for their HPC tasks.

The main individuals interacting with the backend of Cloud-native HPC with Lustre will be IT/devops engineers. They will easily be able to set up Lustre filesystems for their clients through the provided command-line scripts on Kubernetes.

3. Scope and Features Of The Project:

Continue implementing features developed by last year's student group

  • Set up Kubernetes on MOC instances
  • Run their command-line scripts to automate running Lustre on Kubernetes

Go scripts to create "operators" that monitor the cluster and automate the maintenance of Lustre within Kubernetes (see the sketch after this list)

  • Create a new Lustre instance when one crashes
  • Easy auto-scale of the number of instances based on new instance autodiscovery
  • Simplify Lustre code upgrades
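One natural way to express these three features declaratively is a custom resource that the operators reconcile against. The sketch below is hypothetical: this document does not commit to a LustreCluster CRD, and the type and field names are illustrative only.

```go
// Hypothetical custom resource types for a Lustre cluster, in the usual
// Kubernetes API-types style. The operators would reconcile Spec vs. Status.
package v1alpha1

import metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"

// LustreClusterSpec is the desired state the operators drive toward.
type LustreClusterSpec struct {
	// Instances is the desired number of Lustre instances; the autoscale
	// operator grows or shrinks the cluster toward this number.
	Instances int32 `json:"instances"`
	// LustreVersion makes upgrades declarative: bump the version and the
	// operator rolls the instances to it.
	LustreVersion string `json:"lustreVersion"`
}

// LustreClusterStatus is the observed state, used by the health-monitoring
// operator to notice crashed instances and recreate them.
type LustreClusterStatus struct {
	ReadyInstances int32 `json:"readyInstances"`
}

// LustreCluster is the top-level custom resource object.
type LustreCluster struct {
	metav1.TypeMeta   `json:",inline"`
	metav1.ObjectMeta `json:"metadata,omitempty"`

	Spec   LustreClusterSpec   `json:"spec,omitempty"`
	Status LustreClusterStatus `json:"status,omitempty"`
}
```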

Explore RDMA principles

  • Since RDMA is not available in MOC, simulate RDMA: attempt using open-source "soft RDMA" software
  • Experiment with sharing memory between containers on the same machine

Since we can only simulate RDMA, we may not be able to do many performance tests other than just for sharing memory between containers. We will only be managing the file system and storage aspects of this project; we will not be conducting any actual high performance computing or data analysis.

4. Solution Concept

Global Architectural Structure Of the Project

Below is a description of the system components that are building blocks of the architectural design:

  • Container: Standard, lightweight software unit that provides isolation for code and runtime environment.
  • Kubernetes: Open-source container orchestration platform, automating container operations.
  • Pod: Container wrapper in Kubernetes. Kubernetes’ management of containers is based on operations on pods.
  • Kubernetes node: Worker machine in Kubernetes cluster. Each node contains kubelet (a component to make sure containers are running), container runtime, and kube-proxy (a node-level network proxy).
  • KubeVirt: Runs and manages VMs as Kubernetes pods, allowing the VMs to access pod networking and storage.
  • Lustre: Open-source, parallel distributed file system, which is generally used for high performance computing.
  • Operators: Custom controllers that monitor Kubernetes nodes and perform operations on them (e.g. create, destroy, restore).
  • Freeflow: High performance container overlay network. In our project, we may use it for RDMA communication, or learn from its concept to implement our memory sharing part.

Figure 1 presents the global architectural design of this project. Lustre nodes run inside containerized KubeVirt virtual machines. The containers are managed in Kubernetes pods, and each Kubernetes node can contain multiple pods. The operator will automatically create or destroy Kubernetes nodes according to user demand, or restore a node when one crashes. In each VM instance on MOC, there is a memory sharing module for nodes and containers.
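For the autodiscovery side of the operator (reacting when instances join or leave the cluster), a Kubernetes watch on Node objects is one plausible building block. The sketch below only logs the events where a real operator would add or retire Lustre capacity; error handling and reconnection are omitted, and the in-cluster configuration is an assumption.

```go
// Node autodiscovery sketch: watch for nodes joining or leaving the cluster.
// A real operator would react by adding or draining Lustre instances.
package main

import (
	"context"
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/watch"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func main() {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}
	clientset, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		panic(err)
	}

	watcher, err := clientset.CoreV1().Nodes().Watch(context.TODO(), metav1.ListOptions{})
	if err != nil {
		panic(err)
	}
	for event := range watcher.ResultChan() {
		node, ok := event.Object.(*corev1.Node)
		if !ok {
			continue
		}
		switch event.Type {
		case watch.Added:
			// New instance discovered: candidate for a new Lustre node.
			fmt.Println("node added:", node.Name)
		case watch.Deleted:
			// Instance removed: the operator would shrink or rebalance here.
			fmt.Println("node removed:", node.Name)
		}
	}
}
```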

5. Acceptance criteria

The MVP is to set up Lustre and get it running on MOC with Kubernetes across multiple machines.

  • Pick up the previous group's work: add and remove Lustre components on the cloud system
  • Automate Lustre scaling by writing custom Golang operators for Kubernetes
  • With the support of CloudLab bare-metal machines, run RDMA tests and build Lustre on top of an RDMA-capable machine

6. Release Planning:

10/1/2020 Demo 1: Setup single instance on MOC

  • Set up Kubernetes on a single instance within a cluster on MOC
  • Set up multiple machines with Kubernetes in a single cluster
  • Presentation Slides

10/15/2020 Demo 2: Multi-instance within MOC and Operator exploration

  • Set up 3 different instances each with Kubernetes running on the same cluster
  • Implement the first two GO operators running locally. We will first focus on the autodiscovery/autoscale-up operator and then on the auto-shrinking operator
  • Freeflow exploration for container communication based on shared memory between containers
  • Presentation Slides

10/29/2020 Demo 3: Containers running Lustre

  • Adjust the first two GO operators to work on a MOC machine running Kubernetes
  • Implement the third GO operator, for health monitoring and respawning upon failure, and deploy it on MOC
  • Demonstrate previous year’s project running within MOC setup
  • Explore the feasibility of RDMA software simulation and decide whether to head this way or to implement a memory-sharing strategy for nodes and containers in the same MOC instance
  • Presentation Slides

11/12/2020 Demo 4: Memory Sharing

  • Finalize integration of the GO operators with Kubernetes instances within MOC and demonstrate Lustre operators running
  • Start implementing the decided memory sharing strategy for the Lustre nodes
  • Presentation Slides

12/3/2020 Demo 5:

Uncertainties/Questions

  • In the creation of ReplicaSets, what happens when the relevant Lustre pods fail and recover? Will they recover nicely? Will there be any metadata to consider in the recovery process, or will everything just work? Will corruption occur?
