workload-scheduler

The Scheduler subsystem is implemented using Proto.Actor based on the Virtual Actors approach. You can read about interacting with Kafka here.

This component is responsible for distributing tasks among workloads. A Pod with its built-in pipeline acts as a workload.

The scheduler supports multi-tenant mode. Each client can create one or more applications. Several pipelines can be deployed within one application. Workload scaling is provided independently for each deployed pipeline.

For each pipeline, a Topic is created in a Kafka, which, in turn, is divided into several Partitions. All events from partitions are loaded using a forwarder into the scheduler of the corresponding pipeline type. The scheduler stores a dictionary of running tasks with their statuses. Dictionary with task states is saved to the TiDB database every few seconds. Scheduler guarantees At-least-once delivery.

Workers with pipelines periodically poll the scheduler about the availability of tasks for them. When a task is executed, the worker reports the pipeline execution statuses and the result to the scheduler. All communication is done through gRPC.

An example of creating a pipeline for application:

var pipeline = {
    name: 'ocr',
    chart: 'https://github.com/RyazanovAlexander/application.ocr/tree/main/chart'
};

http.post('http://localhost/api/applications/myname/pipelines', application, function(res){
    // ...
});

When creating this pipeline, a helm chart https://github.com/RyazanovAlexander/application.ocr/tree/main/chart will be deployed, which defines the following pipeline:

{
  "pipeline": [
    {
      "executor": "wget",
      "commands": [
        "wget -O /mnt/pipe/image.png {{url}}"
      ]
    },
    {
      "executor": "tesseract",
      "commands": [
        "tesseract /mnt/pipe/image.png /mnt/pipe/result",
        "cat /mnt/pipe/result.txt",
        "rm /mnt/pipe/result.txt"
      ]
    }
  ]
}

After successfully deploying the helm chart, we can send tasks for the pipelines through the workload scheduler:

var data = {
    url: 'https://some-addr.com/pic.png'
};

http.post('http://localhost/api/applications/myname/pipelines/ocr', data, function(res){
    // ...
});

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.deploy/charts/microservice		.deploy/charts/microservice
.devcontainer		.devcontainer
.diagrams		.diagrams
.github/workflows		.github/workflows
.vscode		.vscode
internal		internal
.cruft.json		.cruft.json
.env		.env
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
go.sum		go.sum
justfile		justfile
main.go		main.go
main_test.go		main_test.go
skaffold.yaml		skaffold.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

workload-scheduler

About

Releases

Packages

Languages

License

RyazanovAlexander/workload.scheduler

Folders and files

Latest commit

History

Repository files navigation

workload-scheduler

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages