Job Orchestrator
Goal

Job Orchestrator's main goal is to let developers adopt a DevOps approach as easily as possible. To do that, it connects to several APIs:

  • Gitlab API: gives us the ability to search the Gitlab Docker registry for images whose name contains the word "runner". Any image containing that word is considered an image that can run jobs from Nexus (see the sketch after this list).
  • Nexus API: Nexus stores the jobs. We use the Nexus API to search its repositories for the jobs we want to run on Kubernetes with the Gitlab runner images.
  • Rancher API: after choosing a Nexus job and a Gitlab runner Docker image, we can configure the task. When the form is submitted, Job Orchestrator creates a Kubernetes Job resource and deploys it through the Rancher API.
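
As an illustration of the first step, here is a minimal sketch of searching the Gitlab registry for runner images, using the constants from the configuration file shown later in this README. This is illustrative code, not the application's actual implementation:

<?php

// Minimal sketch: list the registry repositories of the configured group
// (standard Gitlab v4 endpoint) and keep those whose path contains the
// word "runner".
$ch = curl_init(GITLAB_API_URL . 'groups/' . urlencode(GITLAB_GROUP_PROJET) . '/registry/repositories');
curl_setopt($ch, CURLOPT_HTTPHEADER, ['PRIVATE-TOKEN: ' . GITLAB_API_ACCESS_TOKEN]);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
$repositories = json_decode(curl_exec($ch), true) ?: [];
curl_close($ch);

// Any image whose path contains "runner" is considered able to run Nexus jobs.
$runnerImages = array_filter($repositories, function ($repo) {
    return strpos($repo['path'], 'runner') !== false;
});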

See more about Runners

Get Started

As you may have gathered, Job Orchestrator needs an ecosystem of applications in order to work. It serves as a bridge between all of their APIs.

Dependencies

  • Kubernetes cluster >=v1.15.15

    • Rancher >=2.2.10 installed and API endpoint accessible.
    • A Rancher project with rights to: [check workload statuses, API access, check logs].
    • A Kubernetes namespace with rights to: [deploy jobs, create secrets, create configmaps].
  • Gitlab >=12.0.4

    • Gitlab installed and API endpoint accessible.
    • A group project containing the Docker runner image repositories.
    • An API access token for this particular group with rights to: [read registries, read repositories].
  • Nexus >=3.29.2-02

    • Nexus installed and API endpoint accessible.
    • A default repository.
    • A user/password with rights to: [read artifacts, run search queries].
  • Spark [Optional]

    • URL of the Spark UI.
  • Airflow [Optional]

    • URL of the Airflow UI.

Configuration

Application Configuration File

/var/www/html/conf/conf.php :

<?php 

// PROXY
define('PROXY_CONF',"proxy.domain.net");
define('NO_PROXY_CONF',".domain.net");

// LDAP
define('LDAP',FALSE);

// KUBERNETES / RANCHER
define('KUBERNETES_API_URL','https://<rancher-url>/v3/clusters/c-*****');
define('KUBERNETES_ACCESS_KEY','token-******');
define('KUBERNETES_PROJECT_KEY','p-******');
define('KUBERNETES_ACCESS_SECRET','*****************************');
define('KUBERNETES_NAMESPACE','<namespace>');

// NEXUS
define('NEXUS_URL','http://localhost:8081/');
define('NEXUS_API_URL','http://localhost:8081/service/rest/v1/');
define('NEXUS_DEFAULT_REPOSITORY','<default-repo>');
define('NEXUS_LOGIN',FALSE);
define('NEXUS_USER', '<user>');
define('NEXUS_PASSWORD', '*****************');

// GITLAB
define("GITLAB_API_URL","https://<gitlab-url>/api/v4/");
define("GITLAB_API_ACCESS_TOKEN","*****************");
define("GITLAB_GROUP_PROJET","<gitlab-group-project>");
define("GITLAB_MONITOR_URL","<gitlabmonitor-url>");

// SPARK
define('SPARK_LIVY_URL', 'http://<spark-url>/ui');

// AIRFLOW
define('AIRFLOW_URL','http://<airflow-url>/admin/');

?>
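
To illustrate how the KUBERNETES_* constants above fit together, here is a hypothetical sketch of checking workload statuses through the Rancher 2.x project API. The project URL scheme, query parameter, and response fields are assumptions based on Rancher's documented v3 API, not code taken from Job Orchestrator:

<?php

// Hypothetical sketch: Rancher project URLs usually follow the scheme
// https://<rancher-url>/v3/project/<cluster-id>:<project-id>.
$projectUrl = 'https://<rancher-url>/v3/project/c-*****:' . KUBERNETES_PROJECT_KEY;

$ch = curl_init($projectUrl . '/workloads?namespaceId=' . KUBERNETES_NAMESPACE);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
// Rancher API keys authenticate over HTTP basic auth: access key as the
// user, secret as the password.
curl_setopt($ch, CURLOPT_USERPWD, KUBERNETES_ACCESS_KEY . ':' . KUBERNETES_ACCESS_SECRET);
$workloads = json_decode(curl_exec($ch), true);
curl_close($ch);

foreach ($workloads['data'] ?? [] as $w) {
    echo $w['name'] . ' => ' . $w['state'] . PHP_EOL;
}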

Job Orchestrator uses LDAP to authenticate users.

You can bypass the auth system by setting the LDAP constant to FALSE (the default).

If you want to use LDAP, you have to provide an LDAP configuration file:

LDAP Configuration File

/var/www/html/ldapconf/conf.php :

<?php 

// LDAP configuration array
$config = [  
	'hosts'    => ['<ldap-ip>'],
	'base_dn'  => '',
	'username' => '*****************',
	'password' => '*****************',
	'account_suffix'   => ''
];

// Auth chain used to filter which users are authorised to connect to the application.
$service_ldap_authorization_domain = 'CN=,OU=,DC=';

?>
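
The shape of this $config array matches the Adldap2 PHP client; whether Job Orchestrator uses that exact library is an assumption. As a hypothetical sketch, such a configuration is typically consumed like this:

<?php

// Hypothetical sketch using the Adldap2 client; not the application's
// actual authentication code.
$ad = new \Adldap\Adldap();
$ad->addProvider($config);

try {
    $provider = $ad->connect();
    // Bind as the user who is logging in.
    if ($provider->auth()->attempt($username, $password, true)) {
        // Authenticated: membership can then be checked against
        // $service_ldap_authorization_domain before opening a session.
    }
} catch (\Adldap\Auth\BindException $e) {
    // Could not reach or bind to the LDAP server.
}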

NFS Configuration File

For some tasks, you may need to create mount points on the runners. So that the access credentials can be injected as environment variables into the pod, they have to be provided to Job Orchestrator. The following configuration file serves this purpose (a sketch of how an entry might be consumed follows the example):

/var/www/html/conf/conf_cifs.json [Optional] :

{
	"nfs-server-1": {
		"name": "<nfs-server-ip>",
		"user": "<nfs-user>",
		"password": "**************",
		"domain": "<nfs-domain>"
	},
	"nfs-server-2": {
		"name": "",
		"user": "",
		"password": "",
		"domain": ""
	},
	...
}
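
As a hypothetical sketch of how such an entry might be consumed, the snippet below reads the file and turns one server's credentials into environment variables for the job pod. The variable names are made up for illustration; the real names injected by Job Orchestrator are not documented here:

<?php

// Hypothetical sketch: expose one CIFS server's credentials as env vars
// for the Kubernetes Job spec. The CIFS_* names are illustrative only.
$servers = json_decode(file_get_contents('/var/www/html/conf/conf_cifs.json'), true);
$nfs = $servers['nfs-server-1'];

$env = [
    ['name' => 'CIFS_NAME',     'value' => $nfs['name']],
    ['name' => 'CIFS_USER',     'value' => $nfs['user']],
    ['name' => 'CIFS_PASSWORD', 'value' => $nfs['password']],
    ['name' => 'CIFS_DOMAIN',   'value' => $nfs['domain']],
];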

Run it!

You can run Job Orchestrator in three different ways:

Docker Image

To run it anywhere (note that the mounted configuration directory must be an absolute path):

docker run -p 80:80 -v "$(pwd)/conf":/var/www/html/conf/ ghcr.io/curie-data-factory/job-orchestrator:latest

Helm Chart

To deploy in production environments:

helm repo add curiedfcharts https://curie-data-factory.github.io/helm-charts
helm repo update

helm upgrade --install --namespace default --values ./my-values.yaml my-release curiedfcharts/job-orchestrator

More info here

From sources

For development purposes:

  1. Clone the git repository:
git clone https://github.com/curie-data-factory/job-orchestrator.git
cd job-orchestrator/
  2. Create the conf files & folders:
mkdir conf ldapconf
touch conf/conf.php
touch conf/conf_cifs.json
touch ldapconf/conf.php
  3. Set the configuration variables (see the templates above).
  4. Run the Docker Compose stack:
docker-compose up -d
  5. Exec into the Docker container:
docker exec -it joborchestrator /bin/bash
  6. Resolve the composer package dependencies (see here for installing and using composer):
composer install --no-dev --optimize-autoloader

Screenshots & User Guide

Screenshots: login page, repo search, kube view.

Build Doc

The documentation is compiled from markdown sources using Material for MkDocs. To compile the documentation:

  1. Go to your source directory:
cd job-orchestrator
  2. Run the docker build command:
docker run --rm -i -v "$PWD:/docs" squidfunk/mkdocs-material:latest build

Runners

Runners are containers that carry language-specific binaries, letting tasks run in a well-defined, managed software environment that ensures reproducibility and predictability.

At the Curie Institute we use runner Docker images for every task running in our cluster. A runner can be built from any base Docker image, provided it carries a Python environment and executes the bootstrap-script.py at start.

The bootstrap script can be found in the script folder of this repo. It needs to be executed as the entrypoint of the container, like so (assuming the script has been copied into the image as /run.py):

CMD ["python","/run.py"]

Data Factory - Institut Curie - 2021