Skip to content

newsha1998/DML-Lab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

About this project

This project is dedicated to making deployments of distributed machine learning (ML) and deep learning (DL) workflows on on-premise infrastructure simple and scalable.

Components

Kubernetes (K8s) is used for clustering and resource management as well as scaling and management of containerized applications. Apache spark is leveraged for data processing and spark ml is used for development and deployment of distributed machine learning jobs and Kubeflow for distributed deep learning pipelines.

Project contents

Documentation

All documentation is included in docs folder where the following can be found:

  • installation: Guide for installing Kubernetes, Kubeflow, monitoring tools
  • howto: Explanations for possible workflows such as ML and DL job submission, using monitoring

Results

The results of selected benchmark tasks are in this folder

Deliverables

Weekly and biweekly reports and other required reports can be found here

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •