Skip to content
/ ais-etl Public

Provides for deploying custom ETL containers on AIStore, with subsequent user-defined extraction-transformation-loading in parallel, on the fly and/or offline, locally to user data.

License

Notifications You must be signed in to change notification settings

NVIDIA/ais-etl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AIStore ETL (AIS ETL)

This repository contains the components and examples used to run Extract-Transform-Load (ETL) operations on an AIStore cluster.

For more information on how ETL works in AIStore, refer to the following documentation:

Repository Structure

  • transformers — ready-to-use ETL transformers that can be deployed on an AIStore cluster
  • runtime — ETL runtime definitions for init_class function
  • deploy — utility tools and deployment configurations
  • examples — example scripts for using and testing ETL features
  • docs — documentation-related content

Deploying AIStore for ETLs

To begin using ETLs in AIStore, you must first deploy an AIStore cluster on Kubernetes. Two reference deployments are provided, each serving a different purpose:

1. AIStore development with local Kubernetes

  • Folder: deploy/dev/k8s/kustomize
  • Intended for: Local AIStore development and functional testing of ETL transformers.
  • How to use: Start a local Kubernetes cluster (for example, with kind or minikube) and follow the step-by-step instructions in the referenced folder to deploy a minimal AIS cluster configured for ETL development.
  • Documentation: README

2. Production deployment with Kubernetes

  • Folder: deploy/prod/k8s
  • Intended for: Production-grade deployments of AIStore at scale.
  • How to use: Use the provided Dockerfiles to build AIS images and follow the tooling in the companion repository to install, upgrade, and monitor the cluster.
  • Documentation: AIS/K8s Operator and Deployment Playbooks

Verify your deployment

After deploying the AIS cluster, you can confirm that it is reachable by running:

$ ais etl show

A blank list ([]) and the absence of error messages indicate that the cluster is healthy and ready for you to register ETL transformers.

Note For the ETL functionality you must first initialize the transformation logic so that the cluster can execute it. The following folders and examples in this repository demonstrate how to build and register your own transformers.

About

Provides for deploying custom ETL containers on AIStore, with subsequent user-defined extraction-transformation-loading in parallel, on the fly and/or offline, locally to user data.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 14