Skip to content

vahana/etl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ETL Server

This is a demo server showcasing how to use prf, datasets and jobs libraries to build an ETL RESTful service. It exposes end points to submit, monitor and collect ETL jobs (distributed) using various database backends and Redis as job queue backend. It is built using PRF framework, which in turn is based on Pyramid.

PRF framework

PRF is a pyramid Restful Framework, that makes bootstraping, defining and processing RESTful endpoints generic, declarative and simplified. It supports various database backends, json payloads and RESTful resource tree declarations (e.g. nested resources). See prf repo for more info.

Datasets

Datasets is an abstraction over various data backends making managing and moving data around and between the backends transparent. It is used here to support ETL pipelines between different backends seamless and uniform. See datasets repo for more info.

Jobs

Jobs is a library used to define and queue jobs with batching, scheduling. It exposes RESTful endpoints to create and monitor jobs. See jobs repo for more info.

Setup and Run

git clone https://github.com/vahana/etl.git
mkvirtualenv etl #or use other ways of creating virtual env
cd etl
pip install --no-deps -r requirements.txt
#copy and edit config file
cp config_tmpl.ini local.ini
pserve local.ini

In a browser navigate to: http://0.0.0.0:6544/api/_ to get all defined endpoints.

Releases

No releases published

Packages

No packages published