PYELT

Usage

This example will create and fill the historical staging area:

pipeline = Pipeline(config)
pipe = pipeline.get_or_create_pipe('test_source', source_config)

source_file = CsvFile(get_root_path() + '/sample_data/patienten1.csv', delimiter=';')
source_file.reflect()
source_file.set_primary_key(['patientnummer'])
mapping = SourceToSorMapping(source_file, 'persoon_hstage', auto_map=True)
pipe.mappings.append(mapping)

pipeline.run()

More examples can be found on the GitHub repository of NL Healthcare.

Introduction

Pyelt is a Python DDL and ETL framework for creating and loading Data Vaults for datawarehousing.

Pyelt supports several data-layers, including Source-of-Record (SOR), Raw datavault (RDV), Business datavault (BDV) and Datamarts (DM)

Pyelt can import data from several different source systems such as fixed length files, csv-files, and different databases.

Pyelt is developed to run on a postgreSQL database.

Pyelt uses the SQLAlchemy.core only for the connection and for reflection. All other SQL statements (ddl, copy, insert and update statements) are created by the pyelt framework itself.

Write your own mappings to transfer and transform data from sources via staging into the data ware house.

Content

(current documentation on pythonhosted is only in dutch):

work in progress:

api docs (https://github.com/NLHEALTHCARE/PYELT/tree/master/docs/source/09api.rst/>_

Background

The pyelt framework is presently under development at NL Healthcare, with the aim to implement our next-generation datawarehouse (DWH2.0). It serves as the foundation for our work in the area of clinical business intelligence (CBI) and machine-learning.

Architectural cornerstones of this project are:

the Data Vault (DV) design pattern of Hans Hultgren
Domain-specific modelling of the DV, following HL7 v3 Reference Information Model and the Dutch Detailed Clinical Model Zorginformatiebouwstenen (in Dutch).

Name		Name	Last commit message	Last commit date
Latest commit History 160 Commits
docs		docs
pyelt		pyelt
sample_data		sample_data
samples		samples
tests		tests
.gitignore		.gitignore
.pypirc		.pypirc
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.rst		README.rst
TODOs.py		TODOs.py
main.py		main.py
make_dist.py		make_dist.py
setup.py		setup.py
versions.py		versions.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PYELT

Usage

Introduction

Content

Background

About

Releases

Packages

Languages

License

spgraham/PYELT

Folders and files

Latest commit

History

Repository files navigation

PYELT

Usage

Introduction

Content

Background

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages