
sumaniitm/data-lineage-change-explorer


Explore changes along the lineage of a data entity

The main branch of this repo holds the base working application, which runs standalone to visualize the quality and drift of data entities across their lineage. The current version of the Docker image is available at https://hub.docker.com/layers/171555045/sumaniitm/data_lineage_change_explorer/v0_2/images/sha256-304f77592cd3705b5198dad3ff058cc0a7d6e469b83fa8491639bbc5d3aaa2f8?context=repo

The image has been tested on a MacBook Pro with PostgreSQL installed locally.

Note that the core of this work is a breadth-first search over the nodes of a directed graph, so overlapping edges that emanate from or descend on multiple nodes are not displayed. This is needed to keep a clear view of the edge levels showing the percentage difference in the data.
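As a rough illustration of the idea, here is a minimal sketch of a breadth-first search that assigns a level to each node of a directed lineage graph, plus a helper that computes a percentage difference between two values. The function names (`bfs_levels`, `percent_difference`), the node names and the sample graph are all hypothetical and are not taken from this repository's code.

```python
from collections import deque


def bfs_levels(edges, source):
    """Return {node: level} for every node reachable from `source`.

    `edges` is a list of (parent, child) pairs of a directed graph.
    Each node is visited once, via the first edge that reaches it.
    """
    adjacency = {}
    for parent, child in edges:
        adjacency.setdefault(parent, []).append(child)

    levels = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for child in adjacency.get(node, []):
            if child not in levels:  # skip nodes already reached
                levels[child] = levels[node] + 1
                queue.append(child)
    return levels


def percent_difference(before, after):
    """Percentage change from `before` to `after`; None if `before` is 0."""
    if before == 0:
        return None
    return (after - before) / before * 100.0


# Hypothetical lineage: one source, two transformations, one destination.
edges = [
    ("source", "transform_a"),
    ("source", "transform_b"),
    ("transform_a", "destination"),
    ("transform_b", "destination"),
]
levels = bfs_levels(edges, "source")
print(levels)
print(percent_difference(200, 150))
```

In this sketch the node "destination" is reached by two edges but gets a single level, because the search records only the first visit.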

Do not forget to create your own credentials.py containing two functions that simply return the database username and credentials, then compile it to a .pyc file. The path to this .pyc file must be set in your config.txt.
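One possible way to produce such a file is sketched below, using the standard-library `py_compile` module. The function names `get_username` and `get_password`, the placeholder values and the helper `build_credentials_pyc` are assumptions made for illustration; the names the application actually expects are not stated here, so adjust them to match the code.

```python
import py_compile
import tempfile
from pathlib import Path

# Hypothetical contents of credentials.py: two functions that simply
# return the database username and password. Replace the placeholders.
CREDENTIALS_SOURCE = '''\
def get_username():
    return "my_db_user"


def get_password():
    return "my_db_password"
'''


def build_credentials_pyc(target_dir):
    """Write credentials.py into `target_dir` and compile it to credentials.pyc."""
    target = Path(target_dir)
    source_path = target / "credentials.py"
    source_path.write_text(CREDENTIALS_SOURCE)

    pyc_path = target / "credentials.pyc"
    # cfile sets the output path explicitly instead of using __pycache__.
    py_compile.compile(str(source_path), cfile=str(pyc_path), doraise=True)
    return pyc_path


# A temporary directory is used here only to keep the sketch self-contained.
pyc_file = build_credentials_pyc(tempfile.mkdtemp())
print(pyc_file)
```

The printed path is the value that would then go into config.txt. A .pyc file is tied to the Python version that produced it, so it is safest to compile it with the same interpreter that runs the application.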

Main landing page

[Image: gango3]

Intermediate page showing the list of entities (based on the user's entries in config.txt), with navigation links

[Screenshot: 2021-10-12 at 11:52:26 AM]

Final output showing the changes in the source, transformations and destination of the data entity selected on the intermediate page

[Screenshot: 2021-10-12 at 11:53:33 AM]

About

This repository is an attempt to track data drift and changes within a lineage, using breadth-first search over the data attributes present in the database.
