-
Notifications
You must be signed in to change notification settings - Fork 961
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for showing lineage of table across ETL's in a data warehouse #69
Comments
@apoorva007 we haven't build the support in databuilder for neo4j as metadata backend. But I know @verdan 's team has built support for pushing lineage info into atlas which is another proxy offering for metadata. cc @verdan for further comment. |
@apoorva007 you can use Amundsen with either Neo4j or Apache Atlas as the backend. We use Apache Atlas (and are constantly working on improving and adding new features in Amundsen) as our metadata proxy. Atlas supports the lineage out of the box, which is a one of a huge advantage of using Atlas as metadata proxy. It connects with many elements like spark, hive etc. using hooks and bridges, and thus records the whole lineage out of the box. |
First option - link to Atlas or other 3rd partyAs @verdan points to Atlas already has strong lineage modelling and UI - a shortcut to navigating to that from Amundsen might simply be to populate the 2nd option - "the younger brother" populate Neo4jAnother option which we could name "Atlas lineage's younger brother" would appear if you take a closer look at Amundsen's current Neo4j backend's data model. You'll notice the ingredients to model and store lineage are already in the diagram. It's the In fact you could claim that the - Related issue in the frontend repo discussing "more ambitious" Amundsen lineage visualization: https://github.com/lyft/amundsenfrontendlibrary/issues/110 |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. |
2nd option above has moved a bit forward with merge of amundsen-io/amundsendatabuilder#126 as far as I can see this means with Neo4j #69 (comment) is essentially implemented so you can build a Databuilder job which will build lineage in Amundsen Metadata service. Submitting a PR to ingest some sample data modeled after previous sample ingestion PRs seems like a good If you have React skills https://github.com/lyft/amundsenfrontendlibrary/issues/110 might be what you want to look at Roadmap linkhttps://github.com/lyft/amundsen/blob/master/docs/roadmap.md#native-lineage-integration |
* [AMD-120] Add relation pre-processor in Neo4jPublisher * Update * Added DeleteRelationPreprocessor * Added DeleteRelationPreprocessor * Update * Update
Signed-off-by: feng-tao <fengtao04@gmail.com>
Signed-off-by: feng-tao <fengtao04@gmail.com>
Signed-off-by: feng-tao <fengtao04@gmail.com>
* [AMD-120] Add relation pre-processor in Neo4jPublisher * Update * Added DeleteRelationPreprocessor * Added DeleteRelationPreprocessor * Update * Update
* [AMD-120] Add relation pre-processor in Neo4jPublisher * Update * Added DeleteRelationPreprocessor * Added DeleteRelationPreprocessor * Update * Update
* Fixes the names and tags issue for atlas * Moved the helper functions to atlasclient module * Updates the atlasclient package to pyatlasclient
Signed-off-by: feng-tao <fengtao04@gmail.com>
disable KDN fallback for specific owners
Signed-off-by: feng-tao <fengtao04@gmail.com>
* [AMD-120] Add relation pre-processor in Neo4jPublisher * Update * Added DeleteRelationPreprocessor * Added DeleteRelationPreprocessor * Update * Update
Hey,
Wanted to understand if there is a way to push table lineage related metadata to amundsen.
Couldn't find any related documentation. If someone can point towards it, that would be really helpful
The text was updated successfully, but these errors were encountered: