
Full HDFS persistence support (read/write) #23

Closed
wajda opened this issue Feb 15, 2018 · 3 comments

Comments

@wajda
Contributor

wajda commented Feb 15, 2018

Currently the HDFS persistence factory can only write the lineage data to a file, but never reads it back. To make full use of this type of persistence we need to be able to read and parse the written lineage JSON, so that it can be linked with descendant lineages and visualized in the Spline UI.
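A minimal sketch of what the missing read side might look like, assuming the lineage is stored as a single JSON document on HDFS. The `HdfsLineageReader` object, the method name, and the `_LINEAGE` file path in the usage note are illustrative only, not the actual Spline API:

```scala
import java.net.URI

import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, Path}

import scala.io.Source

object HdfsLineageReader {

  // Hypothetical read counterpart to the existing HDFS lineage writer:
  // loads the raw lineage JSON previously written to HDFS so it can later
  // be parsed and linked with descendant lineages.
  def readLineageJson(hdfsUri: String, lineagePath: String): String = {
    val fs = FileSystem.get(new URI(hdfsUri), new Configuration())
    val in = fs.open(new Path(lineagePath))
    try Source.fromInputStream(in, "UTF-8").mkString
    finally in.close()
  }
}
```

Usage would be along the lines of `HdfsLineageReader.readLineageJson("hdfs://namenode:8020", "/data/foo/_LINEAGE")`, with the parsing into the lineage model happening downstream.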

@GeorgiChochov

The ability to persist lineage offline and import it after the fact would be quite useful.
I know you have ideas for how to expand on this, @wajda.

@wajda
Contributor Author

wajda commented Aug 8, 2019

We could export/import the lineage.
Imaginary example:

  1. I have a file A on HDFS, and its lineage in Spline
  2. I want to copy A to a different infrastructure (network, cluster, cloud, you name it), monitored by another independent Spline instance, and I don't want to lose the lineage of A
  3. Using the Spline UI or CLI I dump A's full lineage to a file and carry it with A to the new infrastructure
  4. There the lineage dump can be imported manually (again via the UI or CLI), or even automatically on the first Spline-tracked read from A (a rough sketch of this round trip follows the list).
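A rough sketch of steps 3–4 under the same assumptions as above: the lineage dump is just a JSON file that travels alongside A, and the import call on the destination side is a placeholder, since no such Spline API exists yet:

```scala
import java.net.URI

import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, FileUtil, Path}

object LineageCarryOver {

  // Hypothetical carry-over of a previously dumped lineage JSON file from
  // the source cluster to the destination cluster, alongside the data file A.
  def carryLineage(srcUri: String, dstUri: String, lineageDump: String): Unit = {
    val conf = new Configuration()
    val srcFs = FileSystem.get(new URI(srcUri), conf)
    val dstFs = FileSystem.get(new URI(dstUri), conf)

    // Step 3: copy the dump to the new infrastructure (deleteSource = false).
    FileUtil.copy(srcFs, new Path(lineageDump), dstFs, new Path(lineageDump), false, conf)

    // Step 4 would parse the dump and register it with the local Spline
    // instance; that import step is hypothetical and not shown here.
  }
}
```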

@wajda
Contributor Author

wajda commented Dec 10, 2020

Superseded by #815 and AbsaOSS/spline-spark-agent#156
