GitHub - houthacker/lumberjack-parent: Toy project for the Extract part in an ETL pipeline

#Lumberjack

Description

Lumberjack is an ETL-pipeline.

It extracts data from both batch sources such as disk and streaming sources like Apache Kafka. This data is then prepared for the Transform phase and moved to the desired output location, such as disk or Apache Kafka. Finally, the data is loaded in the target system of choice. The modules of lumberjack are described below.

Lumberjack will support at least the following sources and destinations:

Disk (extract, transform, load)
Apache Kafka (extract, transform, load)
Apache Accumulo (load)

lumberjack-shared

Like a Spar is used to skid other trees to the landing, this module contains code which is used by other modules.

lumberjack-metrics

Much like a yarder operator, this module is used by other modules to generate metrics.

lumberjack-extract

Like a whistle punk is responsible for the safety of other elements further down the pipeline, this module loads data from source and moves it after preparation into the pipeline for further processing.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
lumberjack-extract		lumberjack-extract
lumberjack-metrics		lumberjack-metrics
lumberjack-shared		lumberjack-shared
.gitignore		.gitignore
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Description

lumberjack-shared

lumberjack-metrics

lumberjack-extract

About

Releases

Packages

Languages

houthacker/lumberjack-parent

Folders and files

Latest commit

History

Repository files navigation

Description

lumberjack-shared

lumberjack-metrics

lumberjack-extract

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages