Data in the Fast Lane

What can I do with Data in the Fast Lane (DFL)?

Data in the Fast Lane (DFL) is a data-oriented tool, design to perform extract, transform, and load (ETL) operations. DFL differentiates itself from other ETL tools as it focuses on scaling thanks to its backing by Apache Spark. DFL does not require any specific infrastructure, it can run on a laptop or leveraging containers in a cloud environment. It can run standalone or embedded in an application. Finally, it is designed to be extensible, with both the data transformation recipes and extension supporting source control.

DFL is:

Easy: we are not going to lie to you, every tool has its learning curve, but we have some great documentation in the doc folder and you'll see more soon on Medium and DZone.
Open Source: duh... and we picked a business-friendly Apache 2.0 license so you can innovate on top of our innovation with serenity.
Fast: Leverages Apache Spark, which is pretty fast.
Standalone: yeah, you don't need to add complex components, runtimes, gateway, or even license servers...
Scriptable: You can use DFL within scripts on the command line, you can start by having a look at the dfl.sh script and hack your way from there.
Embeddable: Use DFL as a framework and embed it directly in your Java/Scala apps as a Jar.
Scalable: have you heard of Spark's legendary scalability? We're just like that.
Extensible: Add your own transformations! From generating unique id from several secondary keys to pivoting aggregates, we got some pretty advanced transformations covered; but you will be able to add your very own!
Source control: you can source control everything in DFL: your data transformations and your extensions. No need to copy-paste stuff to extend your data pipelines!
Container: DFL runs like a charm in a container, as you'd expect, no?
Cloud-ready: running in a container is not enough, a cloud-ready application also needs to leverage cloud-native resources, and that's what DFL does!

Installation and first execution

Once you have cloned the source code, run:

mvn install -Dmaven.test.skip=true

Tests are working but may require external components that are not described in the requirements yet. To run Data in the Fast Lane (DFL), simply run:

dfl.sh <recipe-filename> [--dryrun]

Where your recipe file is a YAML file, like in:

dfl.sh ./src/test/resources/recipe-count-books.yaml

The execution returns:

DataFastLaneApp is starting and will process recipe: ./src/test/resources/recipe-count-books.yaml
2020-10-16 10:23:42.666 - INFO --- [           main] peration.run(CountRowsOperation.java:36): books has 24 row(s)
DFL completed in 9.2s with a status of true.

Advanced usage

DFL can also be used in an embedded way, as a Java framework. Look at the following examples:

LoadShowLabApp: executes a basic recipe.
LoadShowTransformLabApp: executes a recipe with a transformation.
LoadShowJoinLabApp: executes a recipe with a join between two CSV files.

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
data		data
doc		doc
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
dfl.sh		dfl.sh
eclipse-formatter.xml		eclipse-formatter.xml
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

doc

doc

src

src

.gitignore

.gitignore

Dockerfile

Dockerfile

LICENSE

LICENSE

README.md

README.md

dfl.sh

dfl.sh

eclipse-formatter.xml

eclipse-formatter.xml

pom.xml

pom.xml

Repository files navigation

Data in the Fast Lane

What can I do with Data in the Fast Lane (DFL)?

Installation and first execution

Advanced usage

More documentation

Video demo

User manual

Blog posts

About

Releases

Packages

Contributors 2

Languages

License

AdvanceAutoParts/datafastlane

Folders and files

Latest commit

History

Repository files navigation

Data in the Fast Lane

What can I do with Data in the Fast Lane (DFL)?

Installation and first execution

Advanced usage

More documentation

Video demo

User manual

Blog posts

About

Resources

License

Stars

Watchers

Forks

Languages