This project describes how to write a full ETL data pipeline using Spark.

spark-data-pipeline

Elasticsearch Setup

i) Download Elasticsearch 6.3.0 or a later version and unzip it.

ii) Start Elasticsearch by running the following command from the unzipped directory.

    $ bin/elasticsearch
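
Elasticsearch can take a few seconds to start, so it may help to confirm the node is answering before running the pipeline. The helper below is a minimal sketch, not part of this project: the function name `wait_for_es` is hypothetical, and it assumes a local node on the default HTTP port 9200 with `curl` installed.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this repository): poll Elasticsearch over
# HTTP until it responds, or give up after a timeout.
# Assumptions: a local node on the default port 9200, and curl on the PATH.
wait_for_es() {
  local url="${1:-http://localhost:9200}"   # base URL of the node (assumed default)
  local timeout="${2:-60}"                  # seconds to wait before giving up
  local waited=0

  # curl --fail exits non-zero on connection errors and HTTP error statuses.
  until curl --silent --fail --output /dev/null --max-time 2 "$url"; do
    if [ "$waited" -ge "$timeout" ]; then
      echo "Elasticsearch did not respond at $url within ${timeout}s" >&2
      return 1
    fi
    sleep 1
    waited=$((waited + 1))
  done

  echo "Elasticsearch is up at $url"
}
```

One possible use is to source the file and chain the check with the run step, for example `wait_for_es && sbt run`, so the job only starts once the node responds.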

Getting Started:

Clone and run in local mode:

    $ git clone git@github.com:techmonad/spark-data-pipeline.git
    $ cd spark-data-pipeline
    $ sbt run