Skip to content

tech4242/docker-hadoop-hive-parquet

Repository files navigation

docker-hadoop-hive-parquet

This project will showcase how to spin up a Hadoop cluster with Hive in order to run SQL queries on Parquet files. Images for the nodes are based on https://hub.docker.com/u/bde2020 base images.

All of this makes more sense if you follow the link in the repository to the article on Medium :)