This repository focuses on gathering and curating a list of resources to learn Hadoop for FREE.
Updated Jun 10, 2018 - Python
Slowly Changing Dimension type 2 (SCD2) in Hive query language using an exclusive join technique with ORC Hive tables, plus a performance comparison of partitioned and clustered Hive tables
Joining, cleaning, querying, and performing ETL on a Twitter posts dataset.
Processing and transforming data via the Hadoop ecosystem
A simple project on the use of map and reduce in Hadoop.
Analysis of the GDELT Project with Hadoop-based big data tools on the Microsoft Azure cloud
How to manage SCD2 with Apache Hive 1.1 and HBase 1.2 without the HiveQL UPDATE operation
Big Data Engineering capstone project with tech stack: Linux, MySQL, Sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, Git
This repository is a Flask application that serves as a backend connecting Jupyter Notebook to the Hive server; it executes queries and displays the results back in the UI
Real-time streaming: Twitter data pipeline using big data tools
Uses the tokenized query returned by python-sqlparse to generate query metadata