Uses tokenized query returned by python-sqlparse and generates query metadata
-
Updated
Nov 14, 2024 - Python
Uses tokenized query returned by python-sqlparse and generates query metadata
Real Time Streaming: Twitter Data Pipeline Using Big data Tools
This repository is an application developed in Flask as BackEnd for connecting the Jupyter Notebook to the Hive Server and execute the queries and displays the results back in the UI
BigData Engineering Capstone Project with Tech-stack : Linux, MySQL, sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, git
How to manage SCD2 with Apache Hive 1.1 and HBase 1.2 w/o HiveQL UPDATE operation
Análisis al Proyecto GDELT con herramientas bigdata basadas den hadoop en nube Microsoft Azure
A simple project on the use of map and reduce in Hadoop.
Processing and transforming data via Hadoop Ecosystem
Joining, Cleaning, Querying, Performing ETL on Twitter Posts Dataset.
Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered hive table performance comparison
This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.
Add a description, image, and links to the hiveql topic page so that developers can more easily learn about it.
To associate your repository with the hiveql topic, visit your repo's landing page and select "manage topics."