Documented my learnings - how to perform DML operations in HIVE.
-
Updated
Oct 18, 2020 - HiveQL
Documented my learnings - how to perform DML operations in HIVE.
Performed sentiment analysis on Twitter data of 'Go' game - Google’s Alphago vs Se-Dol Lee. Utilized Hadoop HDFS via Oracle Cloud, HiveQL and Tableau.
Detailed about how we can dynamically load columns in HIVE using AVRO.
Streaming / Ingesting tweets using Flume into a hive data lake.
A HiveQL script with Hadoop/MapReduce Program to find out the most popular movies for different age groups.
Finding storage space requirement and data retrieval time for ORC and Parquet.
Apply and build analytical queries using Hive-HQL over large datasets, answer relevant questions in the data context
Created a simple web app which gives users a summary of the types of 311 requests in their Chicago neighborhood, built with Lambda Architecture principles using Apache's tech stack
For this project we studied 3 data sets revolving around neighborhoods in New York City. We hope to learn what neighborhoods in Brooklyn are good to live in
A repository for showcasing my knowledge of the HiveQL programming language, and continuing to learn the language
This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,
The HiveQL Programming language IDE submodule for SNU Programming Tools (2D Mode)
Add a description, image, and links to the hiveql topic page so that developers can more easily learn about it.
To associate your repository with the hiveql topic, visit your repo's landing page and select "manage topics."