Trabajo para el curso de Hadoop, realizado por el Grupo 5
-
Updated
Aug 18, 2022 - Shell
Trabajo para el curso de Hadoop, realizado por el Grupo 5
ETL process which loads and transforms Medicare hospital data using Python and Hive
The project is based on Big-Data.In this particular project I have used twitter for sentiment analysis.
We build a Forex-currency rates pipeline to get currency rates from an external API and load the data into HDFS from where we use pyspark job to massage the data and insert it into a Hive table. The objective of this pipeline is to get the data ready for any downstream machine learning pipeline.
Add a description, image, and links to the hiveql topic page so that developers can more easily learn about it.
To associate your repository with the hiveql topic, visit your repo's landing page and select "manage topics."