Data Engineering Project with Hadoop HDFS and Kafka
Updated Nov 4, 2023 - Python
Big Data engineering capstone project. Tech stack: Linux, MySQL, Sqoop, HDFS, Hive, Impala, Spark SQL, Spark ML, Git
Simulation of a Hadoop distributed file system
MapReduce Python Example
A Hadoop word-count job: retrieves tweets and runs a MapReduce word counter for sentiment analysis
Big Data project: a web client for HDFS that runs in the terminal and can manipulate both local and Hadoop storage
KNN algorithm implemented with the MapReduce paradigm in Apache Hadoop, using the Kaggle Titanic dataset