A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
-
Updated
Jan 11, 2024 - Java
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Data Engineering Project with Hadoop HDFS and Kafka
MapReduce Python Example
A Hadoop Wordcounter Job - Retrieves tweets and runs a MapReduce wordcounter for sentimental analysis
Simulation of a Hadoop distributed file system
This is old repository from my archive . Hope this might help me in near future
Big Data project. Web client for HDFS. Working in the terminal. Has ability to manipulate local and Hadoop storage
WIP: hdfs/libhdfs drop-in replacements without Java
Map-Reduce paradigm in Apache Hadoop for KNN algorithm based on Kaggle Titanic Dataset
BigData Engineering Capstone Project with Tech-stack : Linux, MySQL, sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, git
Rust MiniDFS (local HDFS) Testcontainer
Yelp data analysis using HBase Java API and building a QA application
News Sentiment Analysis using ETL pipeline
The Ararajuba script aims to identify whether Optimized Row Columnar files in the Hadoop Distributed File System are corrupted, for this purpose it uses the count method and analyzes the difference in schemas in the tables.
Add a description, image, and links to the hdfs-dfs topic page so that developers can more easily learn about it.
To associate your repository with the hdfs-dfs topic, visit your repo's landing page and select "manage topics."