apache-hadoop

Star

Here are 6 public repositories matching this topic...

Guru107 / hadoop-small-files-merger

Star

A Spark application to merge small files on Hadoop

scala apache-spark avro text parquet apache-hadoop

Updated Sep 7, 2020
Scala

RBC-DSAI-IITM / DCEIL

Star

A fast, scalable and distributed community detection algorithm based on CEIL scoring function.

apache-spark community-detection apache-hadoop

Updated Jan 1, 2019
Scala

saitejavishalj / Hotspot-analysis-of-Geospatial-data

Star

Built a Large Scale Distributed Data Processing system for Streaming Analytics using Hadoop Ecosystem (Apache Spark and HDFS), in Cloud for real-time spatial analytics.

distributed-systems apache-spark hdfs data-analysis sparksql large-scale hadoop-ecosystem streaming-analytics apache-hadoop

Updated Jun 4, 2021
Scala

tspannhw / links

Star

Links

scala apache-spark sbt apache-hadoop

Updated Mar 12, 2018
Scala

shuuji3 / spark-ceph-connector

Star

🌟Spark Ceph Connector: Implementation of Hadoop Filesystem API for Ceph

spark apache-spark hadoop ceph apache-hadoop

Updated Aug 25, 2020
Scala

alextsaf / Advanced-Database-Systems-NTUA

Star

Apache Spark Analytics Queries Benchmarking

scala apache-spark data-analytics apache-hadoop

Updated Jan 26, 2023
Scala

Improve this page

Add a description, image, and links to the apache-hadoop topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the apache-hadoop topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

apache-hadoop

Here are 6 public repositories matching this topic...

Guru107 / hadoop-small-files-merger

RBC-DSAI-IITM / DCEIL

saitejavishalj / Hotspot-analysis-of-Geospatial-data

tspannhw / links

shuuji3 / spark-ceph-connector

alextsaf / Advanced-Database-Systems-NTUA

Improve this page

Add this topic to your repo