Apache Hadoop HDFS operator for the Kubernetes Data Stack
Go package to read and write Parquet files. Parquet is a file format that stores nested data structures in a flat columnar layout. It is used in the Hadoop ecosystem and by tools such as Presto and AWS Athena.
Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs
Prometheus exporter of Hadoop JMX metrics
Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR
A configuration option helper for Hadoop: fuzzy-find the option you are looking for.
Kubernetes operator for managing the lifecycle of Apache Hadoop Yarn Tasks on Kubernetes.
Export Hadoop YARN (resource-manager) metrics in prometheus format
📓 Solutions to the Stepik course "Hadoop. Система для обработки больших объемов данных" ("Hadoop: a system for processing large volumes of data")
☁ Batch-processing Word/Letter Count application with a custom Kubernetes scheduler
A parallel cloud computing framework based on the core principles of Apache Hadoop.
Yarn on Docker - Managing Hadoop Yarn cluster with Docker Swarm.
This repo contains the code implementation of the paper "HDFS Heterogeneous Storage Resource Management based on Data Temperature"
An easy Hadoop deployment system