Standard Hadoop MapReduce Tasks using Java
Updated Nov 16, 2018 - Java
Collection of homework (mostly Spark-based) from the course "Big Data Computing" - University of Padua.
SUTD 2021 50.043 Database and Big Data Systems Code Dump
Flink SQL in Action - a blog column in Chinese
Fetch data from Twitter and push it through Kafka to Spark, then on to HDFS.
Large-scale data computing word count project
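Word count is the canonical MapReduce example: the map phase emits a `(word, 1)` pair for each token, and the reduce phase sums the counts per word. The following is a minimal plain-Java sketch of that logic, runnable without a Hadoop cluster; the class and method names are illustrative and not taken from any repository listed here.

```java
import java.util.Map;
import java.util.TreeMap;

// Plain-Java sketch of the MapReduce word-count logic.
// The tokenizing loop plays the role of the map phase (emit (word, 1));
// the merge() call stands in for the shuffle + reduce phase (sum per word).
public class WordCountSketch {
    public static Map<String, Integer> wordCount(String text) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String token : text.toLowerCase().split("\\W+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum); // reduce: sum counts
            }
        }
        return counts;
    }
}
```

In a real Hadoop job the same two steps live in a `Mapper` and a `Reducer` class, and the framework handles the shuffle between them.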
MapReduce job development, RDD programming, medical data management, sales analysis, and efficient data integration for big data analysis. Spark: big data processing, Sqoop integration, and Spark Structured Streaming for real-time data.
Reservoir sampling for group-by queries on the Flink platform, efficiently answering single-aggregate queries.
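Reservoir sampling keeps a uniform random sample of k items from a stream of unknown length in a single pass. Below is a minimal standalone sketch of the classic Algorithm R; the repository above applies this idea per group inside Flink, which is not shown here.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Sketch of reservoir sampling (Algorithm R): after the reservoir is full,
// the i-th item seen replaces a random slot with probability k/i, which
// leaves every item an equal chance of ending up in the final sample.
public class ReservoirSampler {
    public static List<Integer> sample(Iterable<Integer> stream, int k, long seed) {
        Random rnd = new Random(seed);  // seeded for reproducibility
        List<Integer> reservoir = new ArrayList<>(k);
        int seen = 0;
        for (int item : stream) {
            seen++;
            if (reservoir.size() < k) {
                reservoir.add(item);        // fill the reservoir first
            } else {
                int j = rnd.nextInt(seen);  // uniform in [0, seen)
                if (j < k) {
                    reservoir.set(j, item); // replace with probability k/seen
                }
            }
        }
        return reservoir;
    }
}
```

For a group-by variant, one such reservoir is kept per group key, so each group's aggregate can be estimated from its own fixed-size sample.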
Eskimo is a state-of-the-art big data infrastructure and management web console to build, manage, and operate Big Data 2.0 analytics clusters on Kubernetes. This is the Git repository of Eskimo Community Edition.
This repository contains an Apache Flink application for real-time sales analytics, built using Docker Compose to orchestrate the necessary infrastructure components, including Apache Flink, Elasticsearch, and Postgres.