apache-hadoop

Star

Here are 13 public repositories matching this topic...

kowaalczyk / spark-minimal-algorithms

Star

An python implementation of Minimal Mapreduce Algorithms for Apache Spark

python spark apache-spark algorithms python3 pyspark hadoop-mapreduce apache-hadoop minimal-algorithms

Updated Jun 22, 2020
Python

sawadogosalif / Big-Data-Technologies

Star

Big Data Technologies can be defined as software tools for analyzing, processing, and extracting data from an extremely complex and large data set with which traditional management tools can never deal

apache-spark apache-kafka apache-hive apache-hadoop apache-hbase pysark

Updated Apr 30, 2022
Python

unobatbayar / big-data-processing

Star

Learning Apache Hadoop for Big Data. Moreover, exploring Map Reduce, Apache Spark RDD, Distributed Processing and Stream Processing

big-data map-reduce apache-hadoop

Updated May 27, 2020
Python

Abdelhakim-gh / BigData_Project

Star

This project aims to establish a data streaming pipeline with storage, processing, and visualization

python github-api elasticsearch kibana apache-flink apache-kafka apache-hadoop

Updated Aug 12, 2024
Python

yycorcino / distributed-system-for-movie-recommendations

Star

Apache Spark with Apache Hadoop for Machine Learning Application

apache-spark google-cloud apache-hadoop

Updated Oct 15, 2024
Python

FayStatha / atds-project-NTUA-2021

Star

A project for Advanced Topics in Database Systems course of ECE, NTUA for fall semester of academic year 2020-2021.

apache-spark pyspark spark-sql apache-hadoop ntua-ece

Updated Mar 19, 2021
Python

felidsche / movie-recommender

Star

A movie recommendation system built using Apache Spark’s ML library

apache-spark recommender-system spark-mllib apache-hadoop

Updated Apr 14, 2021
Python

VikentiosVitalis / advanced_topics_in_database_systems

Star

Data Science Project - for 'Advanced Topics in Database Systems' M.Sc. Course ECE @ntua

python data-science big-data apache-spark pyspark apache-hadoop

Updated Jan 17, 2024
Python

hridayns / Big-Data-Apache-server-logs-analysis-using-Pig-and-Python

Star

Big Data – Apache server logs analysis using Pig and Python

python pig apache-pig logs-analysis apache-hadoop

Updated May 23, 2019
Python

on2e / ntua-atdb

Star

Advanced Topics in Databases course project - NTUA ECE - 2022-23

apache-spark pyspark spark-dataframes advanced-database apache-hadoop ntua-ece spark-rdd

Updated Mar 30, 2023
Python

felidsche / mail-spam-filter

Star

An email spam filter using Apache Spark’s ML library

apache-spark spark-ml apache-hadoop

Updated Apr 14, 2021
Python

esakik / data-engineering-essentials

Star

Samples related to data engineering, e.g. spark, embulk, airflow, etc.

apache-spark protocol-buffers amazon-emr data-engineering digdag fluentd apache-beam embulk apache-avro mrjob apache-airflow cloud-dataflow apache-hadoop cloud-dataproc

Updated Dec 8, 2022
Python

mohammadtavakoli78 / Cloud-Computing

Star

This is projects of Cloud Computing Course

docker kubernetes yarn hadoop docker-compose helm cloud-computing hdfs helm-charts statefulsets cloud-services helm-chart statefulset apache-hadoop

Updated Sep 2, 2022
Python

Improve this page

Add a description, image, and links to the apache-hadoop topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the apache-hadoop topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

apache-hadoop

Here are 13 public repositories matching this topic...

kowaalczyk / spark-minimal-algorithms

sawadogosalif / Big-Data-Technologies

unobatbayar / big-data-processing

Abdelhakim-gh / BigData_Project

yycorcino / distributed-system-for-movie-recommendations

FayStatha / atds-project-NTUA-2021

felidsche / movie-recommender

VikentiosVitalis / advanced_topics_in_database_systems

hridayns / Big-Data-Apache-server-logs-analysis-using-Pig-and-Python

on2e / ntua-atdb

felidsche / mail-spam-filter

esakik / data-engineering-essentials

mohammadtavakoli78 / Cloud-Computing

Improve this page

Add this topic to your repo