rdd
Here are 29 public repositories matching this topic...
Sentiment Analysis and Data Visualization
-
Updated
May 20, 2018 - Python
Streaming data in Spark and doing data analytics
-
Updated
Sep 19, 2019 - Python
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
-
Updated
Jan 3, 2020 - Python
This project aims to more closely represent what happens in the brain by simulating a spiking neural net. It uses RDD to try and learn from the edge cases too!
-
Updated
May 29, 2020 - Python
In this simple project, I am playing with the data sets of the city of Montreal counting the number of neighborhoods finding the largest ones, their different types, and so on using RDDs.
-
Updated
Mar 18, 2021 - Python
CC4Spark enables to generate XES event logs from distributed data sources, and solve conformance checking alignment problems in distributed environments.
-
Updated
Jun 18, 2021 - Python
Project: Spark SQL & DataFrames - Course: Advanced Topics in Databases (9th semester) NTUA
-
Updated
Mar 24, 2022 - Python
PageRank - Pig vs PySpark comparison https://madoc.univ-nantes.fr/mod/assign/view.php?id=1511791
-
Updated
Oct 20, 2022 - Python
[ECE NTUA] Advanced Topics in Databases - Course project (2022-2023)
-
Updated
Mar 3, 2023 - Python
ECE NTUA Assignment
-
Updated
Mar 30, 2023 - Python
Repo to contain the assignments for DSCI 553: Foundations and Applications of Data Mining course at USC
-
Updated
May 5, 2023 - Python
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
-
Updated
Jun 26, 2023 - Python
Solved various big data problems using pySpark . Variety of Tranformations and Actions are applied on RDDs and Data-Frames to extract different insights from various Data-Sets which are very huge in file ranging in GBs.
-
Updated
Oct 26, 2023 - Python
Improve this page
Add a description, image, and links to the rdd topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rdd topic, visit your repo's landing page and select "manage topics."