big-data
Here are 252 public repositories matching this topic...
CMAK is a tool for managing Apache Kafka clusters
-
Updated
Aug 2, 2023 - Scala
PredictionIO, a machine learning server for developers and ML engineers.
-
Updated
Jan 9, 2021 - Scala
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
-
Updated
May 27, 2024 - Scala
Simple and Distributed Machine Learning
-
Updated
May 23, 2024 - Scala
High performance data store solution
-
Updated
May 18, 2024 - Scala
Sparkling Water provides H2O functionality inside Spark cluster
-
Updated
May 27, 2024 - Scala
Project for James' Apache Spark with Scala course
-
Updated
Jul 6, 2020 - Scala
Geo Spatial Data Analytics on Spark
-
Updated
Aug 26, 2021 - Scala
An open protocol for secure data sharing
-
Updated
May 24, 2024 - Scala
A simplified, lightweight ETL Framework based on Apache Spark
-
Updated
Jan 24, 2024 - Scala
Apache Spark Course Material
-
Updated
Apr 21, 2023 - Scala
Low-code tool for automating actions on real time data | Stream processing for the users.
-
Updated
May 27, 2024 - Scala
Big Data Visualization
-
Updated
Jan 29, 2023 - Scala
PredictionIO Recommendation Engine Template (Scala-based parallelized engine)
-
Updated
May 31, 2019 - Scala
Apache Spark 3 - Structured Streaming Course Material
-
Updated
Sep 8, 2020 - Scala
Improve this page
Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."