Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
Updated Apr 22, 2024 - Scala
Byzer (formerly MLSQL): A low-code, open-source programming language for data pipelines, analytics, and AI.
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Lightweight real-time big data streaming engine over Akka
Apache Spark Course Material
Apache Spark 3 - Structured Streaming Course Material
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
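A comprehensive walkthrough of the basic algorithms in Spark MLlib, the machine learning library of the Spark big data framework, with a full set of test files included.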
A free, open-source, web-based self-service BI tool tailor-made for ClickHouse, Google BigQuery, MySQL, PostgreSQL, and Vertica
Intelligent Data Exploration Service: a one-stop Data + AI data solution.
A standalone Apache Spark application that uses the Spark API in Scala. The project is built with Simple Build Tool (SBT).
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.