apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about today.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
A better build tool for Java, Scala and Kotlin: 3-6x faster than Maven or Gradle, less fiddling with plugins, and more easily explorable in your IDE
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Open-source high-performance RISC-V processor
Rocket Chip Generator
An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more
An open protocol for secure data sharing
♞ lichess.org: the forever free, adless and open source chess server ♞
Source code for the X Recommendation Algorithm
TheHive: a Scalable, Open Source and Free Security Incident Response Platform
Spark: The Definitive Guide's Code Repository
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
The Daml smart contract language
Removes large or troublesome blobs like git-filter-branch does, but faster, and written in Scala
The Scala 3 compiler, also known as Dotty.
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
Open-source code analysis platform for C/C++/Java/Binary/JavaScript/Python/Kotlin based on code property graphs. Discord: https://discord.gg/vv4MH284Hc
Apache DataFusion Comet Spark Accelerator
Hybrid search engine, combining best features of text and semantic search worlds
Reference applications for funding, operating, and incentivizing the use of a decentralized, public Canton synchronizer. Includes the Amulet reference application for creating native payment utilities for Canton synchronizers and Daml applications.
ZIO — A type-safe, composable library for async and concurrent programming in Scala