- Salesforce
- Seattle, WA
- (UTC -08:00)
- http://www.jakeswenson.com
- @jakes.io
- @jakeswenson
Data
Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks.
Rust-based WebAssembly bindings to read and write Apache Parquet data
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
A RocksDB compliant high performance scalable embedded key-value store
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
Apache DataFusion Comet Spark Accelerator
Extremely fast Query Engine for DataFrames, written in Rust
Embeddable stream processing engine based on Apache DataFusion
🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.
Distributed stream processing engine in Rust
Fast web applications through dynamic, partially-stateful dataflow
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL
Apache DataFusion Ballista Distributed Query Engine
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
The lightweight, fault-tolerant database built on SQLite. Designed to keep your data highly available with minimal effort.





