A minimal Python library for Apache Arrow, connecting to the Rust arrow crate
Updated Jul 2, 2024 - Rust
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and PyArrow, with more integrations coming.
GeoArrow in Rust, Python, and JavaScript (WebAssembly) with vectorized geometry operations
Read specialized NGS formats as data frames in R, Python, and more.
An experimental (work-in-progress) implementation of Apache Arrow
Rust-based WebAssembly bindings to read and write Apache Parquet data
Building block library for using Apache Arrow in Rust WebAssembly modules.
ModelarDB: Model-Based Time Series Management from Edge to Cloud
Interoperability between Polars and ClickHouse
🦖 Efficiently evolve your old fixed-length data files into more modern file formats, fully parallelized!
A collection of user-defined functions, from your favourite databases, in Apache DataFusion
Geospatial extensions for Polars
Rewriting SQLite in Rust for learning and fun, using the Apache Arrow and DataFusion ecosystem
Kusto client library optimized for data science workloads
A fast and simple command-line (CLI) tool to convert a Parquet file to an Apache Arrow file
HASH uses Apache Arrow within hEngine for in-memory columnar data representation and zero-copy reads
Query MongoDB via Apache Arrow and DataFusion