Stars
BlazingSQL is a lightweight, GPU-accelerated SQL engine for Python, built on RAPIDS cuDF.
A system-level, binary package and environment manager running on all major operating systems and platforms.
A multi-tenant server for securely deploying and managing Dask clusters.
Azure Blob Storage FileSystem backend for Dask
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Python library for configuring a package, including defaults, environment variable loading, and YAML loading.
Easy-to-run example notebooks for Dask
⚡️ Optimizing einsum functions in NumPy, TensorFlow, Dask, and more with contraction order optimization.
Exploring the public Chicago crimes data set in Python
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.
Submit and execute distributed computations. A dask.distributed scheduler and Dispatcher.jl integration.
Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance.
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning al…
Pythonic file-system interface for Google Cloud Storage
Python Sorted Collections Library
Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set
Python client library for Mesos Marathon's REST API
A framework for out-of-core and parallel execution
Wrapper around `fitsio.ImageHDU` creating a `dask.Array`
Interactive Data Visualization in the browser, from Python
Functional persistent data structures for Python
The tool for managing conda-forge feedstocks.
A wrapper for libhdfs3 to interact with HDFS from Python