A Delta Lake reader for Dask
-
Updated
Jul 17, 2024 - Python
A Delta Lake reader for Dask
A Python library for fast, interactive geospatial vector data visualization in Jupyter.
This application enables users to create and open SQLite databases, create tables, load data from json, csv and Parquet files, display table contents, and drop tables as needed.
Simple Parquet Viewer app based on the PyQT 5 GUI and DuckDB, PyArrow for data manipulations
Data Science Web App Streamlit: Analyzing Motor Vehicle Crashes from NYC
Sample implemention of different functions in django
Highly Open Workflow for Annotation & Ranking toward genomic variant Discovery
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Add a description, image, and links to the parquet topic page so that developers can more easily learn about it.
To associate your repository with the parquet topic, visit your repo's landing page and select "manage topics."