Skip to content
View goober's full-sized avatar

Block or report goober

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

JavaScript 5,117 250 Updated Apr 14, 2025

Create an issue on FireDucks

Jupyter Notebook 817 28 Updated Mar 22, 2025

Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.

Rust 564 38 Updated Apr 15, 2025

Home of the Open Data Contract Standard (ODCS).

Ruby 473 52 Updated Apr 1, 2025

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

Python 3,465 250 Updated Apr 15, 2025

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,286 170 Updated Apr 13, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 27,479 5,618 Updated Apr 12, 2025

Apache PyIceberg

Python 678 265 Updated Apr 15, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 17,860 4,442 Updated Apr 15, 2025

DuckDB-powered data lake analytics from Postgres

Rust 522 23 Updated Mar 19, 2025

sqlfmt formats your dbt SQL files so you don't have to

Python 447 18 Updated Apr 1, 2025

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python 2,233 200 Updated Apr 15, 2025

The data-validation toolkit for enhanced dbt (data build tool) PR review

Python 335 12 Updated Apr 15, 2025

Useful macros when performing data audits

356 43 Updated Jan 23, 2025

Code review for data in dbt

Python 487 23 Updated Jan 3, 2025

dbt adapter for SQL Server and Azure SQL

Python 227 106 Updated Mar 31, 2025

A curated list of awesome ETL frameworks, libraries, and software.

3,385 353 Updated Jul 23, 2024

Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.

Go 546 43 Updated Apr 14, 2025

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 41,590 5,502 Updated Apr 15, 2025

A curated list of data engineering tools for software developers

7,292 1,314 Updated Apr 7, 2025

A library for generating fake data in Rust.

Rust 1,089 110 Updated Apr 11, 2025

Dagster Labs' open-source data platform, built with Dagster.

Python 342 25 Updated Apr 14, 2025

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Java 1,425 218 Updated Apr 14, 2025

An orchestration platform for the development, production, and observation of data assets.

Python 12,938 1,647 Updated Apr 15, 2025

A modular SQL linter and auto-formatter with support for multiple dialects and templated code.

Python 8,780 798 Updated Apr 14, 2025

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java 1,181 148 Updated Apr 15, 2025

End-to-end encrypted platform for photos, videos and 2FA secrets.

Dart 18,810 1,048 Updated Apr 15, 2025

Embedded property graph database built for speed. Vector search and full-text search built in. Implements Cypher.

C++ 2,120 140 Updated Apr 15, 2025

Custom Dashboards for Beancount in Fava

TypeScript 217 26 Updated Feb 12, 2025
Next
Showing results