Stars

Data

90 repositories

A DSL for data-driven computational pipelines

Groovy 3,315 775 Updated Mar 4, 2026

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 12,339 2,291 Updated Mar 5, 2026

Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various data sources. It is used to solve various data quality problems ca…

Java 763 312 Updated Jan 21, 2026

An orchestration platform for the development, production, and observation of data assets.

Python 15,055 2,008 Updated Mar 5, 2026

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 21,761 2,145 Updated Mar 5, 2026

JSON Hero is an open-source, beautiful JSON explorer for the web that lets you browse, search and navigate your JSON files at speed. 🚀. Built with 💜 by the Trigger.dev team.

TypeScript 10,607 637 Updated Nov 28, 2025

Search Google and download specific file types

Python 539 107 Updated Aug 30, 2025

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,613 3,509 Updated Mar 5, 2026

Data-Centric Pipelines and Data Versioning

Go 6,287 568 Updated Feb 3, 2025

A high-performance observability data pipeline.

Rust 21,422 2,025 Updated Mar 4, 2026

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 10,747 254 Updated Mar 2, 2026

Elyra extends JupyterLab with an AI centric approach.

Python 1,990 366 Updated Feb 9, 2026

High-Performance Serverless event and data processing platform

Go 5,682 557 Updated Mar 4, 2026

Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond

Java 2,071 900 Updated Mar 4, 2026

Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.

Java 2,294 1,741 Updated Feb 20, 2026

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 8,833 1,654 Updated Mar 4, 2026

A collective list of free APIs

Python 404,215 43,606 Updated Feb 19, 2026

A curated list of open source tools used in analytics platforms and data engineering ecosystem

441 52 Updated Mar 12, 2025

Grist is the evolution of spreadsheets.

TypeScript 10,731 543 Updated Mar 5, 2026

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 11,436 2,339 Updated Mar 4, 2026

The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL

Rust 6,241 496 Updated Mar 5, 2026

Streamlit — A faster way to build and share data apps.

Python 43,734 4,108 Updated Mar 5, 2026

AI + Data, online. https://vespa.ai

Java 6,811 700 Updated Mar 4, 2026

cuGraph - RAPIDS Graph Analytics Library

Cuda 2,132 345 Updated Mar 5, 2026

cuDF - GPU DataFrame Library

C++ 9,502 1,014 Updated Mar 5, 2026

cuML - RAPIDS Machine Learning Library

C++ 5,134 615 Updated Mar 4, 2026

Parallel computing with task scheduling

Python 13,756 1,849 Updated Mar 4, 2026

A terminal spreadsheet multitool for discovering and arranging data

Python 8,859 325 Updated Mar 5, 2026

💾 peer-to-peer sharing & live synchronization of files via command line

JavaScript 8,240 448 Updated May 7, 2023

World's largest Contributor driven code dataset | Used in Quark Search Engine, @OpenGenus IQ, OpenGenus Visual Project

C++ 13,714 3,683 Updated Oct 5, 2024