
Sponsoring

@python

Organizations

@ServiceNow @junlplab

Starred repositories

A minimal shared memory object store design

C 50 9 Updated Oct 29, 2016

koding with any LLMs

JavaScript 896 324 Updated Mar 8, 2025

Model Context Protocol Servers

JavaScript 14,295 1,497 Updated Mar 3, 2025

Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.

Python 729 82 Updated Mar 7, 2025

A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your ML and analytics workloads.

Python 195 31 Updated Mar 9, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,714 681 Updated Mar 8, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,005 329 Updated Mar 5, 2025

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,646 1,226 Updated Mar 8, 2025

PyTorch native quantization and sparsity for training and inference

Python 1,887 229 Updated Mar 8, 2025

Umami is a simple, fast, privacy-focused alternative to Google Analytics.

TypeScript 25,116 4,616 Updated Mar 8, 2025

Simple replication of DPR (Dense Passage Retrieval)

Python 45 4 Updated Nov 10, 2023

A toolkit for building dense retrievers with deep language models.

Python 57 4 Updated Sep 24, 2021

🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models

Jupyter Notebook 2,847 341 Updated Feb 13, 2025

A Python library to inspect and modify the internal structure of a PDF file

Python 977 27 Updated Mar 9, 2025

🌎 Python library to access all public routing, isochrones and matrix APIs in a consistent manner.

Python 292 31 Updated Nov 22, 2024

Apache Doris is an easy-to-use, high performance and unified analytics database.

Java 13,286 3,412 Updated Mar 9, 2025

Apache Pinot - A realtime distributed OLAP datastore

Java 5,659 1,337 Updated Mar 7, 2025

Apache Ignite

Java 4,890 1,910 Updated Mar 6, 2025

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and bat…

Java 2,236 750 Updated Mar 7, 2025

Deep Learning for humans

Python 62,668 19,534 Updated Mar 8, 2025

Improve keyboard comfort and usability with advanced customization

Rust 4,413 161 Updated Mar 8, 2025

Integrate LLMs in any pipeline - fit/predict pattern, JSON-driven flows, and built-in concurrency support.

Python 575 41 Updated Mar 7, 2025

Serving Inside Pytorch

C++ 155 13 Updated Mar 9, 2025

OpenHealth, AI Health Assistant | Powered by Your Data

TypeScript 3,130 304 Updated Mar 6, 2025

Toolkit to run Python benchmarks

Python 838 82 Updated Mar 4, 2025

The httplib2 caching algorithms packaged up for use with requests.

Python 483 126 Updated Mar 7, 2025

Generate large synthetic data using an LLM

Python 389 32 Updated Mar 7, 2025

Customizable implementation of the self-instruct paper.

Python 1,039 71 Updated Mar 7, 2024

Synthetic data generators for tabular and time-series data

Jupyter Notebook 1,510 249 Updated Feb 28, 2025

The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

Python 3,106 212 Updated Mar 9, 2025