Skip to content
View bobwenx's full-sized avatar

Block or report bobwenx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

Jupyter Notebook 3,305 333 Updated Jun 7, 2025

A Fast Key-Value Storage Engine Based on Hierarchical B+-Tree Trie

C++ 1,318 175 Updated Apr 11, 2024

国外互联网公司大数据技术架构研究

19 15 Updated Jan 11, 2021

DataX是阿里云DataWorks数据集成的开源版本。

Java 16,590 5,558 Updated May 27, 2025

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

Java 4,257 581 Updated Jun 18, 2025

Dapr is a portable runtime for building distributed applications across cloud and edge, combining event-driven architecture with workflow orchestration.

Go 24,837 1,966 Updated Jun 18, 2025

Temporal service

Go 14,520 997 Updated Jun 19, 2025

Continuous Profiling Platform. Debug performance issues down to a single line of code

Go 10,652 653 Updated Jun 18, 2025

🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.

Rust 4,938 516 Updated Jun 18, 2025

Nydus - the Dragonfly image service, providing fast, secure and easy access to container images.

Rust 1,334 224 Updated Jun 18, 2025

Machine Learning Pipelines for Kubeflow

Python 3,862 1,747 Updated Jun 18, 2025

Distributed ML Training and Fine-Tuning on Kubernetes

Python 1,814 784 Updated Jun 11, 2025

Machine Learning Toolkit for Kubernetes

TypeScript 15,046 2,523 Updated Jun 4, 2025

A unified framework for privacy-preserving data analysis and machine learning

Python 2,462 441 Updated Jun 18, 2025

OpenTelemetry Collector

Go 5,317 1,653 Updated Jun 19, 2025

A fast and efficient cloud native application runtime

Go 837 175 Updated Jun 3, 2025

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,370 540 Updated Jun 19, 2025

🔥 Seata is an easy-to-use, high-performance, open source distributed transaction solution.

Java 25,673 8,833 Updated Jun 18, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 37,597 6,464 Updated Jun 19, 2025

GLake: optimizing GPU memory management and IO transmission.

Python 467 41 Updated Mar 24, 2025

🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).

Rust 15,584 592 Updated Jun 19, 2025

Lightweight Kubernetes

Go 29,975 2,456 Updated Jun 18, 2025

The Kubernetes Package Manager

Go 28,024 7,259 Updated Jun 17, 2025

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 16,795 2,311 Updated Jun 19, 2025

Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.

Rust 2,760 217 Updated May 26, 2025

HoloInsight is a cloud-native observability platform with a special focus on real-time log analysis and AI integration.

Java 341 70 Updated May 27, 2025

Kubebuilder - SDK for building Kubernetes APIs using CRDs

Go 8,503 1,559 Updated Jun 18, 2025

CLI tool for spawning and running containers according to the OCI specification

Go 12,454 2,190 Updated Jun 19, 2025

Storage Performance Development Kit

C 3,261 1,244 Updated Jun 18, 2025

Data Plane Development Kit

C 3,756 1,316 Updated Jun 18, 2025
Next
Showing results