Skip to content
View zhoushaw's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@web-infra-dev

Block or report zhoushaw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An extremely fast Python package and project manager, written in Rust.

Rust 45,718 1,289 Updated Mar 23, 2025

Empower the Web community and invite more to build across platforms.

C++ 10,857 341 Updated Mar 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,477 530 Updated Mar 23, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,358 806 Updated Mar 1, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,266 264 Updated Mar 20, 2025

Integrate the DeepSeek API into popular softwares

29,807 3,234 Updated Mar 21, 2025

Let AI be your browser operator.

HTML 7,162 399 Updated Mar 21, 2025

The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

Python 3,232 227 Updated Mar 22, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 42,763 5,865 Updated Mar 21, 2025

A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.

TypeScript 4,284 312 Updated Mar 23, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,408 901 Updated Mar 20, 2025

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,315 152 Updated Mar 3, 2025

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,431 427 Updated May 29, 2024

End-to-end stack for WebRTC. SFU media server and SDKs.

Go 12,027 1,051 Updated Mar 22, 2025

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

TypeScript 31,299 5,775 Updated Mar 19, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,580 4,311 Updated Mar 23, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,443 142 Updated Mar 10, 2025

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 67,468 11,446 Updated Jul 30, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 88,147 23,662 Updated Mar 23, 2025

PyTorch Tutorial for Deep Learning Researchers

Python 30,998 8,186 Updated Aug 15, 2023

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,706 203 Updated Mar 6, 2025

LLM全栈优质资源汇总

Shell 509 60 Updated Nov 25, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 15,567 1,806 Updated Mar 2, 2025

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,121 69 Updated Mar 13, 2025

The LLMs' framework optimized for ultra-fast response times.

TypeScript 64 4 Updated Mar 10, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,312 1,140 Updated Mar 14, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,961 632 Updated Mar 7, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,345 6,409 Updated Mar 23, 2025
Next
Showing results