Skip to content
View leecoool's full-sized avatar

Block or report leecoool

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Good-lookin' diffs. Actually… nah… The best-lookin' diffs. 🎉

Perl 17,505 338 Updated Feb 5, 2025

Mamba SSM architecture

Python 14,216 1,238 Updated Jan 18, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,749 201 Updated Mar 4, 2025

Analyze computation-communication overlap in V3/R1.

923 118 Updated Mar 3, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,917 716 Updated Mar 13, 2025

A very fast and expressive template engine.

Python 10,661 1,637 Updated Mar 5, 2025

Fast and memory-efficient exact attention

Python 16,268 1,540 Updated Mar 13, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 31,622 2,936 Updated Mar 13, 2025

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,403 679 Updated Mar 13, 2025

The official Meta Llama 3 GitHub site

Python 28,500 3,315 Updated Jan 26, 2025

Fully open reproduction of DeepSeek-R1

Python 22,720 2,041 Updated Mar 13, 2025

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 6,231 533 Updated Mar 13, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,445 120 Updated Apr 17, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,128 1,045 Updated Mar 12, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 36,779 2,789 Updated Mar 13, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 9,999 1,784 Updated Mar 13, 2025

TensorFlow code and pre-trained models for BERT

Python 38,818 9,671 Updated Jul 23, 2024
Jupyter Notebook 121 16 Updated Mar 4, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,134 1,120 Updated Mar 13, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,982 5,378 Updated Mar 12, 2025

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 9,067 2,011 Updated Apr 16, 2024

QSBR and EBR library

C 118 20 Updated Dec 15, 2019

Master programming by recreating your favorite technologies from scratch.

Markdown 355,525 32,969 Updated Sep 3, 2024

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

JavaScript 40,882 3,926 Updated Mar 13, 2025

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 11,338 1,005 Updated Mar 13, 2025

Game Servers Management on Kubernetes

Go 602 76 Updated Mar 12, 2025

A best practices guide for day 2 operations, including operational excellence, security, reliability, performance efficiency, and cost optimization.

Python 2,090 518 Updated Mar 11, 2025
Next
Showing results