Skip to content
View xuwenbao's full-sized avatar
  • Chengdu, Sichuan, China

Block or report xuwenbao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
TypeScript 491 64 Updated Mar 7, 2025

Synthetic data curation for post-training and structured data extraction

Python 935 64 Updated Mar 7, 2025

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,533 181 Updated Mar 3, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 11,555 1,172 Updated Mar 7, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 8,689 571 Updated Mar 7, 2025

🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

Python 2,709 119 Updated Mar 1, 2025

Cache-Augmented Generation: A Simple, Efficient Alternative to RAG

Python 1,070 158 Updated Feb 16, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 13,976 1,417 Updated Mar 7, 2025

Empowering RAG with a memory-based data interface for all-purpose applications!

Python 1,675 114 Updated Feb 25, 2025

deepseek思维树模式实现

Python 11 2 Updated Feb 7, 2025

Govern, Secure, and Optimize your AI Traffic. AI Gateway provides unified interface to all LLMs using OpenAI API format with a focus on performance and reliability. Built in Rust.

Rust 191 11 Updated Mar 7, 2025

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 23,409 3,571 Updated Mar 7, 2025

Build Multimodal AI Agents with memory, knowledge and tools. Simple, fast and model-agnostic.

Python 20,216 2,688 Updated Mar 7, 2025

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Swift / Ultralytics…

Python 1,092 77 Updated Mar 7, 2025

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 5,822 381 Updated Mar 7, 2025

The RedStone repository includes code for preparing extensive datasets used in training large language models.

Python 112 9 Updated Feb 10, 2025

Making large AI models cheaper, faster and more accessible

Python 40,554 4,478 Updated Mar 7, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 33,789 2,400 Updated Mar 7, 2025

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,736 145 Updated Jan 17, 2025

Build your own AI friend

C++ 7,816 1,366 Updated Mar 7, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 14,692 1,634 Updated Feb 23, 2025

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 19,602 4,718 Updated Mar 3, 2025

Customizable implementation of the self-instruct paper.

Python 1,039 71 Updated Mar 7, 2024

Open source annotation tool for machine learning practitioners.

Python 9,831 1,753 Updated Nov 22, 2024

A pure Rust Excel/OpenDocument SpreadSheets file reader: rust on metal sheets

Rust 1,854 167 Updated Jan 31, 2025

A text extraction library supporting PDFs, images, office documents and more

Python 1,553 50 Updated Mar 7, 2025
23 3 Updated Feb 7, 2025

📄 A curated list of awesome .cursorrules files

12,179 828 Updated Jan 29, 2025
Next
Showing results