Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video and of generating speech in real time.

Jupyter Notebook 1,418 77 Updated Mar 28, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 6,164 586 Updated Mar 27, 2025

Make websites accessible for AI agents

Python 49,902 5,220 Updated Mar 28, 2025

Fully open reproduction of DeepSeek-R1

Python 23,447 2,132 Updated Mar 28, 2025

Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"

Python 1,453 65 Updated Mar 19, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 36,268 6,161 Updated Mar 28, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,385 811 Updated Mar 1, 2025

[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Python 80 2 Updated Mar 18, 2025

A programming framework for agentic AI 🤖 (PyPI: autogen-agentchat; Discord: https://aka.ms/autogen-discord; Office Hour: https://aka.ms/autogen-officehour)

Python 42,327 6,328 Updated Mar 28, 2025

Efficient Triton Kernels for LLM Training

Python 4,741 286 Updated Mar 28, 2025

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…

Jupyter Notebook 7,465 1,168 Updated Mar 24, 2025

Speech To Speech: an effort toward an open-source, modular GPT-4o

Python 3,932 426 Updated Mar 5, 2025

Structured Text Generation

Python 11,165 575 Updated Mar 26, 2025

Real-time face swap and one-click video deepfake using only a single image

Python 48,604 7,135 Updated Mar 28, 2025

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 7,176 497 Updated Jan 3, 2025

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …

Python 1,202 104 Updated Mar 18, 2025

Real-time speech-to-text with translation into Chinese

Python 14 3 Updated Oct 19, 2023

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 9,202 936 Updated Mar 28, 2025

Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis

Python 1,488 145 Updated Jul 29, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,977 6,528 Updated Mar 28, 2025

WisdoMentor series: an LLM for undergraduates | 博导智言 (a learning assistant for university students)

Python 11 Updated May 9, 2024

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Jupyter Notebook 1,879 114 Updated Sep 18, 2024

Create Music in Seconds with SunoAPI.

Python 1,608 247 Updated Mar 13, 2025

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,353 874 Updated Dec 10, 2024

Repository for DreaMoving-Phantom: https://www.modelscope.cn/studios/vigen/DreaMoving_Phantom/summary. DreaMoving-Phantom is a general, automatic framework for image enhancement and super-resolution.

Python 133 10 Updated Feb 2, 2024
Python 3,877 253 Updated Mar 15, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 25,878 2,493 Updated Mar 27, 2025

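Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments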

Python 21,215 6,147 Updated Mar 23, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 28,238 3,994 Updated Nov 24, 2024