Skip to content
View KellHuang's full-sized avatar

Block or report KellHuang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

mash up of Wan2.1 + Meta Sapiens + Seaweed Diffusion APT for One-Step Video Generation if you have compute - call me

Python 45 5 Updated Mar 12, 2025

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

415 26 Updated Apr 11, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 27,497 1,831 Updated Apr 6, 2025
Python 4,617 313 Updated Apr 12, 2025

Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas…

Python 18,748 3,083 Updated Apr 2, 2025

自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili

Python 4,708 784 Updated Apr 12, 2025

Official Implementation of "KBLaM: Knowledge Base augmented Language Model"

Jupyter Notebook 1,238 98 Updated Apr 16, 2025

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 1,816 127 Updated Apr 17, 2025

Towards Human-Sounding Speech

Python 4,137 332 Updated Apr 16, 2025

YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel videos using AI agents. Extract insights from hours of content in seconds with semantic search and preci…

Python 415 50 Updated Mar 27, 2025

TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools

Python 418 63 Updated Apr 16, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,311 165 Updated Apr 14, 2025

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 5,171 419 Updated Apr 11, 2025

LTX-Video Support for ComfyUI

Python 966 79 Updated Mar 10, 2025

Spark-TTS Inference Code

Python 8,519 876 Updated Apr 9, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 43,455 7,456 Updated Apr 16, 2025

NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms

Python 939 98 Updated Apr 3, 2025

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 996 71 Updated Mar 29, 2025

一款提示词优化器,助力于编写高质量的提示词

TypeScript 3,869 426 Updated Apr 10, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 89,835 11,284 Updated Apr 16, 2025

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 43,208 5,572 Updated Apr 17, 2025

Various custom nodes for ComfyUI

Python 1,217 118 Updated Apr 13, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 11,147 757 Updated Apr 17, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 21,592 1,786 Updated Mar 26, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 10,024 1,093 Updated Apr 2, 2025

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,807 523 Updated Apr 7, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,438 710 Updated Apr 16, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,439 821 Updated Mar 1, 2025
Next
Showing results