KellHuang

Follow

KellHuang KellHuang

Follow

3 followers · 2 following

Lists (2)

Sort

Hallo

🚀 My stack

Stars

johndpope / OmniHuman-1-hack

mash up of Wan2.1 + Meta Sapiens + Seaweed Diffusion APT for One-Step Video Generation if you have compute - call me

Python 45 5 Updated Mar 12, 2025

Fantasy-AMAP / fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

415 26 Updated Apr 11, 2025

ocrmypdf / OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 27,497 1,831 Updated Apr 6, 2025

bytedance / MegaTTS3

Python 4,617 313 Updated Apr 12, 2025

microsoft / qlib

Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas…

Python 18,748 3,083 Updated Apr 2, 2025

dreammis / social-auto-upload

自动化上传视频到社交媒体：抖音、小红书、视频号、tiktok、youtube、bilibili

Python 4,708 784 Updated Apr 12, 2025

microsoft / KBLaM

Official Implementation of "KBLaM: Knowledge Base augmented Language Model"

Jupyter Notebook 1,238 98 Updated Apr 16, 2025

aigc3d / LHM

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 1,816 127 Updated Apr 17, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 4,137 332 Updated Apr 16, 2025

wassim249 / YT-Navigator

YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel videos using AI agents. Extract insights from hours of content in seconds with semantic search and preci…

Python 415 50 Updated Mar 27, 2025

mims-harvard / TxAgent

TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools

Python 418 63 Updated Apr 16, 2025

GuijiAI / HeyGem.ai

C 6,999 1,187 Updated Apr 16, 2025

hkchengrex / MMAudio

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,311 165 Updated Apr 14, 2025

nanobrowser / nanobrowser

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 5,171 419 Updated Apr 11, 2025

Lightricks / ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

Python 966 79 Updated Mar 10, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 8,519 876 Updated Apr 9, 2025

mannaandpoem / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 43,455 7,456 Updated Apr 16, 2025

ElectricAlexis / NotaGen

NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms

Python 939 98 Updated Apr 3, 2025

THUDM / CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 996 71 Updated Mar 29, 2025

linshenkx / prompt-optimizer

一款提示词优化器，助力于编写高质量的提示词

TypeScript 3,869 426 Updated Apr 10, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 89,835 11,284 Updated Apr 16, 2025

oobabooga / text-generation-webui

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 43,208 5,572 Updated Apr 17, 2025

kijai / ComfyUI-KJNodes

Various custom nodes for ComfyUI

Python 1,217 118 Updated Apr 13, 2025

kijai / ComfyUI-WanVideoWrapper

Python 2,011 112 Updated Apr 16, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 11,147 757 Updated Apr 17, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 21,592 1,786 Updated Mar 26, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 10,024 1,093 Updated Apr 2, 2025

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,807 523 Updated Apr 7, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,438 710 Updated Apr 16, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

C++ 11,439 821 Updated Mar 1, 2025