Stars
PyTorch implementation of gated PixelCNN
A naive implementation of PixelCNN in PyTorch, as described in A. van den Oord et al.
[CVPR 2025] StdGEN: Semantic-Decomposed 3D Character Generation from Single Images
The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"
Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields
SpatialLM: Large Language Model for Spatial Understanding
[Support 0.47.x](Reset Cursor AI MachineID & Auto Sign Up / In) Automatically registers Cursor AI, automatically resets the machine ID, and unlocks Pro features for free: You've reached your trial request limit. / Too many free trial accounts used on this machine. …
Make websites accessible for AI agents
OptiScaler bridges upscaling/frame gen across GPUs. Supports DLSS2+/XeSS/FSR2+ inputs, replaces native upscalers, enables FSR3 FG on non-FG titles. Supports Nukem mod for DLSSG-to-FSR3 FG.
No fortress, purely open ground. OpenManus is Coming.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Wan: Open and Advanced Large-Scale Video Generative Models
This is a book on the Phi family of SLMs for getting started with Phi models. Phi is a family of open-source AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…
Open source RGB lighting control that doesn't depend on manufacturer software. Supports Windows, Linux, MacOS. Mirror of https://gitlab.com/CalcProgrammer1/OpenRGB. Releases can be found on GitLab.
SkyReels V1: The first and most advanced open-source human-centric video foundation model
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Janus-Series: Unified Multimodal Understanding and Generation Models
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
A set of nodes to edit videos using the Hunyuan Video model
A system daemon to allow session software to update firmware
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
Helper application for Linux distributions serving as a kind of "entry point" for running and integrating AppImages
Testing baseline LLM performance across various models
Efficient vision foundation models for high-resolution generation and perception.
Support for miscellaneous image models. Currently supports: DiT, PixArt, HunYuanDiT, MiaoBi, and a few VAEs.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer