
Lists (29)
Sort Name ascending (A-Z)
3d
3dface
attention
audio_driven
cartoonify
dataperf
dataset
diffusion_model
discoart
🔮 Future ideas
GPT
julia
losses
medical_image
motion_capture
nerf
neural_differential_equations
probabilistic
quantization
similarity
sparse_training
spatiotemporal
text2image
text2motion
text2video
time_series_forecasting
unity
useful_tools
因缺思厅_cv
- All languages
- Assembly
- Batchfile
- BitBake
- C
- C#
- C++
- CSS
- Clojure
- Cuda
- Cython
- Dart
- Dockerfile
- Emacs Lisp
- Erlang
- Fortran
- GLSL
- Gherkin
- Go
- HTML
- Haskell
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- MLIR
- Makefile
- Markdown
- Objective-C
- Objective-C++
- OpenEdge ABL
- PHP
- Perl
- Python
- QML
- R
- Ruby
- Rust
- SCSS
- SWIG
- Scala
- Shell
- Svelte
- Swift
- TeX
- TypeScript
- V
- Vala
- Vue
Starred repositories
[ICASSP2023, GRSL2024, TGRS2025, ISPRS2025] semantic segmentation of remote sensing images
A Pytorch implement of medical image segmentation U-shape architecture benchmarks
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.…
No fortress, purely open ground. OpenManus is Coming.
[arXiv] The official code for "UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation".
Archon is an AI agent that is able to create other AI agents using an advanced agentic coding workflow and framework knowledge base to unlock a new frontier of automated agents.
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Toolkit for linearizing PDFs for LLM datasets/training
[CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Enjoy the magic of Diffusion models!
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
LHU-Net: A Light Hybrid U-Net for Cost-efficient, High-performance Volumetric Medical Image Segmentation
Wan: Open and Advanced Large-Scale Video Generative Models
📷 A composable image editor using Core Image and Metal.
FastVideo is a lightweight framework for accelerating large video diffusion models.
Genome modeling and design across all domains of life
SkyReels V1: The first and most advanced open-source human-centric video foundation model
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥