-
Dji Innovation, Tencent, Sun Yat-sen University
- Shenzhen, China
- kiwi-fung.win
Stars
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
Official Tensorflow implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection" (AAAI 2022 Oral)
Source Code of our CVPR2021 paper "Rethinking BiSeNet For Real-time Semantic Segmentation"
[CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation
[CVPR 2025] VCR: Learning Appearance-Invariant Representations for Open-World Instance Segmentation
[CVPR2024] Code for "SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation".
SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence
FFNet: MetaMixer-based Efficient Convolutional Mixer Design
Real-Time High-Resolution Background Matting
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
[ICLR 2023 & IJCV 2025] SeaFormer: Squeeze-enhanced Axial Transformer
A list of papers, codes and applications on multi-task learning.
PytorchAutoDrive: Segmentation models (ERFNet, ENet, DeepLab, FCN...) and Lane detection models (SCNN, RESA, LSTR, LaneATT, BézierLaneNet...) based on PyTorch with fast training, visualization, ben…
UniDrive: Towards Universal Driving Perception Across Camera Configurations
A method to increase the speed and lower the memory footprint of existing vision transformers.
[ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization"
Daily feed of the latest Computer Vision research papers from https://arxiv.org.
Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]
AI Image Signal Processing and Computational Photography. Official library for NTIRE (CVPR) and AIM (ICCV/ECCV) Challenges. You will find Learned ISPs, RAW Restoration-Upsampling-Reconstruction, Im…
Official repository of paper titled "CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications"