Skip to content

Latest commit

 

History

History
40 lines (40 loc) · 6.56 KB

instance-segmentation.md

File metadata and controls

40 lines (40 loc) · 6.56 KB

Instance Segmentation

  • (CVPR'21) End-to-End Video Instance Segmentation with Transformers, [Paper], [Code]
  • (arXiv 2021.04) ISTR: End-to-End Instance Segmentation with Transformers, [Paper], [Code]
  • (arXiv 2021.08) SOTR: Segmenting Objects with Transformers, [Paper], [Code]
  • (arXiv 2021.12) SeqFormer: a Frustratingly Simple Model for Video Instance Segmentation, [Paper], [Code]
  • (arXiv 2021.12) A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation, [Paper]
  • (arXiv 2021.12) SOIT: Segmenting Objects with Instance-Aware Transformers, [Paper], [Code]
  • (arXiv 2022.03) Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer, [Paper], [Code]
  • (arXiv 2022.04) Temporally Efficient Vision Transformer for Video Instance Segmentation, [Paper], [Code]
  • (arXiv 2022.04) Less than Few: Self-Shot Video Instance Segmentation, [Paper]
  • (arXiv 2022.06) Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention, [Paper]
  • (arXiv 2022.06) Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation, [Paper], [Code]
  • (arXiv 2022.07) OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers, [Paper], [Code]
  • (arXiv 2022.07) In Defense of Online Models for Video Instance Segmentation, [Paper], [Code]
  • (arXiv 2022.07) Video Mask Transfiner for High-Quality Video Instance Segmentation, [Paper], [Project]
  • (arXiv 2022.08) InstanceFormer: An Online Video Instance Segmentation Framework, [Paper], [Code]
  • (arXiv 2022.09) RNGDet++: Road Network Graph Detection by Transformer with Instance Segmentation and Multi-scale Features Enhancement, [Paper], [Code]
  • (arXiv 2022.10) AISFormer: Amodal Instance Segmentation with Transformer, [Paper], [Code]
  • (arXiv 2022.10) TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation, [Paper], [Code]
  • (arXiv 2022.11) Mean Shift Mask Transformer for Unseen Object Instance Segmentation, [Paper], [Code]
  • (arXiv 2022.11) Transformer for 3D Scene Instance Segmentation, [Paper], [Code]
  • (arXiv 2023.01) Vision Transformers Are Good Mask Auto-Labelers, [Paper], [Code]
  • (arXiv 2023.01) Towards Robust Video Instance Segmentation with Temporal-Aware Transformer, [Paper]
  • (arXiv 2023.03) MobileInst: Video Instance Segmentation on the Mobile, [Paper]
  • (arXiv 2023.04) DynaMITe: Dynamic Query Bootstrapping for Multi-object Interactive Segmentation Transformer, [Paper]
  • (arXiv 2023.04) Vision Transformers Are Good Mask Auto-Labelers, [Paper], [Code]
  • (arXiv 2023.06) CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation, [Paper], [Code]
  • (arXiv 2023.08) Partitioned Saliency Ranking with Dense Pyramid Transformers, [Paper], [Code]
  • (arXiv 2023.08) Exploring Transformers for Open-world Instance Segmentation, [Paper]
  • (arXiv 2023.08) Mask Frozen-DETR: High Quality Instance Segmentation with One GPU, [Paper]
  • (arXiv 2023.08) A Unified Query-based Paradigm for Camouflaged Instance Segmentation, [Paper], [Code]
  • (arXiv 2023.08) NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation, [Paper]
  • (arXiv 2023.09) Mask-Attention-Free Transformer for 3D Instance Segmentation, [Paper], [Code]
  • (arXiv 2023.09) TCOVIS: Temporally Consistent Online Video Instance Segmentation, [Paper], [Code]
  • (arXiv 2023.09) 3D Indoor Instance Segmentation in an Open-World, [Paper], [Code]
  • (arXiv 2023.10) MSFormer: A Skeleton-multiview Fusion Method For Tooth Instance Segmentation, [Paper]
  • (arXiv 2023.12) PartSLIP++: Enhancing Low-Shot 3D Part Segmentation via Multi-View Instance Segmentation and Maximum Likelihood Estimation, [Paper],[Code]
  • (arXiv 2023.12) EipFormer: Emphasizing Instance Positions in 3D Instance Segmentation, [Paper]
  • (arXiv 2024.03) ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation, [Paper],[Code]
  • (arXiv 2024.04) Efficient Transformer Encoders for Mask2Former-style models, [Paper]