A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

witnessai/Awesome-Open-Vocabulary-Object-Detection

Awesome-Open-Vocabulary-Object-Detection

Contact

scottn@foxmail.com

Papers

2023

  • Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Claudio Gennaro, Fabrizio Falchi. The Devil is in the Fine-Grained Details: Evaluating Open-Vocabulary Object Detectors for Fine-Grained Understanding. arXiv 2023. [paper]
  • MIC: Zhao Wang, Aoxue Li, Fengwei Zhou, Zhenguo Li, Qi Dou. Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization. BMVC 2023. [paper]
  • CoDet: Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi. CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection. NeurIPS 2023. [paper] [code]
  • DE-ViT: Xinyu Zhang, Yuting Wang, Abdeslam Boularias. Detect Every Thing with Few Examples. GCPR 2023. [paper] [code]
  • DITO: Dahun Kim, Anelia Angelova, Weicheng Kuo. Detection-Oriented Image-Text Pretraining for Open-Vocabulary Detection. arXiv 2023. [paper] [code]
  • CFM-ViT: Dahun Kim, Anelia Angelova, Weicheng Kuo. Contrastive Feature Masking Open-Vocabulary Vision Transformer. ICCV 2023. [paper]
  • EdaDet: Cheng Shi, Sibei Yang. EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment. ICCV 2023. [paper]
  • Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy. Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. ICCV 2023. [paper] [code]
  • Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng. What Makes Good Open-Vocabulary Detector: A Disassembling Perspective. KDD workshop 2023. [paper]
  • MMC-Det: Yifan Xu, Mengdan Zhang, Xiaoshan Yang, Changsheng Xu. Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection. arXiv 2023. [paper]
  • OVDEval: Yiyang Yao, Peng Liu, Tiancheng Zhao, Qianqian Zhang, Jiajia Liao, Chunxin Fang, Kyusong Lee, Qing Wang. How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection. arXiv 2023. [paper] [code]
  • SAS-Det: Shiyu Zhao, Samuel Schulter, Long Zhao, Zhixing Zhang, Vijay Kumar B. G, Yumin Suh, Manmohan Chandraker, Dimitris N. Metaxas. Improving Pseudo Labels for Open-Vocabulary Object Detection. arXiv 2023. [paper]
  • Chaoyang Zhu, Long Chen. A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future. arXiv 2023. [paper]
  • UOVN: Hengcan Shi, Munawar Hayat, Jianfei Cai. Unified Open-Vocabulary Dense Visual Prediction. arXiv 2023. [paper]
  • SGDN: Hengcan Shi, Munawar Hayat, Jianfei Cai. Open-Vocabulary Object Detection via Scene Graph Discovery. arXiv 2023. [paper]
  • OWL-ST: Matthias Minderer, Alexey Gritsenko, Neil Houlsby. Scaling Open-Vocabulary Object Detection. arXiv 2023. [paper]
  • Prannay Kaul, Weidi Xie, Andrew Zisserman. Multi-Modal Classifiers for Open-Vocabulary Object Detection. ICML 2023. [paper] [code]
  • OpenSeeD: Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang. A Simple Framework for Open-Vocabulary Segmentation and Detection. arXiv 2023. [paper] [code]
  • Relja Arandjelović, Alex Andonian, Arthur Mensch, Olivier J. Hénaff, Jean-Baptiste Alayrac, Andrew Zisserman. Three Ways to Improve Feature Alignment for Open Vocabulary Detection. arXiv 2023. [paper]
  • Prompt-OVD: Hwanjun Song, Jihwan Bang. Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection. arXiv 2023. [paper]
  • PCL: Han-Cheol Cho, Won Young Jhoo, Wooyoung Kang, Byungseok Roh. Open-Vocabulary Object Detection using Pseudo Caption Labels. arXiv 2023. [paper]
  • CORA: Xiaoshi Wu, Feng Zhu, Rui Zhao, Hongsheng Li. CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching. CVPR 2023. [paper] [code]
  • Luting Wang, Yi Liu, Penghui Du, Zihan Ding, Yue Liao, Qiaosong Qi, Biaolong Chen, Si Liu. Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection. CVPR 2023. [paper] [code]
  • BARON: Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy. Aligning Bag of Regions for Open-Vocabulary Object Detection. CVPR 2023. [paper] [code]
  • RO-ViT: Dahun Kim, Anelia Angelova, Weicheng Kuo. Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers. CVPR 2023. [paper] [code]
  • DetCLIPv2: Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu. DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CVPR 2023. [paper]
  • CondHead: Tao Wang. Learning to Detect and Segment for Open Vocabulary Object Detection. CVPR 2023. [paper]
  • F-VLM: Weicheng Kuo, Yin Cui, Xiuye Gu, AJ Piergiovanni, Anelia Angelova. F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models. ICLR 2023. [paper] [code]
  • VLDet: Chuang Lin, Peize Sun, Yi Jiang, Ping Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai. Learning Object-Language Alignments for Open-Vocabulary Object Detection. ICLR 2023. [paper] [code]

2022

  • VTP-OVD: Yanxin Long, Jianhua Han, Runhui Huang, Xu Hang, Yi Zhu, Chunjing Xu, Xiaodan Liang. P3OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. arXiv 2022. [paper]
  • MEDet: Peixian Chen, Kekai Sheng, Mengdan Zhang, Yunhang Shen, Ke Li, Chunhua Shen. Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization. arXiv 2022. [paper] [code]
  • LocOV: Maria A. Bravo, Sudhanshu Mittal, Thomas Brox. Localized Vision-Language Matching for Open-vocabulary Object Detection. DAGM German Conference on Pattern Recognition (GCPR) 2022. [paper] [code]
  • Object-Centric-OVD: Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, Fahad Shahbaz Khan. Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection. NeurIPS 2022. [paper] [code]
  • VL-PLM: Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao, Vijay Kumar B.G, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris Metaxas. Exploiting Unlabeled Data with Vision and Language Models for Object Detection. ECCV 2022. [paper] [code]
  • PromptDet: Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma. PromptDet: Towards Open-vocabulary Detection using Uncurated Images. ECCV 2022. [paper] [code]
  • OpenSeg: Golnaz Ghiasi, Xiuye Gu, Yin Cui, Tsung-Yi Lin. Scaling Open-Vocabulary Image Segmentation with Image-Level Labels. ECCV 2022. [paper] [code]
  • OV-DETR: Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy. Open-Vocabulary DETR with Conditional Matching. ECCV 2022. [paper] [code]
  • PB-OVD: Mingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li, Ran Xu, Wenhao Liu, Caiming Xiong. Open Vocabulary Object Detection with Pseudo Bounding-Box Labels. ECCV 2022. [paper] [code]
  • OWL-ViT: Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby. Simple Open-Vocabulary Object Detection with Vision Transformers. ECCV 2022. [paper] [code]
  • RegionCLIP: Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao. RegionCLIP: Region-Based Language-Image Pretraining. CVPR 2022. [paper] [code]
  • XPM: Dat Huynh, Jason Kuen, Zhe Lin, Jiuxiang Gu, Ehsan Elhamifar. Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling. CVPR 2022. [paper] [code]
  • HierKD: Zongyang Ma, Guan Luo, Jin Gao, Liang Li, Yuxin Chen, Shaoru Wang, Congxuan Zhang, Weiming Hu. Open-Vocabulary One-Stage Detection With Hierarchical Visual-Language Knowledge Distillation. CVPR 2022. [paper] [code]
  • DetPro: Yu Du, Fangyun Wei, Zihe Zhang, Miaojing Shi, Yue Gao, Guoqi Li. Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model. CVPR 2022. [paper] [code]
  • ViLD: Xiuye Gu, Tsung-Yi Lin, Weicheng Kuo, Yin Cui. Open-vocabulary Object Detection via Vision and Language Knowledge Distillation. ICLR 2022. [paper] [code]

2021

  • OVR-CNN: Alireza Zareian, Kevin Dela Rosa, Derek Hao Hu, Shih-Fu Chang. Open-Vocabulary Object Detection Using Captions. CVPR 2021. [paper] [code]
