A simple Python API (built on top of TensorFlow) for neural image captioning with MSCOCO data.
Updated Aug 30, 2021 - Python
COCOA: Semantic Amodal Segmentation for huggingface datasets
COCO-Stuff dataset for huggingface datasets
A deep-learning object detection project pre-trained on the COCO dataset
FQDet: Fast-converging Query-based Detector
Microsoft COCO: Common Objects in Context for huggingface datasets
Source code of our KDD 2024 paper "Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning"
Object Detection Dataset Format Converter
A helper library for easily converting MSCOCO format data using the loading script of huggingface datasets.
Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models
Image caption generator using a pretrained ResNet-50 and an LSTM architecture. Trained on the COCO 2017 dataset and accessible via a Streamlit app.
PyTorch implementation of SSD: Single Shot MultiBox Detector.
Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"
LabelMe to MSCOCO, Pascal VOC, and YOLO format converter
An ongoing research project on image entropy assessment using machine learning.