Add Grounding DINO #2114

innat · 2023-10-23T07:31:21Z

Short Description

Zero-shot object detection model.

Papers

https://arxiv.org/abs/2303.05499

Existing Implementations

https://github.com/IDEA-Research/GroundingDINO

Other Information

Combined with (a) Stable Diffusion or (b) Segment Anything, etc., the application possibilities are huge.
Prerequisites:
- image backbone: Swin Transformer
- text backbone: BERT

innat · 2024-01-15T12:30:29Z

TODO
Components

Swin Transformer
Multi-scale Deformable Attention (references: official-gdino, mmcv, official)
DeformableTransformerEncoder/DecoderLayer
BiAttentionBlock (bidirectional MHA: text->image, image->text)
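
For reference, the idea behind the bidirectional attention component can be sketched in a few lines. This is a minimal single-head illustration in plain Python, not the official implementation: it assumes no learned projections, no masking, and a simple residual update, and the names `bi_attention`, `image_feats`, and `text_feats` are hypothetical. It computes one shared similarity matrix and normalizes it along each axis to get the two attention directions.

```python
import math


def softmax(row):
    # Numerically stable softmax over a list of floats.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def transpose(a):
    return [list(col) for col in zip(*a)]


def matmul(a, b):
    # (n x k) @ (k x m) on nested lists.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def bi_attention(image_feats, text_feats):
    """Toy bidirectional cross-attention (hypothetical helper).

    image_feats: N x d list of lists, text_feats: M x d list of lists.
    Returns (updated_image_feats, updated_text_feats).
    """
    d = len(image_feats[0])
    scale = 1.0 / math.sqrt(d)
    # Shared N x M similarity matrix between image and text tokens.
    scores = [[s * scale for s in row]
              for row in matmul(image_feats, transpose(text_feats))]
    # Each image token attends over all text tokens (rows sum to 1).
    image_over_text = [softmax(row) for row in scores]
    # Each text token attends over all image tokens (rows sum to 1).
    text_over_image = [softmax(row) for row in transpose(scores)]
    image_update = matmul(image_over_text, text_feats)   # N x d
    text_update = matmul(text_over_image, image_feats)   # M x d
    # Residual connections (an assumption of this sketch).
    new_image = [[a + b for a, b in zip(r, u)]
                 for r, u in zip(image_feats, image_update)]
    new_text = [[a + b for a, b in zip(r, u)]
                for r, u in zip(text_feats, text_update)]
    return new_image, new_text


image = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # 3 image tokens, d = 2
text = [[2.0, 3.0]]                           # 1 text token, d = 2
new_image, new_text = bi_attention(image, text)
```

A real layer would add multiple heads, learned query/key/value projections, and padding masks for the text tokens.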

tirthasheshpatel · 2024-02-01T19:22:00Z

@innat Are you planning/volunteering to work on this or any of the components?

I see you proposed #2319, which seems like a replication of the Swin Transformer from the Grounding DINO implementation. Thanks for the PR!

This is next on my TODO list. Let me know if you want to take up something if you have time, I can help review and test! I can take the rest of the components and weights transfer. BTW the list of components with references is very useful, thanks!

innat · 2024-02-02T04:27:58Z

@tirthasheshpatel
#2319 is about video-swin modelling, and I think Grounding DINO (g-dino) needs the image-swin model, so this issue needs to progress first as a prerequisite of the current issue. Here is one reimplementation of the image-swin model in Keras 2.

The components listed above are some of the high-level components of g-dino. Like DETR, it also has custom CUDA operations, which might complicate adding it. But the other components can be added one by one initially. If you are currently working on it, please continue. If I can manage some time, I will contribute the rest of the components. This kind of model (zero-shot detection) is quite useful and will surely add value to keras-cv.
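
On the custom CUDA point: assuming the operation in question is the multi-scale deformable attention kernel, its core step is bilinear sampling of a feature map at a reference point plus learned offsets, followed by a weighted sum. That step can be written with ordinary ops. Below is a minimal pure-Python reference for a single query, single head, single scale, and single channel. The helper names, the normalized-coordinate convention (`pixel = x * W - 0.5`, zeros outside the map), and the assumption that offsets are already in normalized units are all choices of this sketch, not taken from the official code.

```python
import math


def bilinear_sample(feature_map, x, y):
    """Sample an H x W map at normalized (x, y) in [0, 1]; zeros outside the map."""
    h, w = len(feature_map), len(feature_map[0])
    # Map normalized coordinates to pixel-center coordinates.
    px, py = x * w - 0.5, y * h - 0.5
    x0, y0 = math.floor(px), math.floor(py)
    fx, fy = px - x0, py - y0
    value = 0.0
    # Blend the four neighbouring pixels by their bilinear weights.
    for yi, wy in ((y0, 1.0 - fy), (y0 + 1, fy)):
        for xi, wx in ((x0, 1.0 - fx), (x0 + 1, fx)):
            if 0 <= yi < h and 0 <= xi < w:
                value += wy * wx * feature_map[yi][xi]
    return value


def deformable_attention_point(feature_map, reference, offsets, weights):
    """Weighted sum of samples taken at reference + offset_k (hypothetical helper).

    reference: (x, y) in normalized coordinates.
    offsets:   list of (dx, dy), assumed already normalized.
    weights:   attention weights, assumed to sum to 1.
    """
    rx, ry = reference
    return sum(
        wk * bilinear_sample(feature_map, rx + dx, ry + dy)
        for (dx, dy), wk in zip(offsets, weights)
    )


feature_map = [[0.0, 1.0],
               [2.0, 3.0]]
out = deformable_attention_point(
    feature_map,
    reference=(0.5, 0.5),
    offsets=[(-0.25, -0.25), (0.25, 0.25)],
    weights=[0.5, 0.5],
)
```

The full operation repeats this per head and per feature level and sums the results; a framework version would vectorize the sampling rather than loop in Python.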

ianstenbit added models size:L labels Oct 23, 2023

tirthasheshpatel self-assigned this Oct 23, 2023
