You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@innat Are you planning/volunteering to work on this or any of the components?
I see you proposed #2319 which seems like a replication of the SWIN transformer of the Grounding DINO implementation. Thanks for the PR!
This is next on my TODO list. Let me know if you want to take up something if you have time, I can help review and test! I can take the rest of the components and weights transfer. BTW the list of components with references is very useful, thanks!
@tirthasheshpatel
The #2319 is about video-swin modelling, and I think the grounding-dino (g-dino) needs image-swin model, so this issue needs to be progressed first as a prerequisite of current issue. Here is one of the reimplementation of image-swin model in keras 2.
The above components of g-dino are some of high level components. But same as DETR, it also has custom cuda operations which might make complication to add. But other compoents can be added one by one initially. If you are currently working on it, please continue. If I could manage some time, I will contribute rest of the components. This kind of model (zsl detection) is quite useful and surly it will add value to keras-cv.
Short Description
Zero-shot object detection model.
Papers
https://arxiv.org/abs/2303.05499
Existing Implementations
https://github.com/IDEA-Research/GroundingDINO
Other Information
The text was updated successfully, but these errors were encountered: