IDEA-Research

The International Digital Economy Academy (“IDEA”).

Pinned

  1. Grounded-Segment-Anything Public

    Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

    Jupyter Notebook 16.5k 1.5k

  2. detrex Public

    detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

    Python 2.2k 226

  3. GroundingDINO Public

    [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

    Python 8.3k 837

  4. OpenSeeD Public

    [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

    Python 715 46

  5. MaskDINO Public

    [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

    Python 1.3k 127

  6. DINO Public

    [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

    Python 2.5k 284

Repositories

Showing 10 of 44 repositories
  • DINO-X-MCP Public

    Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.

    TypeScript 11 Apache-2.0 0 0 0 Updated Jun 23, 2025
  • DINO-X-API Public

    DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

    Python 1,092 Apache-2.0 44 25 1 Updated Jun 20, 2025
  • Rex-Thinker Public

    Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning

    Python 42 2 2 0 Updated Jun 9, 2025
  • Grounded-SAM-2 Public

    Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

    Jupyter Notebook 2,342 Apache-2.0 249 41 3 Updated May 27, 2025
  • T-Rex Public

    [ECCV 2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

    Python 2,519 163 13 0 Updated Apr 22, 2025
  • RexSeek Public

    Refers to any person or object given a natural-language description. Codebase for RexSeek and the HumanRef benchmark.

    Python 137 9 7 0 Updated Apr 14, 2025
  • 3D-deformable-attention Public

    [ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"

    Python 162 4 1 0 Updated Apr 12, 2025
  • HandOSweb Public
    HTML 1 0 0 0 Updated Mar 19, 2025
  • Motion-X Public

    [NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"

    Python 704 21 76 0 Updated Mar 3, 2025
  • ChatRex Public

    Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

    Python 187 6 9 0 Updated Jan 24, 2025
