Skip to content
@IDEA-Research

IDEA-Research

The International Digital Economy Academy (“IDEA”).

Pinned Loading

  1. Grounded-Segment-Anything Public

    Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

    Jupyter Notebook 15.9k 1.5k

  2. detrex Public

    detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

    Python 2.1k 220

  3. GroundingDINO Public

    [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

    Python 7.6k 771

  4. OpenSeeD Public

    [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

    Python 692 43

  5. MaskDINO Public

    [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

    Python 1.3k 116

  6. DINO Public

    [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

    Python 2.4k 276

Repositories

Showing 10 of 41 repositories
  • RexSeek Public

    Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark

    Python 34 1 0 0 Updated Mar 12, 2025
  • Motion-X Public

    [NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"

    Python 647 18 76 0 Updated Mar 3, 2025
  • ChatRex Public

    Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

    Python 164 8 7 0 Updated Jan 24, 2025
  • Grounding-DINO-1.5-API Public

    Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

    Python 911 Apache-2.0 34 34 0 Updated Jan 21, 2025
  • DINO-X-API Public

    DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

    Python 918 Apache-2.0 38 18 0 Updated Jan 21, 2025
  • Grounded-SAM-2 Public

    Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

    Jupyter Notebook 1,828 Apache-2.0 174 35 1 Updated Dec 21, 2024
  • TAPTR Public

    [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3

    258 14 0 0 Updated Dec 13, 2024
  • HandOSweb Public
    HTML 1 0 0 0 Updated Dec 3, 2024
  • 3 Apache-2.0 0 0 0 Updated Oct 24, 2024
  • T-Rex Public

    [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

    Python 2,419 160 9 0 Updated Oct 21, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.