refreshing…

Organization

Public GitHub footprint of IDEA-Research

@IDEA-Research

View profile on GitHub

The International Digital Economy Academy (“IDEA”).

China

Public repositories

57,666

Total stars

2,909

Followers

IDEA-Research has a significant public presence on GitHub, showcasing a wide range of repositories primarily in Python, Jupyter Notebook, and TypeScript. Notable projects include Grounded-Segment-Anything, GroundingDINO, and DWPose, focusing on advanced object detection and segmentation techniques, contributing to the field of digital economy research.

Top languages

Python 36Jupyter Notebook 4TypeScript 2C++ 1HTML 1

Public repositories

Grounded-Segment-Anything

★17,633

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook

Updated Jun 12, 2026

GroundingDINO

★10,252

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python

Updated Jun 12, 2026

Grounded-SAM-2

★3,585

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook

Updated Jun 12, 2026

DINO

★2,814

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python

Updated Jun 12, 2026

DWPose

★2,751

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Python

Updated Jun 11, 2026

T-Rex

★2,681

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python

Updated Jun 8, 2026

detrex

★2,295

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python

Updated Jun 5, 2026

MaskDINO

★1,537

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python

Updated Jun 10, 2026

Rex-Omni

★1,441

[CVPR2026] Detect Anything via Next Point Prediction

Jupyter Notebook

Updated Jun 12, 2026

awesome-detection-transformer

★1,401

Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)

Unknown Language

Updated May 26, 2026

DINO-X-API

★1,388

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python

Updated Jun 12, 2026

Grounding-DINO-1.5-API

★1,123

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python

Updated Jun 10, 2026

Motion-X

★870

[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"

Python

Updated Jun 12, 2026

X-Pose

★808

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Python

Updated Jun 4, 2026

OSX

★793

[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"

Python

Updated Jun 12, 2026

OpenSeeD

★759

[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

Python

Updated Jun 10, 2026

DN-DETR

★605

[CVPR 2022 Oral] Official implementation of DN-DETR

Python

Updated Apr 19, 2026

DAB-DETR

★579

[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"

Jupyter Notebook

Updated May 25, 2026

MotionLLM

★386

[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos

Python

Updated Jun 4, 2026

HumanTOMATO

★363

[ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation

Python

Updated May 28, 2026

HumanSD

★307

[ICCV 2023] The official implementation of paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"

Python

Updated Apr 11, 2026

HumanArt

★282

[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"

Unknown Language

Updated May 27, 2026

TAPTR

★280

[ECCV 2024 & NeurIPS 2024 & ICLR 2026] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3

Unknown Language

Updated Jun 12, 2026

deepdataspace

★263

The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.

TypeScript

Updated May 19, 2026

Stable-DINO

★242

[ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"

Python

Updated May 27, 2026

ChatRex

★214

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python

Updated Jun 11, 2026

Lite-DETR

★209

[CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"

Python

Updated Apr 18, 2026

DreamWaltz

★190

[NeurIPS 2023] Official implementation of the paper "DreamWaltz: Make a Scene with Complex 3D Animatable Avatars".

Python

Updated May 6, 2026

ED-Pose

★188

[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation "

Python

Updated May 20, 2026

3D-deformable-attention

★186

[ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"

Python

Updated Jun 12, 2026

RexSeek

★183

[ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark

Python

Updated May 15, 2026

Rex-Thinker

★149

[ICLR-2026] Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python

Updated May 26, 2026

MP-Former

★142

[CVPR 2023] Official implementation of the paper: MP-Former: Mask-Piloted Transformer for Image Segmentation

Python

Updated Apr 11, 2026

SceneMaker

★134

[CVPR 2026] Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model"

Python

Updated Jun 12, 2026

DINO-X-MCP

★111

Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.

TypeScript

Updated Jun 11, 2026

Click-Pose

★88

[ICCV 2023] Official implementation of the paper "Neural Interactive Keypoint Detection"

Python

Updated May 14, 2026

DiffHOI

★68

Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"

Python

Updated May 5, 2026

DQ-DETR

★59

[AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding

Unknown Language

Updated Apr 11, 2026

DisCo-CLIP

★59

Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".

Python

Updated Apr 11, 2026

V-Reflection

★58

Related code, checkpoints and project page for V-Reflection

Python

Updated May 29, 2026

SegDINO3D

★57

[AAAI 2026] Official implementation of the paper ”SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features“

Python

Updated Jun 9, 2026

LipsFormer

★44

No description provided for this repository.

Python

Updated Apr 11, 2026

TOSS

★24

[ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"

Python

Updated Apr 11, 2026

hana

★18

Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch

Python

Updated Apr 11, 2026

MotionCLR

★17

[Arxiv 2024] MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms

Python

Updated May 25, 2026

SegVGGT

★14

Official implementation of the paper "SegVGGT: Joint 3D Reconstruction and Instance Segmentation from Multi-View Images"

Python

Updated Jun 10, 2026

IYFC

★10

No description provided for this repository.

C++

Updated Apr 11, 2026

detrex-storage

★4

No description provided for this repository.

Unknown Language

Updated Apr 11, 2026

HandOSweb

★2

No description provided for this repository.

HTML

Updated Apr 11, 2026

Frequently asked questions

What does IDEA-Research build on GitHub?

IDEA-Research builds various projects on GitHub, focusing on object detection and segmentation. Their repositories include significant works like Grounded-Segment-Anything and GroundingDINO, which are used for advanced visual recognition tasks.

Which programming languages does IDEA-Research use?

IDEA-Research primarily uses Python, Jupyter Notebook, TypeScript, C++, and HTML in their public repositories. This diverse selection of languages supports their complex projects related to digital economy research.

Are IDEA-Research's repositories public?

Yes, all of IDEA-Research's repositories are public on GitHub. This transparency allows other researchers and developers to access their work, fostering collaboration and innovation in the field of digital economy and visual recognition.

Is this exposure intended?

Monitor IDEA-Research with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account