The International Digital Economy Academy (“IDEA”).
49
Public repositories
57,666
Total stars
2,909
Followers
IDEA-Research has a significant public presence on GitHub, showcasing a wide range of repositories primarily in Python, Jupyter Notebook, and TypeScript. Notable projects include Grounded-Segment-Anything, GroundingDINO, and DWPose, focusing on advanced object detection and segmentation techniques, contributing to the field of digital economy research.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
[CVPR2026] Detect Anything via Next Point Prediction
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"
[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"
[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
[CVPR 2022 Oral] Official implementation of DN-DETR
[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"
[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos
[ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation
[ICCV 2023] The official implementation of paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"
[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"
[ECCV 2024 & NeurIPS 2024 & ICLR 2026] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3
The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.
[ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"
Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
[CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"
[NeurIPS 2023] Official implementation of the paper "DreamWaltz: Make a Scene with Complex 3D Animatable Avatars".
[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation "
[ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"
[ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark
[ICLR-2026] Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning
[CVPR 2023] Official implementation of the paper: MP-Former: Mask-Piloted Transformer for Image Segmentation
[CVPR 2026] Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model"
Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.
[ICCV 2023] Official implementation of the paper "Neural Interactive Keypoint Detection"
Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"
[AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".
Related code, checkpoints and project page for V-Reflection
[AAAI 2026] Official implementation of the paper ”SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features“
No description provided for this repository.
[ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"
Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch
[Arxiv 2024] MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms
Official implementation of the paper "SegVGGT: Joint 3D Reconstruction and Instance Segmentation from Multi-View Images"
No description provided for this repository.
No description provided for this repository.
No description provided for this repository.
IDEA-Research builds various projects on GitHub, focusing on object detection and segmentation. Their repositories include significant works like Grounded-Segment-Anything and GroundingDINO, which are used for advanced visual recognition tasks.
IDEA-Research primarily uses Python, Jupyter Notebook, TypeScript, C++, and HTML in their public repositories. This diverse selection of languages supports their complex projects related to digital economy research.
Yes, all of IDEA-Research's repositories are public on GitHub. This transparency allows other researchers and developers to access their work, fostering collaboration and innovation in the field of digital economy and visual recognition.
Monitor IDEA-Research with RepoGuard and get alerted the moment a new public repository appears.
Monitor this account