已更新 2 h ago

Organization

IDEA-Research 的公共 GitHub 足迹

@IDEA-Research

在 GitHub 上查看个人资料

The International Digital Economy Academy (“IDEA”).

China

公共仓库

57,671

总星标

2,910

关注者

IDEA-Research在GitHub上有着丰富的公开存在，专注于数字经济领域的研究。该组织的主要编程语言包括Python、Jupyter Notebook和TypeScript，其知名项目如Grounded-Segment-Anything和GroundingDINO在学术界和工业界都得到了广泛应用，展示了其在视觉识别和对象检测方面的技术实力。

顶级语言

Python 36Jupyter Notebook 4TypeScript 2C++ 1HTML 1

公共仓库

Grounded-Segment-Anything

★17,633

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook

已更新 2026年6月13日

GroundingDINO

★10,253

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python

已更新 2026年6月13日

Grounded-SAM-2

★3,585

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook

已更新 2026年6月12日

DINO

★2,816

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python

已更新 2026年6月13日

DWPose

★2,752

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Python

已更新 2026年6月13日

T-Rex

★2,681

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python

已更新 2026年6月8日

detrex

★2,295

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python

已更新 2026年6月5日

MaskDINO

★1,538

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python

已更新 2026年6月13日

Rex-Omni

★1,441

[CVPR2026] Detect Anything via Next Point Prediction

Jupyter Notebook

已更新 2026年6月12日

awesome-detection-transformer

★1,401

Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)

未知语言

已更新 2026年5月26日

DINO-X-API

★1,388

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python

已更新 2026年6月12日

Grounding-DINO-1.5-API

★1,123

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python

已更新 2026年6月10日

Motion-X

★870

[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"

Python

已更新 2026年6月12日

X-Pose

★808

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Python

已更新 2026年6月4日

OSX

★793

[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"

Python

已更新 2026年6月12日

OpenSeeD

★759

[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

Python

已更新 2026年6月10日

DN-DETR

★605

[CVPR 2022 Oral] Official implementation of DN-DETR

Python

已更新 2026年4月19日

DAB-DETR

★579

[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"

Jupyter Notebook

已更新 2026年5月25日

MotionLLM

★386

[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos

Python

已更新 2026年6月4日

HumanTOMATO

★363

[ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation

Python

已更新 2026年5月28日

HumanSD

★307

[ICCV 2023] The official implementation of paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"

Python

已更新 2026年4月11日

HumanArt

★282

[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"

未知语言

已更新 2026年5月27日

TAPTR

★280

[ECCV 2024 & NeurIPS 2024 & ICLR 2026] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3

未知语言

已更新 2026年6月12日

deepdataspace

★263

The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.

TypeScript

已更新 2026年5月19日

Stable-DINO

★242

[ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"

Python

已更新 2026年5月27日

ChatRex

★214

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python

已更新 2026年6月11日

Lite-DETR

★209

[CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"

Python

已更新 2026年4月18日

DreamWaltz

★190

[NeurIPS 2023] Official implementation of the paper "DreamWaltz: Make a Scene with Complex 3D Animatable Avatars".

Python

已更新 2026年5月6日

ED-Pose

★188

[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation "

Python

已更新 2026年5月20日

3D-deformable-attention

★186

[ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"

Python

已更新 2026年6月12日

RexSeek

★183

[ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark

Python

已更新 2026年5月15日

Rex-Thinker

★149

[ICLR-2026] Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python

已更新 2026年5月26日

MP-Former

★142

[CVPR 2023] Official implementation of the paper: MP-Former: Mask-Piloted Transformer for Image Segmentation

Python

已更新 2026年4月11日

SceneMaker

★134

[CVPR 2026] Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model"

Python

已更新 2026年6月12日

DINO-X-MCP

★111

Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.

TypeScript

已更新 2026年6月11日