refreshing…

Organization

Public GitHub footprint of OpenBMB

@OpenBMB

View profile on GitHub

OpenBMB (Open Lab for Big Model Base) aims to build foundation models and systems towards AGI.

Public repositories

146,795

Total stars

6,624

Followers

OpenBMB maintains a substantial public GitHub presence, focusing on foundation models and systems aimed at advancing artificial general intelligence (AGI). The organization develops a wide range of repositories primarily in Python, JavaScript, and TypeScript, with notable projects such as ChatDev and VoxCPM showcasing their commitment to innovation in AI technologies.

Top languages

Python 57JavaScript 4TypeScript 4HTML 3Cuda 2C++ 2Jupyter Notebook 1Swift 1

Public repositories

ChatDev

★33,381

ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration

Python

Updated Jun 13, 2026

VoxCPM

★28,668

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python

Updated Jun 13, 2026

MiniCPM-V

★25,607

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

Python

Updated Jun 13, 2026

MiniCPM

★9,439

MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.

Jupyter Notebook

Updated Jun 13, 2026

XAgent

★8,529

An Autonomous LLM Agent for Complex Task Solving

Python

Updated Jun 10, 2026

ToolBench

★5,665

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python

Updated Jun 12, 2026

UltraRAG

★5,583

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Python

Updated Jun 13, 2026

AgentVerse

★5,054

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

JavaScript

Updated Jun 12, 2026

PilotDeck

★3,220

Task-oriented AI Agent productivity platform

TypeScript

Updated Jun 13, 2026

BMTools

★2,772

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python

Updated Jun 11, 2026

CPM-Bee

★2,405

百亿参数的中英文双语基座大模型

Python

Updated Jun 11, 2026

AgentCPM-GUI

★1,375

AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python

Updated Jun 9, 2026

EdgeClaw

★1,222

EdgeClaw: Edge-Cloud Collaborative Personal AI Assistant based on OpenClaw

TypeScript

Updated Jun 12, 2026

VisCPM

★1,068

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python

Updated May 12, 2026

RepoAgent

★986

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Python

Updated Jun 13, 2026

VisRAG

★964

Parsing-free RAG supported by VLMs

Python

Updated Jun 11, 2026

ProAgent

★864

An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation

Python

Updated Jun 8, 2026

IoA

★821

An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.

Python

Updated Jun 10, 2026

AgentCPM

★800

An End-to-End Infrastructure for Training and Evaluating Various LLM Agents

Python

Updated Jun 8, 2026

BMTrain

★624

Efficient Training (including pre-training and fine-tuning) for Big Models

Python

Updated May 26, 2026

BMInf

★585

Efficient Inference for Big Models

Python

Updated May 31, 2026

DeepThinkVLA

★525

DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models

Python

Updated Jun 5, 2026

CPM-Live

★500

Live Training for Open-source Big Models

Python

Updated May 23, 2026

InfiniteBench

★386

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python

Updated May 25, 2026

UltraFeedback

★368

A large-scale, fine-grained, diverse preference dataset (and models).

Python

Updated May 25, 2026

BMList

★345

A List of Big Models

Python

Updated Mar 20, 2026

Eurus

★323

No description provided for this repository.

Python

Updated Jun 10, 2026

MiniCPM-V-Apps

★306

MiniCPM-V apps — fully offline multimodal chat on iOS / Android / HarmonyOS

Swift

Updated Jun 12, 2026

UltraEval-Audio

★303

Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测，知己知彼。

Python

Updated Jun 11, 2026

AppCopilot

★293

A General, Accurate, Long-Horizon, and Efficient Mobile Agent driven by Multimodal Foundation Models

Python

Updated Jun 11, 2026

BMPrinciples

★285

A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future

Unknown Language

Updated May 7, 2026

MiniCPM-Desk-Pet

★279

a local-first desktop pet powered by MiniCPM5

JavaScript

Updated Jun 13, 2026

ModelCenter

★271

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python

Updated May 26, 2026

UltraEval

★258

[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.

Python

Updated Mar 19, 2026

CPM.cu

★241

CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge techniques in sparse architecture, speculative sampling and quantization.

Cuda

Updated Jun 2, 2026

RAGEval

★233

No description provided for this repository.

Python

Updated Jun 8, 2026

MiniCPM-o-Demo

★232

Official PyTorch+CUDA Full-functional Web Demo for MiniCPM-o 4.5

Python

Updated Jun 13, 2026

ForgeTrain

★230

No description provided for this repository.

Python

Updated Jun 13, 2026

RLPR

★203

Extrapolating RLVR to General Domains without Verifiers

Python

Updated May 19, 2026

OlympiadBench

★196

[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python

Updated Jun 8, 2026

WorkflowLLM

★192

An open platform for enhancing the capability of LLMs in workflow orchestration.

Python

Updated Jun 10, 2026

ClawXRouter

★188

No description provided for this repository.

TypeScript

Updated Jun 12, 2026

BMCook

★169

Model Compression for Big Models

Python

Updated Jun 4, 2026

infllmv2_cuda_impl

★102

No description provided for this repository.

Python

Updated May 30, 2026

MobileCPM

★83

A Toolkit for Running On-device Large Language Models (LLMs) in APP

C++

Updated May 15, 2026

Tell_Me_More

★66

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python

Updated Jun 4, 2026

ParamMute

★58

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

Python

Updated May 25, 2026

DecT

★53

Source code for ACL 2023 paper Decoder Tuning: Efﬁcient Language Understanding as Decoding

Python

Updated May 24, 2026

ClawXMemory

★44

ClawXMemory: A Multi-Level Memory Plugin for OpenClaw with Long-Term Context

TypeScript

Updated Jun 8, 2026

★43

[SIGIR '26] Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation

Python

Updated Jun 5, 2026

ArcLight

★37

ArcLight: A Lightweight LLM Inference Framework

C++

Updated Jun 10, 2026

MetaMem

★32

[ACL '26] This is the code repo for our ACL '26 Findings paper "MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization"

Python

Updated Jun 11, 2026

CPO

★29

No description provided for this repository.

Python

Updated Apr 22, 2026

UltraLink

★28

An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

Python

Updated Feb 8, 2026

DEBATER

★27

This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search".

Python

Updated May 25, 2026

cpm_kernels

★26

No description provided for this repository.

Python

Updated Feb 8, 2026

RAG-DDR

★24

This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".

Python

Updated Feb 8, 2026

XAgent-doc

★21

Document for XAgent.

Unknown Language

Updated Feb 23, 2026

ConsJudge

★19

No description provided for this repository.

Python

Updated Mar 23, 2026

OpenAct

★16

No description provided for this repository.

HTML

Updated Feb 19, 2026

BMInf-demos

★16

BMInf demos.

JavaScript

Updated Feb 8, 2026

sglang

★15

SGLang is a fast serving framework for large language models and vision language models.

Unknown Language

Updated May 2, 2026

SciCore-Mol

★11

SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules

Python

Updated Jun 5, 2026

OmniEvalKit

★10

OmniEvalKit is an evaluation framework designed for omni-modal large language models, with a focus on audio and audio-visual understanding. Based on OmniEvalKit, you can quickly reproduce benchmarks, implement your own models or datasets, and conduct fair comparisons with other open-source models. MiniCPM-o is evaluated using this framework.

Python

Updated Jun 7, 2026

DecorateLM

★10

No description provided for this repository.

Python

Updated Feb 8, 2026

Scicore-Omics

★9

SciCore-Omics the first tri-modal foundation model linking histology images, spatial transcriptomics, and biological language.

Python

Updated Jun 11, 2026

RaD-Agent

★9

The official implementation of the Rational Decision-Making Agent with Internalized Utility Judgment

Python

Updated Jun 1, 2026

Omni-DuplexEval

★8

No description provided for this repository.

Python

Updated Jun 4, 2026

General-Model-License

★8

No description provided for this repository.

Unknown Language

Updated Feb 8, 2026

PAGER

★6

No description provided for this repository.

Python

Updated Mar 22, 2026

AceBench

★5

No description provided for this repository.

Python

Updated Jun 10, 2026

voxcpm2-demopage

★5

No description provided for this repository.

JavaScript

Updated Jun 7, 2026

SOAR-Toolkit

★5

No description provided for this repository.

Python

Updated Mar 22, 2026

VoxCPM-demopage

★4

Demo page for VoxCPM

HTML

Updated Apr 12, 2026

Locret

★4

No description provided for this repository.

Python

Updated Feb 8, 2026

HerculesBench

★3

No description provided for this repository.

Python

Updated Feb 8, 2026

sparse_kernel

★1

No description provided for this repository.

Cuda

Updated Mar 22, 2026

openbmb.github.io

★0

No description provided for this repository.

HTML

Updated Feb 20, 2026

Frequently asked questions

What does OpenBMB build on GitHub?

OpenBMB builds a variety of projects on GitHub, focusing on foundation models and systems for artificial general intelligence. Their notable repositories include ChatDev, VoxCPM, and MiniCPM, which contribute to advancements in AI research and applications.

Which programming languages does OpenBMB use?

OpenBMB primarily uses Python, JavaScript, TypeScript, HTML, Cuda, and C++ for their development work. These languages support their diverse range of projects, including machine learning and AI-focused applications.

Are OpenBMB's repositories public?

Yes, OpenBMB's repositories are public on GitHub. This transparency allows others in the community to access and collaborate on their projects, facilitating knowledge sharing and innovation in the field of AI.

Is this exposure intended?

Monitor OpenBMB with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account

Public GitHub footprint of OpenBMB

Top languages

Public repositories

ChatDev

VoxCPM

MiniCPM-V

MiniCPM

XAgent

ToolBench

UltraRAG

AgentVerse

PilotDeck

BMTools

CPM-Bee

AgentCPM-GUI

EdgeClaw

VisCPM

RepoAgent

VisRAG

ProAgent

IoA

AgentCPM

BMTrain

BMInf

DeepThinkVLA

CPM-Live

InfiniteBench

UltraFeedback

BMList

Eurus

MiniCPM-V-Apps

UltraEval-Audio

AppCopilot

BMPrinciples

MiniCPM-Desk-Pet

ModelCenter

UltraEval

CPM.cu

RAGEval

MiniCPM-o-Demo

ForgeTrain

RLPR

OlympiadBench

WorkflowLLM

ClawXRouter

BMCook

infllmv2_cuda_impl

MobileCPM

Tell_Me_More

ParamMute

DecT

ClawXMemory

MoRE

ArcLight

MetaMem

CPO

UltraLink

DEBATER

cpm_kernels

RAG-DDR

XAgent-doc

ConsJudge

OpenAct

BMInf-demos

sglang

SciCore-Mol

OmniEvalKit

DecorateLM

Scicore-Omics

RaD-Agent

Omni-DuplexEval

General-Model-License

PAGER

AceBench

voxcpm2-demopage

SOAR-Toolkit

VoxCPM-demopage

Locret

HerculesBench

sparse_kernel