RepoGuard
refreshing…
OpenBMB

Organization

Public GitHub footprint of OpenBMB

@OpenBMB
View profile on GitHub

OpenBMB (Open Lab for Big Model Base) aims to build foundation models and systems towards AGI.

78

Public repositories

146,795

Total stars

6,624

Followers

OpenBMB maintains a substantial public GitHub presence, focusing on foundation models and systems aimed at advancing artificial general intelligence (AGI). The organization develops a wide range of repositories primarily in Python, JavaScript, and TypeScript, with notable projects such as ChatDev and VoxCPM showcasing their commitment to innovation in AI technologies.

Top languages

Python 57JavaScript 4TypeScript 4HTML 3Cuda 2C++ 2Jupyter Notebook 1Swift 1

Public repositories

ChatDev

33,381

ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration

Python
Updated Jun 13, 2026

VoxCPM

28,668

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python
Updated Jun 13, 2026

MiniCPM-V

25,607

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

Python
Updated Jun 13, 2026

MiniCPM

9,439

MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.

Jupyter Notebook
Updated Jun 13, 2026

XAgent

8,529

An Autonomous LLM Agent for Complex Task Solving

Python
Updated Jun 10, 2026

ToolBench

5,665

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python
Updated Jun 12, 2026

UltraRAG

5,583

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Python
Updated Jun 13, 2026

AgentVerse

5,054

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

JavaScript
Updated Jun 12, 2026

PilotDeck

3,220

Task-oriented AI Agent productivity platform

TypeScript
Updated Jun 13, 2026

BMTools

2,772

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python
Updated Jun 11, 2026

CPM-Bee

2,405

百亿参数的中英文双语基座大模型

Python
Updated Jun 11, 2026

AgentCPM-GUI

1,375

AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python
Updated Jun 9, 2026

EdgeClaw

1,222

EdgeClaw: Edge-Cloud Collaborative Personal AI Assistant based on OpenClaw

TypeScript
Updated Jun 12, 2026

VisCPM

1,068

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python
Updated May 12, 2026

RepoAgent

986

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Python
Updated Jun 13, 2026

VisRAG

964

Parsing-free RAG supported by VLMs

Python
Updated Jun 11, 2026

ProAgent

864

An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation

Python
Updated Jun 8, 2026

IoA

821

An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.

Python
Updated Jun 10, 2026

AgentCPM

800

An End-to-End Infrastructure for Training and Evaluating Various LLM Agents

Python
Updated Jun 8, 2026

BMTrain

624

Efficient Training (including pre-training and fine-tuning) for Big Models

Python
Updated May 26, 2026

BMInf

585

Efficient Inference for Big Models

Python
Updated May 31, 2026

DeepThinkVLA

525

DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models

Python
Updated Jun 5, 2026

CPM-Live

500

Live Training for Open-source Big Models

Python
Updated May 23, 2026

InfiniteBench

386

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python
Updated May 25, 2026

UltraFeedback

368

A large-scale, fine-grained, diverse preference dataset (and models).

Python
Updated May 25, 2026

BMList

345

A List of Big Models

Python
Updated Mar 20, 2026

Eurus

323

No description provided for this repository.

Python
Updated Jun 10, 2026

MiniCPM-V-Apps

306

MiniCPM-V apps — fully offline multimodal chat on iOS / Android / HarmonyOS

Swift
Updated Jun 12, 2026

UltraEval-Audio

303

Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。

Python
Updated Jun 11, 2026

AppCopilot

293

A General, Accurate, Long-Horizon, and Efficient Mobile Agent driven by Multimodal Foundation Models

Python
Updated Jun 11, 2026

BMPrinciples

285

A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future

Unknown Language
Updated May 7, 2026

MiniCPM-Desk-Pet

279

a local-first desktop pet powered by MiniCPM5

JavaScript
Updated Jun 13, 2026

ModelCenter

271

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python
Updated May 26, 2026

UltraEval

258

[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.

Python
Updated Mar 19, 2026

CPM.cu

241

CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge techniques in sparse architecture, speculative sampling and quantization.

Cuda
Updated Jun 2, 2026

RAGEval

233

No description provided for this repository.

Python
Updated Jun 8, 2026

MiniCPM-o-Demo

232

Official PyTorch+CUDA Full-functional Web Demo for MiniCPM-o 4.5

Python
Updated Jun 13, 2026

ForgeTrain

230

No description provided for this repository.

Python
Updated Jun 13, 2026

RLPR

203

Extrapolating RLVR to General Domains without Verifiers

Python
Updated May 19, 2026

OlympiadBench

196

[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python
Updated Jun 8, 2026

WorkflowLLM

192

An open platform for enhancing the capability of LLMs in workflow orchestration.

Python
Updated Jun 10, 2026

ClawXRouter

188

No description provided for this repository.

TypeScript
Updated Jun 12, 2026

BMCook

169

Model Compression for Big Models

Python
Updated Jun 4, 2026

infllmv2_cuda_impl

102

No description provided for this repository.

Python
Updated May 30, 2026

MobileCPM

83

A Toolkit for Running On-device Large Language Models (LLMs) in APP

C++
Updated May 15, 2026

Tell_Me_More

66

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python
Updated Jun 4, 2026

ParamMute

58

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

Python
Updated May 25, 2026

DecT

53

Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding

Python
Updated May 24, 2026

ClawXMemory

44

ClawXMemory: A Multi-Level Memory Plugin for OpenClaw with Long-Term Context

TypeScript
Updated Jun 8, 2026

MoRE

43

[SIGIR '26] Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation

Python
Updated Jun 5, 2026

ArcLight

37

ArcLight: A Lightweight LLM Inference Framework

C++
Updated Jun 10, 2026

MetaMem

32

[ACL '26] This is the code repo for our ACL '26 Findings paper "MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization"

Python
Updated Jun 11, 2026

CPO

29

No description provided for this repository.

Python
Updated Apr 22, 2026

UltraLink

28

An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

Python
Updated Feb 8, 2026

DEBATER

27

This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search".

Python
Updated May 25, 2026

cpm_kernels

26

No description provided for this repository.

Python
Updated Feb 8, 2026

RAG-DDR

24

This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".

Python
Updated Feb 8, 2026

XAgent-doc

21

Document for XAgent.

Unknown Language
Updated Feb 23, 2026

ConsJudge

19

No description provided for this repository.

Python
Updated Mar 23, 2026

OpenAct

16

No description provided for this repository.

HTML
Updated Feb 19, 2026

BMInf-demos

16

BMInf demos.

JavaScript
Updated Feb 8, 2026

sglang

15

SGLang is a fast serving framework for large language models and vision language models.

Unknown Language
Updated May 2, 2026

SciCore-Mol

11

SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules

Python
Updated Jun 5, 2026

OmniEvalKit

10

OmniEvalKit is an evaluation framework designed for omni-modal large language models, with a focus on audio and audio-visual understanding. Based on OmniEvalKit, you can quickly reproduce benchmarks, implement your own models or datasets, and conduct fair comparisons with other open-source models. MiniCPM-o is evaluated using this framework.

Python
Updated Jun 7, 2026

DecorateLM

10

No description provided for this repository.

Python
Updated Feb 8, 2026

Scicore-Omics

9

SciCore-Omics the first tri-modal foundation model linking histology images, spatial transcriptomics, and biological language.

Python
Updated Jun 11, 2026

RaD-Agent

9

The official implementation of the Rational Decision-Making Agent with Internalized Utility Judgment

Python
Updated Jun 1, 2026

Omni-DuplexEval

8

No description provided for this repository.

Python
Updated Jun 4, 2026

General-Model-License

8

No description provided for this repository.

Unknown Language
Updated Feb 8, 2026

PAGER

6

No description provided for this repository.

Python
Updated Mar 22, 2026

AceBench

5

No description provided for this repository.

Python
Updated Jun 10, 2026

voxcpm2-demopage

5

No description provided for this repository.

JavaScript
Updated Jun 7, 2026

SOAR-Toolkit

5

No description provided for this repository.

Python
Updated Mar 22, 2026

VoxCPM-demopage

4

Demo page for VoxCPM

HTML
Updated Apr 12, 2026

Locret

4

No description provided for this repository.

Python
Updated Feb 8, 2026

HerculesBench

3

No description provided for this repository.

Python
Updated Feb 8, 2026

sparse_kernel

1

No description provided for this repository.

Cuda
Updated Mar 22, 2026

openbmb.github.io

0

No description provided for this repository.

HTML
Updated Feb 20, 2026

Frequently asked questions

What does OpenBMB build on GitHub?

OpenBMB builds a variety of projects on GitHub, focusing on foundation models and systems for artificial general intelligence. Their notable repositories include ChatDev, VoxCPM, and MiniCPM, which contribute to advancements in AI research and applications.

Which programming languages does OpenBMB use?

OpenBMB primarily uses Python, JavaScript, TypeScript, HTML, Cuda, and C++ for their development work. These languages support their diverse range of projects, including machine learning and AI-focused applications.

Are OpenBMB's repositories public?

Yes, OpenBMB's repositories are public on GitHub. This transparency allows others in the community to access and collaborate on their projects, facilitating knowledge sharing and innovation in the field of AI.

Is this exposure intended?

Monitor OpenBMB with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account