已更新 3 h ago

Organization

vLLM 的公共 GitHub 足迹

@vllm-project

在 GitHub 上查看个人资料

公共仓库

110,891

总星标

3,436

关注者

vllm-project 是一个在 GitHub 上活跃的组织，专注于大规模语言模型的推理和服务。该组织的公共代码库涵盖多种编程语言，包括 Python、C++、Rust 和 Go，拥有一系列广泛使用的项目，如 vllm、vllm-omni 和 aibrix，展示了其在人工智能领域的贡献。

顶级语言

Python 21C++ 3Rust 3Go 2HTML 2TypeScript 2JavaScript 1Shell 1

公共仓库

vllm

★82,765

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

已更新 2026年6月13日

vllm-omni

★5,130

A framework for efficient model inference with omni-modality models

Python

已更新 2026年6月13日

aibrix

★4,875

Cost-efficient and pluggable Infrastructure components for GenAI inference

已更新 2026年6月13日

semantic-router

★4,349

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

已更新 2026年6月13日

llm-compressor

★3,392

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python

已更新 2026年6月13日

production-stack

★2,401

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python

已更新 2026年6月13日

vllm-ascend

★2,237

Community maintained hardware plugin for vLLM on Ascend

C++

已更新 2026年6月13日

vllm-metal

★1,315

Community maintained hardware plugin for vLLM on Apple Silicon

Python

已更新 2026年6月13日

guidellm

★1,252

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python

已更新 2026年6月13日

recipes

★846

Common recipes to run vLLM

JavaScript

已更新 2026年6月13日

speculators

★515

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python

已更新 2026年6月13日

tpu-inference

★350

TPU inference for vLLM, with unified JAX and PyTorch support.

Python

已更新 2026年6月13日

compressed-tensors

★292

A safetensors extension to efficiently store sparse quantized tensors on disk

Python

已更新 2026年6月13日

router

★267

A high-performance and light-weight router for vLLM large scale deployment

Rust

已更新 2026年6月11日

vime

★234

An LLM post-training framework with vLLM for RL Scaling

Python

已更新 2026年6月13日

flash-attention

★125

Fast and memory-efficient exact attention

Python

已更新 2026年6月13日

vllm-skills

★84

Agent skills for vLLM

Shell

已更新 2026年6月13日

vllm-openvino

★54

此仓库未提供描述。

Python

已更新 2026年5月22日

vllm-daily

★51

vLLM Daily Summarization of Merged PRs

未知语言

已更新 2026年6月13日

vllm-xpu-kernels

★47

The vLLM XPU kernels for Intel GPU

C++

已更新 2026年6月13日

vllm-project.github.io

★45

此仓库未提供描述。

HTML

已更新 2026年6月13日

ci-infra

★43

This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

HCL

已更新 2026年6月12日

vllm-gaudi

★40

Community maintained hardware plugin for vLLM on Intel Gaudi

Python

已更新 2026年6月12日

agentic-api

★33

Stateful API logic for agentic applications using vLLM

Rust

已更新 2026年6月11日

vllm-neuron

★31

Community maintained hardware plugin for vLLM on AWS Neuron

Python

已更新 2026年5月29日

dllm-plugin

★21

vLLM plugin for block-based diffusion language model (dLLM) support

Python

已更新 2026年6月10日

vllm-nccl

★18

Manages vllm-nccl dependency

Python

已更新 2026年4月14日

FlashMLA

★14

此仓库未提供描述。

C++

已更新 2026年6月1日

bart-plugin

★12

vLLM Model plugin for the encoder-decoder BART model

Python

已更新 2026年6月3日

vLLM-in-PyTorch-Conference-2025

★11

此仓库未提供描述。

未知语言

已更新 2026年5月26日

media-kit

★9

vLLM Logo Assets

未知语言

已更新 2026年5月27日

vllm-project.github.io-static

★9

此仓库未提供描述。

HTML

已更新 2025年11月26日

vllm-gguf-plugin

★8

vLLM Quantization plugin for GGUF

Python

已更新 2026年6月13日

perf-eval

★7

Performance benchmark & accuracy evaluation for vLLM

Python

已更新 2026年6月12日

vllm-dashboard

★4

此仓库未提供描述。

TypeScript

已更新 2026年6月11日

perf-dashboard

★3

Performance dashboard for vLLM

Python

已更新 2026年6月11日

vllm-bnb-plugin

★1

vLLM Quantization plugin for bitsandbytes

Python

已更新 2026年6月9日

rfcs

★1

此仓库未提供描述。

未知语言

已更新 2025年6月4日

MSA

★0

此仓库未提供描述。

未知语言

已更新 2026年6月11日

DeepGEMM

★0

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda

已更新 2026年6月5日

vllm-docs

★0

此仓库未提供描述。

TypeScript

已更新 2026年5月21日

llm-multimodal

★0

Standalone fork of llm-multimodal from SMG

Rust

已更新 2026年5月20日

常见问题

vllm-project 在 GitHub 上构建了什么？

vllm-project 在 GitHub 上构建了一系列与大规模语言模型相关的项目，主要包括推理和服务引擎，以及高效的模型推理框架。这些项目旨在为不同的应用场景提供解决方案。

vllm-project 使用哪些编程语言？

vllm-project 主要使用 Python、C++、Rust 和 Go 等编程语言。这些语言的多样性反映了该组织在构建高效和可扩展系统方面的能力。

vllm-project 的代码库是公开的吗？

是的，vllm-project 的所有代码库都是公开的。这意味着任何人都可以访问、审计和贡献这些项目，从而促进了开源社区的发展。

这种曝光是有意的吗？

使用 RepoGuard 监控 vLLM，并在新公共仓库出现的瞬间提醒您。

监控此账户