Updated 3 h ago

Organization

vLLM's public GitHub footprint

@vllm-project


Public repositories

Total stars: 110,891

Followers: 3,436

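vllm-project is an organization that maintains a range of public repositories on GitHub. Its main programming languages are Python, C++, Rust, Go, HTML, and TypeScript, and its notable projects include vllm, vllm-omni, and aibrix. These projects provide tooling for efficient inference and deployment of LLMs.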

Top languages

Python 21, C++ 3, Rust 3, Go 2, HTML 2, TypeScript 2, JavaScript 1, Shell 1

Public repositories

vllm

★82,765

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

Updated June 13, 2026

vllm-omni

★5,130

A framework for efficient model inference with omni-modality models

Python

Updated June 13, 2026

aibrix

★4,875

Cost-efficient and pluggable Infrastructure components for GenAI inference

Updated June 13, 2026

semantic-router

★4,349

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Updated June 13, 2026

llm-compressor

★3,392

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python

Updated June 13, 2026

production-stack

★2,401

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python

Updated June 13, 2026

vllm-ascend

★2,237

Community maintained hardware plugin for vLLM on Ascend

C++

Updated June 13, 2026

vllm-metal

★1,315

Community maintained hardware plugin for vLLM on Apple Silicon

Python

Updated June 13, 2026

guidellm

★1,252

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python

Updated June 13, 2026

recipes

★846

Common recipes to run vLLM

JavaScript

Updated June 13, 2026

speculators

★515

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python

Updated June 13, 2026

tpu-inference

★350

TPU inference for vLLM, with unified JAX and PyTorch support.

Python

Updated June 13, 2026

compressed-tensors

★292

A safetensors extension to efficiently store sparse quantized tensors on disk

Python

Updated June 13, 2026

router

★267

A high-performance and light-weight router for vLLM large scale deployment

Rust

Updated June 11, 2026

vime

★234

An LLM post-training framework with vLLM for RL Scaling

Python

Updated June 13, 2026

flash-attention

★125

Fast and memory-efficient exact attention

Python

Updated June 13, 2026

vllm-skills

★84

Agent skills for vLLM

Shell

Updated June 13, 2026

vllm-openvino

★54

No description provided for this repository.

Python

Updated May 22, 2026

vllm-daily

★51

vLLM Daily Summarization of Merged PRs

Unknown language

Updated June 13, 2026

vllm-xpu-kernels

★47

The vLLM XPU kernels for Intel GPU

C++

Updated June 13, 2026

vllm-project.github.io

★45

No description provided for this repository.

HTML

Updated June 13, 2026

ci-infra

★43

This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

HCL

Updated June 12, 2026

vllm-gaudi

★40

Community maintained hardware plugin for vLLM on Intel Gaudi

Python

Updated June 12, 2026

agentic-api

★33

Stateful API logic for agentic applications using vLLM

Rust

Updated June 11, 2026

vllm-neuron

★31

Community maintained hardware plugin for vLLM on AWS Neuron

Python

Updated May 29, 2026

dllm-plugin

★21

vLLM plugin for block-based diffusion language model (dLLM) support

Python

Updated June 10, 2026

vllm-nccl

★18

Manages vllm-nccl dependency

Python

Updated April 14, 2026

FlashMLA

★14

No description provided for this repository.

C++

Updated June 1, 2026

bart-plugin

★12

vLLM Model plugin for the encoder-decoder BART model

Python

Updated June 3, 2026

vLLM-in-PyTorch-Conference-2025

★11

No description provided for this repository.

Unknown language

Updated May 26, 2026

media-kit

★9

vLLM Logo Assets

Unknown language

Updated May 27, 2026

vllm-project.github.io-static

★9

No description provided for this repository.

HTML

Updated November 26, 2025

vllm-gguf-plugin

★8

vLLM Quantization plugin for GGUF

Python

Updated June 13, 2026

perf-eval

★7

Performance benchmark & accuracy evaluation for vLLM

Python

Updated June 12, 2026

vllm-dashboard

★4

No description provided for this repository.

TypeScript

Updated June 11, 2026

perf-dashboard

★3

Performance dashboard for vLLM

Python

Updated June 11, 2026

vllm-bnb-plugin

★1

vLLM Quantization plugin for bitsandbytes

Python

Updated June 9, 2026

rfcs

★1

No description provided for this repository.

Unknown language

Updated June 4, 2025

MSA

★0

No description provided for this repository.

Unknown language

Updated June 11, 2026

DeepGEMM

★0

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda

Updated June 5, 2026

vllm-docs

★0

No description provided for this repository.

TypeScript

Updated May 21, 2026

llm-multimodal

★0

Standalone fork of llm-multimodal from SMG

Rust

Updated May 20, 2026

Frequently asked questions

What does vllm-project develop on GitHub?

vllm-project develops an inference and serving engine for LLMs, along with a range of model inference frameworks and infrastructure components. These projects are widely used in AI and machine learning.

Which programming languages does vllm-project use?

vllm-project's main programming languages are Python, C++, Rust, Go, HTML, and TypeScript. Python is by far the most common, as the primary language of 21 repositories, followed by C++ and Rust with 3 each.

Are vllm-project's repositories public?

Yes, all of the vllm-project repositories listed here are public, so anyone can explore the projects and contribute to them.
