Aktualisiert vor 3 h

Organization

Öffentlicher GitHub-Footprint von vLLM

@vllm-project

Profil auf GitHub ansehen

Öffentliche Repositories

110.891

Sterne gesamt

3.436

Follower

Das vllm-project ist eine Organisation auf GitHub, die eine Vielzahl von öffentlichen Repositories pflegt, darunter bedeutende Projekte wie vllm und vllm-omni. Die Hauptprogrammiersprachen, die in diesen Repositories verwendet werden, sind Python, C++, Rust und Go, was auf die Vielseitigkeit der entwickelten Technologien hinweist.

Top-Sprachen

Python 21C++ 3Rust 3Go 2HTML 2TypeScript 2JavaScript 1Shell 1

Öffentliche Repositories

vllm

★82.765

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

Aktualisiert 13. Juni 2026

vllm-omni

★5.130

A framework for efficient model inference with omni-modality models

Python

Aktualisiert 13. Juni 2026

aibrix

★4.875

Cost-efficient and pluggable Infrastructure components for GenAI inference

Aktualisiert 13. Juni 2026

semantic-router

★4.349

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Aktualisiert 13. Juni 2026

llm-compressor

★3.392

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python

Aktualisiert 13. Juni 2026

production-stack

★2.401

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python

Aktualisiert 13. Juni 2026

vllm-ascend

★2.237

Community maintained hardware plugin for vLLM on Ascend

C++

Aktualisiert 13. Juni 2026

vllm-metal

★1.315

Community maintained hardware plugin for vLLM on Apple Silicon

Python

Aktualisiert 13. Juni 2026

guidellm

★1.252

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python

Aktualisiert 13. Juni 2026

recipes

★846

Common recipes to run vLLM

JavaScript

Aktualisiert 13. Juni 2026

speculators

★515

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python

Aktualisiert 13. Juni 2026

tpu-inference

★350

TPU inference for vLLM, with unified JAX and PyTorch support.

Python

Aktualisiert 13. Juni 2026

compressed-tensors

★292

A safetensors extension to efficiently store sparse quantized tensors on disk

Python

Aktualisiert 13. Juni 2026

router

★267

A high-performance and light-weight router for vLLM large scale deployment

Rust

Aktualisiert 11. Juni 2026

vime

★234

An LLM post-training framework with vLLM for RL Scaling

Python

Aktualisiert 13. Juni 2026

flash-attention

★125

Fast and memory-efficient exact attention

Python

Aktualisiert 13. Juni 2026

vllm-skills

★84

Agent skills for vLLM

Shell

Aktualisiert 13. Juni 2026

vllm-openvino

★54

Keine Beschreibung für dieses Repository vorhanden.

Python

Aktualisiert 22. Mai 2026

vllm-daily

★51

vLLM Daily Summarization of Merged PRs

Unbekannte Sprache

Aktualisiert 13. Juni 2026

vllm-xpu-kernels

★47

The vLLM XPU kernels for Intel GPU

C++

Aktualisiert 13. Juni 2026

vllm-project.github.io

★45

Keine Beschreibung für dieses Repository vorhanden.

HTML

Aktualisiert 13. Juni 2026

ci-infra

★43

This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

HCL

Aktualisiert 12. Juni 2026

vllm-gaudi

★40

Community maintained hardware plugin for vLLM on Intel Gaudi

Python

Aktualisiert 12. Juni 2026

agentic-api

★33

Stateful API logic for agentic applications using vLLM

Rust

Aktualisiert 11. Juni 2026

vllm-neuron

★31

Community maintained hardware plugin for vLLM on AWS Neuron

Python

Aktualisiert 29. Mai 2026

dllm-plugin

★21

vLLM plugin for block-based diffusion language model (dLLM) support

Python

Aktualisiert 10. Juni 2026

vllm-nccl

★18

Manages vllm-nccl dependency

Python

Aktualisiert 14. Apr. 2026

FlashMLA

★14

Keine Beschreibung für dieses Repository vorhanden.

C++

Aktualisiert 1. Juni 2026

bart-plugin

★12

vLLM Model plugin for the encoder-decoder BART model

Python

Aktualisiert 3. Juni 2026

vLLM-in-PyTorch-Conference-2025

★11

Keine Beschreibung für dieses Repository vorhanden.

Unbekannte Sprache

Aktualisiert 26. Mai 2026

media-kit

★9

vLLM Logo Assets

Unbekannte Sprache

Aktualisiert 27. Mai 2026

vllm-project.github.io-static

★9

Keine Beschreibung für dieses Repository vorhanden.

HTML

Aktualisiert 26. Nov. 2025

vllm-gguf-plugin

★8

vLLM Quantization plugin for GGUF

Python

Aktualisiert 13. Juni 2026

perf-eval

★7

Performance benchmark & accuracy evaluation for vLLM

Python

Aktualisiert 12. Juni 2026

vllm-dashboard

★4

Keine Beschreibung für dieses Repository vorhanden.

TypeScript

Aktualisiert 11. Juni 2026

perf-dashboard

★3

Performance dashboard for vLLM

Python

Aktualisiert 11. Juni 2026

vllm-bnb-plugin

★1

vLLM Quantization plugin for bitsandbytes

Python

Aktualisiert 9. Juni 2026

rfcs

★1

Keine Beschreibung für dieses Repository vorhanden.

Unbekannte Sprache

Aktualisiert 4. Juni 2025

MSA

★0

Keine Beschreibung für dieses Repository vorhanden.

Unbekannte Sprache

Aktualisiert 11. Juni 2026

DeepGEMM

★0

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda

Aktualisiert 5. Juni 2026

vllm-docs

★0

Keine Beschreibung für dieses Repository vorhanden.

TypeScript

Aktualisiert 21. Mai 2026

llm-multimodal

★0

Standalone fork of llm-multimodal from SMG

Rust

Aktualisiert 20. Mai 2026

Häufige Fragen

Was entwickelt das vllm-project auf GitHub?

Das vllm-project entwickelt eine Vielzahl von Tools für die effiziente Nutzung von großen Sprachmodellen (LLMs), einschließlich Frameworks wie vllm und vllm-omni, die sich auf Modellinferenz und -bereitstellung konzentrieren.

Welche Programmiersprachen verwendet das vllm-project?

Das vllm-project nutzt mehrere Programmiersprachen, hauptsächlich Python, C++, Rust und Go, um effiziente und leistungsstarke Softwarelösungen für die Verarbeitung von LLMs zu entwickeln.

Sind die Repositories des vllm-project öffentlich?

Ja, alle Repositories des vllm-project sind öffentlich zugänglich, was es Entwicklern und Interessierten ermöglicht, die Projekte einzusehen, beizutragen und von den veröffentlichten Ressourcen zu lernen.

Ist diese Sichtbarkeit gewollt?

Überwache vLLM mit RepoGuard und werde benachrichtigt, sobald ein neues öffentliches Repository auftaucht.

Diesen Account überwachen