The sgl-project organization has a notable public GitHub presence with a variety of repositories primarily focusing on Python, Go, and C++. Key projects include SGLang, a framework for large language models, and mini-sglang, which simplifies complex LLM serving systems. Their repositories serve as valuable resources in the field of machine learning and model serving.
SGLang is a high-performance serving framework for large language models and multimodal models.
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
Materials for learning SGLang
SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
JAX backend for SGL
A workload for deploying LLM inference services on Kubernetes
SGLang kernel library for NPU
This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
SGLang kernel library for Intel XPU
SGLang Kernel Wheel Index
Fast and memory-efficient exact attention
SGLang wheels for multiple platforms
Cookbook of SGLang - Recipe
No description provided for this repository.
No description provided for this repository.
No description provided for this repository.
Fast Hadamard transform in CUDA, with a PyTorch interface
No description provided for this repository.
No description provided for this repository.
No description provided for this repository.
The test files for SGLang.
FlashMLA: Efficient Multi-head Latent Attention Kernels
No description provided for this repository.
sgl-project develops a range of repositories focused on model serving frameworks, including SGLang and mini-sglang. Their work facilitates the deployment and training of large language models and includes tools for speculative decoding and benchmarking.
sgl-project primarily uses Python, Go, and C++. Other languages used in their repositories include C++, Jupyter Notebook, and Cuda. This diverse set of languages supports their focus on machine learning and model serving technologies.
Yes, all of sgl-project's repositories are public on GitHub. This transparency allows users and developers to access their projects, contribute to their development, and utilize the resources provided for learning and implementation.
Monitor sgl-project with RepoGuard and get alerted the moment a new public repository appears.
Monitor this account