🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Bahasa Tidak Dikenal

Diperbarui 13 Apr 2026

Model-Optimizer

★0

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

Bahasa Tidak Dikenal

Diperbarui 6 Mei 2026

evalscope

★0

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python

Diperbarui 10 Apr 2026

transformers

★0

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Bahasa Tidak Dikenal

Diperbarui 7 Apr 2026

gpustack

★0

GPU cluster manager for optimized AI model deployment

Bahasa Tidak Dikenal

Diperbarui 8 Des 2025

sglang-npu

★0

SGLang is a fast serving framework for large language models and vision language models.

Bahasa Tidak Dikenal

Diperbarui 12 Agu 2025

Apakah paparan ini dimaksudkan?

Pantau kvcache.ai dengan RepoGuard dan dapatkan pemberitahuan saat repositori publik baru muncul.

Pantau akun ini