DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Example models using DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
此仓库未提供描述。
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.